Website: https://debabratapruseth.com/
There has been a wave of speculation and awe surrounding how Deepseek, a relatively new player, managed to challenge industry giants like OpenAI with its competitive LLM modelsโachieving comparable performance in a fraction of the time and cost.
One term that frequently comes up in this discussion is “๐ฒ๐๐๐๐๐๐
๐๐ ๐ซ๐๐๐๐๐๐๐๐๐๐๐.” In fact, Microsoft, OpenAI, and even the U.S. government are reportedly investigating whether Deepseek leveraged this technique in ways that might raise ethical or legal concerns.
๐๐ก๐๐ญ ๐ข๐ฌ ๐๐ง๐จ๐ฐ๐ฅ๐๐๐ ๐ ๐๐ข๐ฌ๐ญ๐ข๐ฅ๐ฅ๐๐ญ๐ข๐จ๐ง?
Knowledge distillation is a machine learning technique where a smaller, more efficient model (the ๐ฌ๐ญ๐ฎ๐๐๐ง๐ญ) learns from a larger, highly trained model (the ๐ญ๐๐๐๐ก๐๐ซ), rather than directly from raw data. This allows the student model to retain much of the teacher’s intelligence while being significantly more compact and resource-efficient.
๐๐จ๐ฐ ๐๐จ๐๐ฌ ๐ข๐ญ ๐ฐ๐จ๐ซ๐ค?
1) ๐ป๐๐๐๐ ๐ ๐ฉ๐๐ ๐ป๐๐๐๐๐๐ ๐ด๐๐
๐๐
– A very powerful AI model learns from a huge dataset.
– This model becomes super accurate but is too big and slow
2) ๐ป๐๐ ๐ป๐๐๐๐๐๐ ๐ฎ๐๐๐๐ “๐บ๐๐๐” ๐จ๐๐๐๐๐๐
– Instead of just saying “This is a cat” or “This is a dog,” the teacher gives confidence scores. Example: “Iโm 85% sure itโs a cat and 5% sure itโs a dog”
-These soft answers help the student model understand relationships between different classes.
3) ๐ป๐๐๐๐ ๐ ๐บ๐๐๐๐๐๐ ๐บ๐๐๐
๐๐๐ ๐ด๐๐
๐๐
– The student model is trained using both ๐๐๐๐ ๐
๐๐๐ and the teacherโs ๐๐๐๐ ๐๐๐๐๐๐๐.
– The student learns to mimic the teacher but with fewer resources.
4) ๐ต๐๐ ๐พ๐ ๐ฏ๐๐๐ ๐ ๐ญ๐๐๐๐๐, ๐บ๐๐๐๐๐๐ ๐จ๐ฐ!
– The student model is much smaller and faster
– Even though itโs not as big as the teacher, it performs almost as well.
๐๐ข๐ ๐๐๐๐ฉ๐ฌ๐๐๐ค ๐๐ฌ๐ ๐๐ฉ๐๐ง๐๐ ๐๐ฌ ๐๐ญ๐ฌ “๐๐๐๐๐ก๐๐ซ”?
In this case, the suspected ๐๐๐๐๐๐๐ ๐๐ ๐ถ๐๐๐๐จ๐ฐ, and the ๐๐๐๐
๐๐๐ ๐๐ ๐ซ๐๐๐๐๐๐๐. This raises an important question: Did OpenAI directly assist Deepseek? The answer is noโbut OpenAIโs models and datasets are accessible via APIs that can be purchased. What regulators and industry experts are now scrutinizing is whether Deepseek systematically leveraged OpenAIโs APIs for large-scale knowledge distillation.
In the coming days, we may gain clarity on whether Deepseekโs approach violated ethical AI development guidelinesโor if it simply found a clever way to compete with the biggest names in AI.
Until then, enjoy both Deepseek and OpenAI.
Discover more from Debabrata Pruseth
Subscribe to get the latest posts sent to your email.