Hacker News new | past | comments | ask | show | jobs | submit login

It is not open source, it's just open weight (which is an artifact instead of source) and open "recipe". They do not make their training / serving code available.

If you started to copy what they released in May immediately after release (DeepSeek-V2, which already contained non-trivial architecture innovation - MLA), you'd likely have slightly inferior but mostly on par optimized implementation maybe after some months. And here you go: DeepSeek-V3, try to play the catch up game again!

If you don't replicate their engineering work then your cost would be 10x~20x higher, which renders the entire point moot.

As long as the team can continue this trend there is no hope for copycats. And they are trying to "hijack" the mind of chip designers, too, see the "suggestions to chip manufactures" section. If they succeed you need to beat them in their own game.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: