Part of the reason their API is so cheap is that they explicitly state they are going to train on your API data. OpenAI and Anthropic say they won't if you use their APIs (if you use ChatGPT, that's a different story). There are no free lunches.
This comment is misleading. There is a "free lunch" here in the sense that serving this model at scale is far cheaper than serving worse open-source models.
Yes, they are probably more willing to go down on price because of this, but the architecture is open, and they are charging similarly to a 30B-50B dense model, which is roughly the number of active parameters DeepSeek-V3 has.
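To put rough numbers on the active-parameter point: per-token inference compute scales with active, not total, parameters, at roughly 2 FLOPs per active parameter per token. A back-of-envelope sketch (the 2N rule of thumb and the 40B dense comparison point are my assumptions; the 671B-total / 37B-active figures are DeepSeek-V3's published sizes):

```python
def flops_per_token(active_params: float) -> float:
    """Approximate forward-pass FLOPs per generated token (~2 per active param)."""
    return 2 * active_params

dense_40b = flops_per_token(40e9)  # a hypothetical 40B dense model
moe_v3 = flops_per_token(37e9)     # DeepSeek-V3: 671B total, 37B active per token

ratio = moe_v3 / dense_40b
print(f"MoE vs 40B dense compute ratio: {ratio:.3f}")  # ~0.925
```

So on compute alone, pricing it like a 30B-50B dense model is roughly what you'd expect, before any discount from training on user data.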
It's a matter of degree. If 90% of the cost savings come from a new, smarter architecture, it doesn't make sense to point to the API terms as the reason it's so cheap.