Hacker News new | past | comments | ask | show | jobs | submit login

Are you referring to the distilled models?



yes, they are not r1


Can you explain what you mean by this?


For example, the model named "deepseek-r1:8b" by ollama is not a deepseek r1 model. It is actually a fine tune of Meta's Llama 8b, fine tuned on data generated by deepseek r1.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: