Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Pretty cool, what they need is to build a tool that can take any model to chip in short a time as possible. How quick can they give me DeepSeek, Kimi, Qwen or GLM on a chip? I'll take 5k tk/sec for those!


also imagine it will cost 300$/unit, we all will host our own set of models locally, dream dream




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: