
Together.ai seems to be the best; it's incredibly fast.


Not so sure about that. Check out https://github.com/ray-project/llmperf-leaderboard

And try Mixtral on chat.groq.com.


These guys are much faster than OpenRouter, and their Llama 2 runs faster than GPT-3.5 Turbo. Amazing work.



