Hacker News new | past | comments | ask | show | jobs | submit login
Petals runs Llama 2 (70B) from Colab at 5 tokens/sec (github.com/bigscience-workshop)
5 points by borzunov on July 19, 2023 | hide | past | favorite | 3 comments




We've moved to a new domain, the chat is now at https://chat.petals.dev


Great project and I'm happy to see it expand to more models!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: