Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Model 4 bit inference 4.2x faster than 16 bit with full HF support (twitter.com/tim_dettmers)
2 points by convexstrictly on July 11, 2023 | past | 1 comment
99% ChatGPT performance on a consumer GPU (twitter.com/tim_dettmers)
5 points by dragonwriter on May 25, 2023 | past
Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance (twitter.com/tim_dettmers)
10 points by rkwasny on May 24, 2023 | past | 1 comment
Tim Dettmers: QLoRA finetunes a 65B model on a single 48 GB GPU (twitter.com/tim_dettmers)
3 points by convexstrictly on May 12, 2023 | past | 1 comment
Single desktop machine ML up to scales of 176B params (twitter.com/tim_dettmers)
4 points by nl on Aug 11, 2022 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: