Hacker News
Transformer Inference Arithmetic (carolchen.me)
65 points by throwawaybutwhy on April 8, 2022 | 2 comments



This is one of the more practical and useful articles I've read about ML in practice. Going through these calculations feels much more like solid engineering vs the hand-wavy 'use a different model' or 'have you tried model distillation' type performance discussions I see elsewhere.


Wow! So helpful! I love the illustrations.



