Hacker News new | past | comments | ask | show | jobs | submit login

They are pretty slow even on GPU. The problem is that it's an autoregressive model. So it needs to do a forward pass for each token.



Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: