Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The vocabulary size is fairly small (128,256) for a multilingual model. I would guess it doesn't require many additional parameters to support these 5 languages as many tokens can be shared.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: