Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Thanks for publishing this. I quickly skimmed the paper, I saw the impressive linear scaling as you scaled to 16 nodes. How long did it take to train the various models in wall clock time?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: