Hacker News new | past | comments | ask | show | jobs | submit login

Google has papers on device speech recognition, these are used in the keyboard & for live caption on Pixel devices.



This article from the Google AI blog about the Gboard speech recognition is really interesting: https://ai.googleblog.com/2019/03/an-all-neural-on-device-sp...


They are trained on a ton of non-public data though, and Iā€™m not sure if pre-trained models are around.


Nope, they aren't available. CC YouTube videos with captions or radio broadcasts + transcripts could prove helpful for multiple languages as well as being able to create a multilingual ASR.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: