Hacker News new | past | comments | ask | show | jobs | submit login

They might be interested in integrating Vosk, it's a speech-to-text engine that is just a shared library (.so file on Linux) and comes with API support for a variety of languages:

https://alphacephei.com/vosk/

https://github.com/alphacep/vosk-api

Still, I've found that the Big players have much better recognition models, and the post-processing that I assume they do (grammatical, maybe syntactical inferences that improve the end result) are probably much more powerful too.




Yes Vosk will definitely be part of Leon when the focus will be on implementing new voice solutions.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: