I love that everyone is making their own TTS model, since they are not as expensive to train as many other models. There are also plenty of different architectures.

Another recent example: https://github.com/supertone-inc/supertonic



In-browser demo of Supertonic with WASM:

https://huggingface.co/spaces/Supertone/supertonic-2


Another one is Soprano-1.1.

It seems like it is being trained by one person, and it is surprisingly natural for such a small model.

I remember when TTS always meant the most robotic, barely comprehensible voices.

https://www.reddit.com/r/LocalLLaMA/comments/1qcusnt/soprano...

https://huggingface.co/ekwek/Soprano-1.1-80M


Thank you. Very good suggestion, with code available and bindings for so many languages.


Thanks for the heads-up, this looks really interesting and the claimed speed is nuts.



