I'm as high as a kite on this stuff and have to be, but I'm not sure you're actually using ex. vision API.
Also, Whisper isn't lower WER than Google unfortunately or even close, and that I know for a fact, I designed & implemented both the server/client side of the last big Assistant audio format change, and also the UI for the New Google Assistantâ„¢, i.e. Google's first offline model.
Whisper is still really good, even Whisper Tiny, and I'm happy to ship it.
Absolutely not.
I'm as high as a kite on this stuff and have to be, but I'm not sure you're actually using ex. vision API.
Also, Whisper isn't lower WER than Google unfortunately or even close, and that I know for a fact, I designed & implemented both the server/client side of the last big Assistant audio format change, and also the UI for the New Google Assistantâ„¢, i.e. Google's first offline model.
Whisper is still really good, even Whisper Tiny, and I'm happy to ship it.