The implementation of the Web Speech API usually involves the browser vendor calling out to its own proprietary, cloud-based TTS API. I say "usually" because, for a time, Microsoft used the local Windows Speech API in Edge, but I believe they've stopped doing that and have largely deprecated Windows Speech in favor of Azure Speech even at the OS level.
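You can actually poke at this from the page side: SpeechSynthesisVoice exposes a localService flag that's supposed to indicate whether a voice is synthesized on-device or over the network, though how honestly each browser reports it is another matter. Rough TypeScript sketch:

    // Log every voice the browser exposes and whether it claims to run on-device.
    function listVoices(): void {
      for (const voice of window.speechSynthesis.getVoices()) {
        // localService is true for on-device voices, false for network-backed ones
        console.log(
          voice.name + " (" + voice.lang + "): " +
          (voice.localService ? "local" : "remote/cloud")
        );
      }
    }

    // Some browsers populate the voice list asynchronously, so listen for
    // voiceschanged as well as calling once up front.
    window.speechSynthesis.addEventListener("voiceschanged", listVoices);
    listVoices();

The voiceschanged listener matters because some browsers return an empty list until the voice inventory has actually loaded.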
Just to be clear, are you really saying that text-to-speech is server-hosted and not on-device for Windows?
You could do text-to-speech on a 1 MHz Apple //e using the 1-bit speaker back in the '80s (Software Automated Mouth), and MacInTalk was built into the Mac in 1984. I know TTS is built into both Mac and iOS devices and runs offline.
But I do see how cross-platform browsers like Firefox would want a built-in solution that doesn't depend on the OS vendor.
If an application is still using the deprecated Microsoft Speech API (SAPI), synthesis is happening locally, but that API hasn't received updates in like a decade, and its output is considerably lower quality than what people expect to hear today.
Firefox on Windows is one such application: it still uses SAPI. I don't know what it uses on other operating systems. On Android, I imagine it uses whatever the built-in OS TTS API is, which likely goes through Google's cloud.
But anything that sounds at all natural, from any of the OS or browser vendors, is going through some cloud TTS API now.
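That said, if you want synthesis to stay on-device (privacy, offline use), you can at least ask for it. A sketch, assuming the browser reports localService honestly; speakLocally is just a name I made up:

    // Speak with an on-device voice if one exists; otherwise fall back
    // to the browser's default voice.
    function speakLocally(text: string): void {
      const synth = window.speechSynthesis;
      const utterance = new SpeechSynthesisUtterance(text);
      // localService === true is the spec's signal for an on-device voice
      const localVoice = synth.getVoices().find((v) => v.localService);
      if (localVoice) {
        utterance.voice = localVoice;
      }
      synth.speak(utterance);
    }

    speakLocally("This should stay on-device, if the flag is truthful.");

Whether the local voice you end up with sounds natural is, per the above, a different story.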