> Deepfake technology is not sophisticated enough to mimic an entire phone call ...

crazygringo · on July 27, 2020

Voice imitation isn't just timbre.

It's prosody, rhythm, accent, word choice, and so on.

By the time you've mastered all those, you're practically halfway to becoming a professional voiceover artist.

Remember, trained voiceover artists have been mimicking voices for a long, long time. Their timbre isn't always perfect, but faking voices doesn't need deepfakes.

Enginerrrd · on July 27, 2020

If you're looking for it, sure. But I'm willing to bet existing technology is sufficient to catch an awful lot of people off guard. Hearing a familiar voice is usually quite disarming.

seesawtron · on July 27, 2020

Indeed, text-to-speech conversion works extremely fast even in the browser (e.g. in Colab) once you get your model tuned right.

jcims · on July 27, 2020

Kids in 2020 are going to be doing prank phone calls with GPT-3 and voice models from untraceable SIP phone numbers.

holoduke · on July 27, 2020

And Facebook is currently generating billions of voice profiles based on recorded WhatsApp sessions. Soon these profiles will be sold to advertising agencies.

personjerry · on July 27, 2020

No, WhatsApp messages and calls are end-to-end encrypted.

jcims · on July 27, 2020

Sounds like a conspiracy theory but those aren’t mutually exclusive.

smegma2 · on July 27, 2020

Is this true? On Google I only see mentions of using voice data to improve their speech transcribing.