Hacker News new | past | comments | ask | show | jobs | submit login

> Deepfake technology is not sophisticated enough to mimic an entire phone call with someone.

With modern voice conversion technology it is perfectly possible actually.




Voice imitation isn't just timbre.

It's prosody, rhythm, accent, word choice, and so on.

By the time you've mastered all those, you're practically halfway to becoming a professional voiceover artist.

Remember, trained voiceover artists have been mimicking voices for a long, long time. Their timbre isn't always perfect, but faking voices doesn't need deepfakes.


If you're looking for it sure. But I'm willing to bet existing technology is sufficient to catch an awful lot of people off guard. Hearing a familiar voice is usually quite disarming.


indeed, the text to speech conversation works extremely fast even in browser (like Colab) once you get your model tuned right.


Kids in 2020 are going to be doing prank phone calls with GPT-3 and voice models from untraceable SIP phone numbers.


and Facebook is currently generating billions of voice profiles based on recorded WhatsApp sessions. Soon these profiles are sold to advertising agencies.


No, WhatsApp messages and calls are end-to-end encrypted.


Sounds like a conspiracy theory but those aren’t mutually exclusive.


Is this true? On Google I only see mentions of using voice data to improve their speech transcribing.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: