Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio


Text-to-speech model can preserve speaker’s emotional tone and acoustic environment.
Read more at Ars Technica…
