Microsoft’s NaturalSpeech 3 clones voices and emotions
Summary NaturalSpeech 3 is Microsoft’s latest text-to-speech system that can clone voices and emotions. Microsoft Research Asia, Azure Speech, and partner universities have developed a new speech synthesis system called NaturalSpeech 3. The system uses a new approach that breaks down speech into different sub-units such as content, prosody, timbre, and acoustic details. The research …
Microsoft’s NaturalSpeech 3 clones voices and emotions Read More »