When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.
Apaper out of Cornell Universityused VALL-E to synthesize several voices.
Some examples of the work areavailable on GitHub.

The voice samples shared by Microsoft range in quality.
While some of them sound natural, others are clearly machine-generated and sound robotic.
Additionally, VALL-E only uses 3-second recordings as a prompt.

If the technology was used with a larger sample set, it could undoubtedly create more realistic samples.
As artificial intelligence becomes more powerful, the voices generated by VALL-E and similar technologies will become more convincing.
Politicians and other public figures could also be impersonated.

Security concerns also come to mind.
My bank uses my voice as a password when I call.
That beings said, it still makes me uneasy.

There’s a good chance that the arms race will escalate between AI-generated content and AI-detecting software.
While it’s unfortunate to see people lose work, I don’t see a way around this.
That’s just the reality of technology advancing.

In fact, Apple recently announced a feature thatuses AI to read audio books.
Like any technology, VALL-E will be used for good, evil, and everything in between.
Microsoft President Brad Smith has discussed regulating AI in the past (viaGeekWire).

We’ll have to see what measures Microsoft puts in place to regulate the use of VALL-E.






