Sunday, November 24, 2024

OpenAI Unveils Audio Device That Recreates Human Voices

First, OpenAI provided a device that allowed folks to create digital pictures just by describing what they wished to see. Then, it constructed related expertise that generated full-motion video like one thing from a Hollywood film.

Now, it has unveiled expertise that may recreate somebody’s voice.

The high-profile A.I. start-up mentioned on Friday {that a} small group of companies was testing a brand new OpenAI system, Voice Engine, that may recreate an individual’s voice from a 15-second recording. In the event you add a recording of your self and a paragraph of textual content, it will probably learn the textual content utilizing an artificial voice that appears like yours.

The textual content doesn’t should be in your native language. In case you are an English speaker, for instance, it will probably recreate your voice in Spanish, French, Chinese language or many different languages.

OpenAI isn’t sharing the expertise extra broadly as a result of it’s nonetheless making an attempt to grasp its potential risks. Like picture and video turbines, a voice generator may assist unfold disinformation throughout social media. It may additionally permit criminals to impersonate folks on-line or throughout cellphone calls.

The corporate mentioned it was notably anxious that this type of expertise could possibly be used to interrupt voice authenticators that management entry to on-line banking accounts and different private functions.

“It is a delicate factor, and it is very important get it proper,” an OpenAI product supervisor, Jeff Harris, mentioned in an interview.

The corporate is exploring methods of watermarking artificial voices or including controls that forestall folks from utilizing the expertise with the voices of politicians or different outstanding figures.

Final month, OpenAI took an identical strategy when it unveiled its video generator, Sora. It confirmed off the expertise however didn’t publicly launch it.

OpenAI is among the many many corporations which have developed a brand new breed of A.I. expertise that may shortly and simply generate artificial voices. They embody tech giants like Google in addition to start-ups just like the New York-based ElevenLabs. (The New York Occasions has sued OpenAI and its accomplice, Microsoft, on claims of copyright infringement involving synthetic intelligence techniques that generate textual content.)

Companies can use these applied sciences to generate audiobooks, give voice to on-line chatbots and even construct an automatic radio station DJ. Since final yr, OpenAI has used its expertise to energy a model of ChatGPT that speaks. And it has lengthy provided companies an array of voices that can be utilized for related functions. All of them had been constructed from clips offered by voice actors.

However the firm has not but provided a public device that might permit people and companies to recreate voices from a brief clip as Voice Engine does. The flexibility to recreate any voice on this means, Mr. Harris mentioned, is what makes the expertise harmful. The expertise could possibly be notably harmful in an election yr, he mentioned.

In January, New Hampshire residents acquired robocall messages that dissuaded them from voting within the state major in a voice that was most probably artificially generated to sound like President Biden. The Federal Communications Fee later outlawed such calls.

Mr. Harris mentioned OpenAI had no quick plans to generate profits from the expertise. He mentioned the device could possibly be notably helpful to individuals who misplaced their voices by sickness or accident.

He demonstrated how the expertise had been used to recreate a lady’s voice after mind most cancers broken it. She may now communicate, he mentioned, after offering a quick recording of a presentation she had as soon as made as a excessive schooler.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles