News

OpenAI Voice Engine Synthesizer: Can Re-Create Human Voice?

By Alexander Johnson - Mar 31, 2024 | Updated On: 31 March, 2024 | 2 min read

By Alexander Johnson , 2 min read - Mar 31, 2024

Updated On: 31 March, 2024

OpenAI Voice Engine Synthesizer: Can Re-Create Human Voice?

OpenAI Voice Engine Synthesizer. Image Credit: Social Media.

One more groundbreaking innovation from the OpenAI! While Hollywood is on alert after AI model Sora’s groundbreaking videos, the U.S.-based AI research organization has raised everyone’s eyebrows again.

As per the recent update, the OpenAI voice engine synthesizer is the next unique AI product in the line. From ChatGPT to DALL-E, OpenAI has worked tirelessly over the past few years to create safe and beneficial systems.

But when it comes to the latest innovation, why is OpenAI restraining from releasing its product publicly? Let’s find out what a Voice Engine is and what it can do.

Voice Engine Synthesizer – What Is OpenAI Cooking Next?

OpenAI introduces Voice Engine, a text-to-speech algorithm that produces natural-sounding speech from a single 15-second audio clip, similar to the actual speaker’s voice.

The technology has shown potential in various applications, including reading aids, content translation, improved vital service delivery, support for nonverbal individuals, and patient voice recovery. The firm sees Voice Engine as an opportunity to push the technical boundaries and share AI developments, which aligns with its commitment to AI safety.

According to OpenAI, it might be used for teaching, podcast translation into new languages, remote community outreach, and nonverbal support.

While OpenAI Sora is shaking Hollywood’s ground with its hyperrealistic videos, most people think it won’t be able to create that humanoid feel.

Can OpenAI Voice Engine Synthesizer Recreate Human Voice?

Unlike OpenAI’s past attempts to generate audio content, Voice Engine can produce speech that sounds like individuals, complete with their cadence and intonations. To replicate a person’s voice, the software requires only 15 seconds of recorded audio of them speaking.

OpenAI Voice Engine Synthesizer. Image Credit: Social Media.

Once a voice is cloned, users can enter text into the Voice Engine and receive an AI-generated speech return.

During a tool demonstration, Bloomberg heard a recording of OpenAI CEO Sam Altman briefly discussing the technique in a voice that sounded identical to his own but was fully AI-generated.

When Will It Release? Is It Facing Any Difficulty?

OpenAI is demonstrating its technology but is not yet willing to put itself at risk for the possible social instability that a widespread release could cause. As a result, it appears that there is still time to debut the voice engine officially.

At the same time, the US government is attempting to prevent the unethical use of AI speech technology. Last month, the Federal Communications Commission outlawed robocalls utilizing AI voices after individuals reported receiving spam calls from an AI-cloned voice of President Joe Biden.

As OpenAI points out, this can potentially generate issues with voice authentication methods and frauds in which you don’t know who you’re speaking with over the phone or who has left you a message. So, the OpenAI voice engine synthesizer is still far from reach.

Keep in touch with ICT-Mirror for more updates on Best, Entertainment, Featured, How to’s, Innovation, News, Reviews,