With just a three-second audio clip, Microsoft’s new text-to-speech AI can reportedly duplicate a voice, including its tone and pitch.
Despite being a complicated system, VALL-E, a “neural codec language model”, is extremely simple to use, requiring only audio and text as input. The developers of the programme are confident it can be applied to high-quality text-to-speech tasks, including speech editing and audio content production. Microsoft’s application builds on EnCodec, which Meta unveiled in October of the previous year.
VALL-E analyses how someone sounds and separates that information into discrete components, producing discrete audio codec codes from text and a short acoustic prompt. Drawing on its training data, it predicts how that voice would sound delivering a different phrase, and EnCodec’s decoder converts the predicted codes back into audio.
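The two-step idea described above can be sketched in miniature. Everything below is a hypothetical stand-in for illustration only: the real system uses a trained EnCodec model and a neural language model over codec tokens, neither of which appears here.

```python
# Hypothetical sketch of the pipeline: audio -> discrete codec tokens,
# then text + speaker prompt -> predicted codec tokens. Not Microsoft's code.

def encode_audio(samples):
    """Stand-in for EnCodec: quantize audio into discrete codec tokens."""
    return [int(abs(s) * 255) % 1024 for s in samples]

def predict_codes(text, prompt_tokens):
    """Stand-in for the codec language model: emit one codec token per
    character of `text`, conditioned on the speaker prompt tokens."""
    voice = sum(prompt_tokens) % 1024  # crude "speaker identity" summary
    return [(ord(ch) + voice) % 1024 for ch in text]

prompt = encode_audio([0.1, -0.4, 0.9])  # toy stand-in for the 3-second clip
codes = predict_codes("hi", prompt)      # tokens a decoder would turn into audio
```

The point of the conditioning step is that the same text produces different codec tokens for different speaker prompts, which is how the three-second sample steers the output voice.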
The speech-synthesis abilities of VALL-E were trained on an audio library assembled by Meta, containing 60,000 hours of English speech from more than 7,000 speakers. For a successful outcome, the three-second voice clip must closely match a voice in the training data.
Speech-generating software is already frequently used by news sites, but existing tools require a lot of input, and the resulting voices lack a human-like quality, unable to convey expression or inflection. VALL-E is far more advanced, offering a better and more accurate result from minimal input. That power also brings potential for misuse, such as spoofing voice-recognition systems or impersonating a specific speaker.
By altering the random seed used in the generating process, the system can produce variations in voice tone, as shown in the samples provided by Microsoft. VALL-E can also reproduce the acoustic environment of the sample audio, for instance making a voice sound as it would over the phone.
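The seed effect is easy to demonstrate in isolation. The snippet below is a toy illustration, not Microsoft’s code: the same input text run through the same sampling procedure yields a different token sequence under a different seed, while any given seed remains reproducible.

```python
# Toy illustration of seed-controlled variation: identical text,
# different random seeds, different sampled token sequences.
import random

def sample_tokens(text, seed):
    """Sample one pseudo codec token per character, deterministically per seed."""
    rng = random.Random(seed)
    return [(ord(ch) + rng.randrange(1024)) % 1024 for ch in text]

a = sample_tokens("same sentence", seed=41)
b = sample_tokens("same sentence", seed=42)
```

Rerunning with the same seed reproduces the same sequence exactly, which is why a fixed seed gives a repeatable voice rendition while a new seed gives a fresh variation.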