CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Wired

Microsoft AI Powered To Clone Voice By 3-Second Audio

  • January 16, 2023
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

With just a three-second audio clip, Microsoft’s new text-to-speech AI will be able to duplicate voices, including tone and pitch.


Despite being a complicated system, VALL-“neural E’s codec language model” is extremely simple to use and only requires the insertion of audio and text. The developers of the programme are certain that it can be applied to high-quality text-to-speech tasks including speech modifying and audio content production. Microsoft’s application is based on EnCodec, which Meta unveiled in October of the previous year.


VALL-E analyses how someone sounds and separates that information into discrete components, producing discrete audio codec codes from text and acoustic stimuli. EnCodec compares what it knows about how that voice would sound if it delivered a different phrase to training data.

The speech-synthesis abilities of VALL-E were taught using audio from a library that Meta put together, which contained 60,000 hours of English speakers from more than 7,000 speakers. The submitted training data must closely match the three-second voice clip sample for a successful outcome.

Although speech-generating software is frequently used by news sites, it requires a lot of input. What’s more, the voice lacks a human-like quality and is unable to communicate expressions or inflections. The application carries possible problems in the misuse of the model, such as spoofing voice recognition or mimicking a certain speaker. VALL-E is extremely advanced and offers a better and more accurate result with minimum required input.

By altering the random seed used in the generating process, the computer can produce differences in voice tone, as shown by the sample provided by Microsoft. VALL-E can simulate the acoustic environment of the audio that was present in the sample audio, such as simulating a voice over the phone.

Share
Tweet
Share
Share
Share
Previous Article
  • Business
  • Wired

SBP Easing Off IT Exporters To Increase Their Income

  • January 15, 2023
Read More
Next Article
  • Cellcos

Pakistan Stands At Second Highest Average Mobile Broadband Speed: PTA

  • January 16, 2023
Read More
You May Also Like
Read More
  • Wired

Marka-e-Haq Lego Tribute: One Year On In AI

  • Press Desk
  • May 6, 2026
Read More
  • Wired

MyCloud By Multinet Launches Pakistan’s First GPU-As-A-Service Platform For AI And Machine Learning Workloads

  • Press Desk
  • May 6, 2026
Read More
  • Wired

Punjab Government Formally Exempts IT Companies, Call Centers And Gyms From Market Closure Timings

  • Press Desk
  • May 6, 2026
Read More
  • Wired

Pakistan Faces Electric Bike And Scooter Shortage As Surging Petrol Prices Drive Demand Beyond Supply

  • Press Desk
  • May 5, 2026
Read More
  • Wired

Careem Conducts Fresh Round Of Layoffs With Pakistani Developers Among Those Affected

  • Press Desk
  • May 5, 2026
Read More
  • Wired

Pakistan Could Benefit From ADB’s $70 Billion AI-Powered Energy And Digital Infrastructure Plan

  • Press Desk
  • May 5, 2026
Read More
  • Wired

Pakistani Researchers Present At Nanjing International Forum On Artificial Intelligence And Green Sustainability

  • Press Desk
  • May 5, 2026
Read More
  • Wired

UK Launches Noor, Pakistan’s First Voice-Based AI Platform For Disaster Response

  • Press Desk
  • May 5, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Trending Posts
  • NED University Software Engineering Department To Showcase AI And Tech Final Year Projects At FYDP Expo 2026
    • May 6, 2026
  • Pakistan Launches RFP For National Open Data Ecosystem To Strengthen Digital Infrastructure
    • May 6, 2026
  • KhiNext Launches AI Expo 26 In Karachi To Showcase Artificial Intelligence Solutions And Innovation
    • May 6, 2026
  • Telenor Pakistan Secures Multiple Wins At Effie Pakistan 2026 Across Health, Youth And Telecom Categories
    • May 6, 2026
  • AI Seekho Phase II Launches Google Antigravity Hackathon With PKR 2.5 Million Prize Pool For AI Agent Developers
    • May 6, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.