CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • TechAdvisor

Meta Launches Omnilingual ASR To Support 1,600 Plus Languages With Open Source Apache License

  • November 18, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Meta has introduced Omnilingual ASR, a new multilingual automatic speech recognition system that supports more than 1,600 languages natively and expands to over 5,400 languages through zero shot in context learning. This marks a return to open source releases for the company after several restricted licensing models in recent years. The system can transcribe spoken audio into text by processing paired audio and text examples during inference time, even for languages it has not encountered during training. It is available under the Apache 2.0 license and is accessible through Meta’s website, Github, and a demonstration page on Hugging Face. Along with the models, Meta has published a technical paper and a large speech corpus of more than 350 underserved languages under open licenses.

Meta stated on its AIatMeta account on X that its aim is to remove language barriers and widen digital access, and the Omnilingual ASR suite is designed to support speech to text functions across areas such as voice assistance, transcription tools, subtitle generation, and oral archive preservation. The system includes several model families, such as wav2vec 2.0 for speech representation learning, CTC based models for supervised training, and LLM based models for advanced transcription. A zero shot model variant allows new languages to be supported by using only a small number of example audio and text pairs. The models follow an encoder to decoder structure that converts raw audio signals into language agnostic representations before decoding them into written output.

The system’s scale sets it apart from competing products. OpenAI’s Whisper supports 99 languages, while Meta’s system directly supports more than 1,600 and can extend to thousands more. Benchmarks indicate that Omnilingual ASR achieves character error rates under ten percent in more than seventy percent of supported languages, including more than five hundred that have not been included in ASR tools before. Meta’s research highlights the advantage of this extended coverage for communities whose languages have often been missing from digital speech technologies. The suite’s largest model requires high end hardware for inference, while smaller models can run on lower power devices, making them feasible for both enterprise level setups and compact deployments.

The release arrives during a significant transition period for Meta’s AI division. Llama 4, released earlier in 2025, faced poor enterprise uptake and contributed to organisational restructuring. Mark Zuckerberg appointed Alexandr Wang as Chief AI Officer and approved new recruitment efforts to strengthen the research pipeline. Omnilingual ASR is positioned as a corrective step by providing a practical and accessible contribution in language technology with transparent data and permissive licensing. It aligns with Meta’s wider efforts to direct investment toward foundational AI, supported by its new AI accelerators and infrastructure improvements announced in September, as well as renewed access to public dataset training across Europe after regulatory adjustments. This approach marks a shift toward cohesive platform development rather than fragmented updates.

The dataset behind Omnilingual ASR was created in collaboration with community organisations across Africa and Asia. African Next Voices, Mozilla Common Voice, and Lanfrica were among the contributors, enabling the inclusion of hundreds of low resource languages. The recordings consist of natural speech gathered through culturally familiar prompts, and transcriptions follow established writing standards. Meta emphasises that expanding speech recognition to thousands of languages requires local partnerships, and the open source release is intended to allow communities to personalise the models with their own data. All resources, including code and datasets, are available through Github, Hugging Face, and Meta’s AI blog, with installation support through PyPI.

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI
  • Apache
  • ASR
  • GITHUB
  • HuggingFace
  • Llama
  • Meta
  • Omnilingual
  • Open source
  • speech recognition
  • transcription
  • Whisper
Previous Article
  • Digital Pakistan

Sindh Government Orders Officials To Pay Their Own E Challans After Rising Violations

  • November 18, 2025
Read More
Next Article
  • Cellcos

Pakistan Must Align People Policy And Product To Compete In AI Era, Says Jazz Strategy Chief

  • November 18, 2025
Read More
You May Also Like
Read More
  • TechAdvisor

Google Translate Receives Gemini AI Upgrade With Enhanced Real-Time Speech And Language Features

  • Press Desk
  • December 20, 2025
Read More
  • TechAdvisor

Hyundai Pakistan Announces Year-End Discounts On Hybrid Lineup With EMI And Cash Offers

  • Press Desk
  • December 20, 2025
Read More
  • TechAdvisor

Google Launches Gemini 3 Flash AI Model For Faster Responses Across Services

  • Press Desk
  • December 20, 2025
Read More
  • TechAdvisor

Global Wrist-Worn Smart Device Market Shows Strong Growth With Huawei Leading

  • Press Desk
  • December 20, 2025
Read More
  • TechAdvisor

Rising Memory Prices Force Smartphone Shipment Forecasts Down For 2026

  • Press Desk
  • December 19, 2025
Read More
  • TechAdvisor

Hard Drive Prices Surge Amid AI Data Demand And Supply Constraints

  • Press Desk
  • December 19, 2025
Read More
  • TechAdvisor

Understanding How VPNs Work To Protect Privacy And Bypass Restrictions

  • Press Desk
  • December 19, 2025
Read More
  • TechAdvisor

OnePlus Confirms Turbo Smartphone Series With Gaming Focus And Massive Battery Claims

  • Press Desk
  • December 19, 2025
Trending Posts
  • Jhuggi Wala Community Network Launched to Promote Digital Inclusion in Muzaffargarh
    • December 20, 2025
  • Google Translate Receives Gemini AI Upgrade With Enhanced Real-Time Speech And Language Features
    • December 20, 2025
  • Hyundai Pakistan Announces Year-End Discounts On Hybrid Lineup With EMI And Cash Offers
    • December 20, 2025
  • PTCL Business Solutions Hosts Connect 2025 Showcasing Enterprise Innovation And Digital Infrastructure
    • December 20, 2025
  • Google Launches Gemini 3 Flash AI Model For Faster Responses Across Services
    • December 20, 2025
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.