CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Wired

Young Pakistani Developer Builds First AI Tools for Sindhi Language

  • April 8, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

A breakthrough in language technology has emerged from Pakistan, where a 23-year-old software developer from Hyderabad, Fahad Maqsood Qazi, has developed the first-ever artificial intelligence-based tools for the Sindhi language. These tools enable text-to-speech (TTS) and speech-to-text (STT) functions in Sindhi—a landmark achievement for a language spoken by nearly 40 million people globally but long overlooked in the realm of AI-driven language services.

Qazi began the project in 2023 while working on an AI dubbing system for Flis Technologies, a company he co-founded. It was during this work that he realized the complete absence of foundational AI tools for the Sindhi language. Unlike more globally dominant languages like English or Mandarin, Sindhi had no public tools available for speech recognition or voice synthesis, making it largely invisible in the digital AI era.

Determined to fill this gap, Qazi began building his own dataset from scratch. He sourced hours of Sindhi audio from YouTube, audiobooks, and news broadcasts, and manually transcribed them to create a training base for his AI models. During this time, he discovered that Google employee Asad Memon had enabled Sindhi support on Mozilla’s Common Voice platform. Qazi merged this open-source dataset with his own, providing a robust foundation for his machine learning models.

By January 2024, Qazi had completed the first working versions of both TTS and STT models for Sindhi. Realizing the language also lacked a tokenizer—a basic software component that breaks down sentences into individual words or characters for AI processing—he developed one himself. The addition of a tokenizer was critical, as it allowed the language to be processed by machine learning systems, enabling better accuracy and functionality.

The implications of this work go far beyond software development. In many countries where Sindhi-speaking diaspora communities live, the language is not formally taught, particularly to younger generations. This has led to a gradual erosion of reading and writing skills in Sindhi. Qazi hopes that his tools will bridge that gap, making it easier for these communities to stay connected to their linguistic heritage through voice-based technology.

He emphasized that the tools could help both the tech-savvy and the tech-shy engage with the language. A child or adult who cannot read Sindhi can now listen to stories or information via TTS. Conversely, someone unfamiliar with writing the language can speak into a phone or computer and have their words transcribed using STT. This is particularly significant for older generations or individuals with limited literacy, who may struggle to use digital devices in their native language.

In March 2024, Qazi uploaded his models to HuggingFace, a collaborative platform for AI models used by developers worldwide. By making his work open-source, he hopes to encourage further development in Sindhi language technology. Researchers, developers, and language activists can now build upon his models, enabling a broader ecosystem of applications that include translation tools, educational content, and even voice-controlled interfaces.

Qazi stressed that for Sindhi to remain relevant in the modern world, it must be accessible across digital platforms.

 “Without access to tools like these, Sindhi could be excluded from digital spaces.”

 “Now it can be part of systems like voice interfaces, educational resources, and translation tools.”

This accomplishment marks a new chapter for Sindhi language inclusion in the AI era. By building the foundational tools himself, Qazi has not only addressed a glaring digital gap but has also laid the groundwork for a more inclusive future where regional languages are part of global technological advancement.

Share
Tweet
Share
Share
Share
Previous Article
  • Wired

PITB Launches Online Auction for Fancy Vehicle Number Plates

  • April 8, 2025
Read More
Next Article
  • Wired

LESCO Tackles Electricity Theft and Boosts Efficiency with New Upgrades

  • April 8, 2025
Read More
You May Also Like
Read More
  • Wired

Pakistan vs India ICC Men’s T20 World Cup 2026 Live Coverage And Match Preview

  • Press Desk
  • February 16, 2026
Read More
  • Wired

Pakistan Cambodia Discuss Tech And Innovation Partnership During NUST Visit

  • Press Desk
  • February 14, 2026
Read More
  • Wired

US Offers Support To Unlock Pakistan IT Potential Through Industry Webinar

  • Press Desk
  • February 14, 2026
Read More
  • Wired

SUPARCO Forecasts Ramazan 2026 To Begin On February 19 In Pakistan

  • Press Desk
  • February 14, 2026
Read More
  • Wired

Finance Minister Muhammad Aurangzeb Advocates Stronger Role for Emerging Economies at AlUla Conference

  • Press Desk
  • February 13, 2026
Read More
  • Wired

Islamabad High Court Rules Rs. 32 Billion PEMRA Levy On TV Channels Unlawful

  • Press Desk
  • February 13, 2026
Read More
  • Wired

KP Imposes Ban On Male Faculty One-On-One Meetings With Female Students In Public Universities

  • Press Desk
  • February 13, 2026
Read More
  • Wired

STZA Conducts Awareness Session With PSW To Streamline Compliance For Licensees

  • Press Desk
  • February 13, 2026
Trending Posts
  • JazzWorld Partners With USF To Deliver Broadband And Mobile Services In Sindh’s Badin District
    • February 17, 2026
  • Samsung Pakistan President Highlights AI Training At Samsung Innovation Campus With Knowledge Streams
    • February 17, 2026
  • Fasset And HRL Collaboration Aims To Modernize Digital Finance And Asset Tokenization In Pakistan
    • February 17, 2026
  • Pakistan Digital Authority To Design AI Native Cognitive Government Operating System
    • February 17, 2026
  • BMW Ramadan 2026 Offer Brings Up To PKR 8.25 Million Discount On Electric Vehicles
    • February 17, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.