CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Wired

Young Pakistani Developer Builds First AI Tools for Sindhi Language

  • April 8, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

A breakthrough in language technology has emerged from Pakistan, where a 23-year-old software developer from Hyderabad, Fahad Maqsood Qazi, has developed the first-ever artificial intelligence-based tools for the Sindhi language. These tools enable text-to-speech (TTS) and speech-to-text (STT) functions in Sindhi—a landmark achievement for a language spoken by nearly 40 million people globally but long overlooked in the realm of AI-driven language services.

Qazi began the project in 2023 while working on an AI dubbing system for Flis Technologies, a company he co-founded. It was during this work that he realized the complete absence of foundational AI tools for the Sindhi language. Unlike more globally dominant languages like English or Mandarin, Sindhi had no public tools available for speech recognition or voice synthesis, making it largely invisible in the digital AI era.

Determined to fill this gap, Qazi began building his own dataset from scratch. He sourced hours of Sindhi audio from YouTube, audiobooks, and news broadcasts, and manually transcribed them to create a training base for his AI models. During this time, he discovered that Google employee Asad Memon had enabled Sindhi support on Mozilla’s Common Voice platform. Qazi merged this open-source dataset with his own, providing a robust foundation for his machine learning models.

By January 2024, Qazi had completed the first working versions of both TTS and STT models for Sindhi. Realizing the language also lacked a tokenizer—a basic software component that breaks down sentences into individual words or characters for AI processing—he developed one himself. The addition of a tokenizer was critical, as it allowed the language to be processed by machine learning systems, enabling better accuracy and functionality.

The implications of this work go far beyond software development. In many countries where Sindhi-speaking diaspora communities live, the language is not formally taught, particularly to younger generations. This has led to a gradual erosion of reading and writing skills in Sindhi. Qazi hopes that his tools will bridge that gap, making it easier for these communities to stay connected to their linguistic heritage through voice-based technology.

He emphasized that the tools could help both the tech-savvy and the tech-shy engage with the language. A child or adult who cannot read Sindhi can now listen to stories or information via TTS. Conversely, someone unfamiliar with writing the language can speak into a phone or computer and have their words transcribed using STT. This is particularly significant for older generations or individuals with limited literacy, who may struggle to use digital devices in their native language.

In March 2024, Qazi uploaded his models to HuggingFace, a collaborative platform for AI models used by developers worldwide. By making his work open-source, he hopes to encourage further development in Sindhi language technology. Researchers, developers, and language activists can now build upon his models, enabling a broader ecosystem of applications that include translation tools, educational content, and even voice-controlled interfaces.

Qazi stressed that for Sindhi to remain relevant in the modern world, it must be accessible across digital platforms.

 “Without access to tools like these, Sindhi could be excluded from digital spaces.”

 “Now it can be part of systems like voice interfaces, educational resources, and translation tools.”

This accomplishment marks a new chapter for Sindhi language inclusion in the AI era. By building the foundational tools himself, Qazi has not only addressed a glaring digital gap but has also laid the groundwork for a more inclusive future where regional languages are part of global technological advancement.

Share
Tweet
Share
Share
Share
Previous Article
  • Wired

PITB Launches Online Auction for Fancy Vehicle Number Plates

  • April 8, 2025
Read More
Next Article
  • Wired

LESCO Tackles Electricity Theft and Boosts Efficiency with New Upgrades

  • April 8, 2025
Read More
You May Also Like
Read More
  • Wired

Pakistan Explores EV Charging And Smart Energy Partnership With StarCharge In Changzhou

  • Press Desk
  • May 23, 2026
Read More
  • Wired

British Pakistani Scientist Develops AI Eye Scan to Detect Dementia Early

  • Press Desk
  • May 23, 2026
Read More
  • Wired

Evee Opens Giga Factory in Pakistan to Scale Electric Two-Wheeler Production

  • Press Desk
  • May 22, 2026
Read More
  • Wired

Iran IRGC Threatens Submarine Cables in Strait of Hormuz

  • Press Desk
  • May 21, 2026
Read More
  • Wired

Sindh Approves 500 Electric Buses for Karachi and Hyderabad

  • Press Desk
  • May 21, 2026
Read More
  • Wired

Sindh Pink EV Scooter Scheme Expands to Hyderabad District

  • Press Desk
  • May 21, 2026
Read More
  • Wired

NAVTTC Opens Applications for Blended International IT Training Program

  • Press Desk
  • May 19, 2026
Read More
  • Wired

Arcana Info TechByte Session Covers IBM App Connect and Enterprise Integration

  • Press Desk
  • May 19, 2026
Trending Posts
  • Abu Dhabi Lunate Backs $70 Million Manara Ventures Fund for Jordan Tech Startups
    • May 23, 2026
  • Kuwait Awards 2.7 Billion PPP Deal To Beyons Beyon Group For Next-Generation Telecom Network
    • May 23, 2026
  • Pakistan Local Mobile Phone Manufacturing Falls 35 Percent In April 2026
    • May 23, 2026
  • Aramco And Pasqal Launch Saudi Arabias First Quantum Computer With 200 Qubit System
    • May 23, 2026
  • Samsung Pakistan Expands Trade-In Programme For Galaxy S26 Ultra Upgrade
    • May 23, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.