CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Wired

Young Pakistani Developer Builds First AI Tools for Sindhi Language

  • April 8, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

A breakthrough in language technology has emerged from Pakistan, where a 23-year-old software developer from Hyderabad, Fahad Maqsood Qazi, has developed the first-ever artificial intelligence-based tools for the Sindhi language. These tools enable text-to-speech (TTS) and speech-to-text (STT) functions in Sindhi—a landmark achievement for a language spoken by nearly 40 million people globally but long overlooked in the realm of AI-driven language services.

Qazi began the project in 2023 while working on an AI dubbing system for Flis Technologies, a company he co-founded. It was during this work that he realized the complete absence of foundational AI tools for the Sindhi language. Unlike more globally dominant languages like English or Mandarin, Sindhi had no public tools available for speech recognition or voice synthesis, making it largely invisible in the digital AI era.

Determined to fill this gap, Qazi began building his own dataset from scratch. He sourced hours of Sindhi audio from YouTube, audiobooks, and news broadcasts, and manually transcribed them to create a training base for his AI models. During this time, he discovered that Google employee Asad Memon had enabled Sindhi support on Mozilla’s Common Voice platform. Qazi merged this open-source dataset with his own, providing a robust foundation for his machine learning models.

By January 2024, Qazi had completed the first working versions of both TTS and STT models for Sindhi. Realizing the language also lacked a tokenizer—a basic software component that breaks down sentences into individual words or characters for AI processing—he developed one himself. The addition of a tokenizer was critical, as it allowed the language to be processed by machine learning systems, enabling better accuracy and functionality.

The implications of this work go far beyond software development. In many countries where Sindhi-speaking diaspora communities live, the language is not formally taught, particularly to younger generations. This has led to a gradual erosion of reading and writing skills in Sindhi. Qazi hopes that his tools will bridge that gap, making it easier for these communities to stay connected to their linguistic heritage through voice-based technology.

He emphasized that the tools could help both the tech-savvy and the tech-shy engage with the language. A child or adult who cannot read Sindhi can now listen to stories or information via TTS. Conversely, someone unfamiliar with writing the language can speak into a phone or computer and have their words transcribed using STT. This is particularly significant for older generations or individuals with limited literacy, who may struggle to use digital devices in their native language.

In March 2024, Qazi uploaded his models to HuggingFace, a collaborative platform for AI models used by developers worldwide. By making his work open-source, he hopes to encourage further development in Sindhi language technology. Researchers, developers, and language activists can now build upon his models, enabling a broader ecosystem of applications that include translation tools, educational content, and even voice-controlled interfaces.

Qazi stressed that for Sindhi to remain relevant in the modern world, it must be accessible across digital platforms.

 “Without access to tools like these, Sindhi could be excluded from digital spaces.”

 “Now it can be part of systems like voice interfaces, educational resources, and translation tools.”

This accomplishment marks a new chapter for Sindhi language inclusion in the AI era. By building the foundational tools himself, Qazi has not only addressed a glaring digital gap but has also laid the groundwork for a more inclusive future where regional languages are part of global technological advancement.

Share
Tweet
Share
Share
Share
Previous Article
  • Wired

PITB Launches Online Auction for Fancy Vehicle Number Plates

  • April 8, 2025
Read More
Next Article
  • Wired

LESCO Tackles Electricity Theft and Boosts Efficiency with New Upgrades

  • April 8, 2025
Read More
You May Also Like
Read More
  • Wired

Pakistan Hockey Team Qualifies for FIH World Cup 2026 with Support from PTCL and Ufone

  • Press Desk
  • March 7, 2026
Read More
  • Wired

PAKAMTECH Expands Operations With Tri-State Chapter To Boost IT Collaboration Between Pakistan And US

  • Press Desk
  • March 7, 2026
Read More
  • Wired

Government Procures 100,000 Laptops Under Prime Minister Youth Laptop Scheme IV at Cost of PKR 16.80 Billion

  • Press Desk
  • March 6, 2026
Read More
  • Wired

Punjab Government Launches Rehmat Card Scheme Offering Rs100000 Assistance To Widows

  • Press Desk
  • March 6, 2026
Read More
  • Wired

Pakistan Science Foundation Launches First National Contest For Need‑Based Technologies 2026

  • Press Desk
  • March 4, 2026
Read More
  • Wired

Punjab Information Technology Board Receives Approximately 16,000 Applications Through Global Information Technology Certifications Program

  • Press Desk
  • March 4, 2026
Read More
  • Wired

Pakistan Embassy Abu Dhabi Introduces Online Registration Form For Pakistani Community In UAE

  • Press Desk
  • March 3, 2026
Read More
  • Wired

Claude AI Down As Users Report Temporary Service Disruption Worldwide

  • Press Desk
  • March 3, 2026
Trending Posts
  • Chitral Residents Continue To Face Internet And Mobile Connectivity Challenges Despite USF Projects
    • March 11, 2026
  • PITB Opens Registrations For SheWins Training Program Empowering Women With Digital Skills
    • March 11, 2026
  • Shaza Khawaja To Hold Emergency Meeting On IT Industry Amid Austerity Measures
    • March 11, 2026
  • Pakistan’s 5G Spectrum Auction: Jazz Secures 190 MHz Spectrum For $239.3 Million In Pakistan’s Latest Auction
    • March 11, 2026
  • Pakistan’s 5G Spectrum Auction: PTA To Conduct Second Phase Of 5G Spectrum Auction For Band Positioning
    • March 11, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.