CW Pakistan

UrduHack – the First of Its Kind Open Source Python Library

  • March 21, 2019

Many Natural Language Processing (NLP) modules have been developed for English and other major languages to extract useful insights from unstructured data; however, not much work has been done for Pakistan's local languages.

Very little NLP work has been done for Urdu so far, and one of the major reasons is the lack of basic tools and a framework to process the Urdu language.

To change that, a Pakistani duo, Ikram Ali and Mujadad Rao, has developed an open source Python library for Urdu called UrduHack.

Ikram has over seven years of experience in the software industry and is an avid Machine Learning practitioner with a bachelor's degree in Computer Science from Virtual University, Pakistan. He currently works as a Principal Software Engineer at Arbisoft, Lahore.

Mujadad has a bachelor's degree in Computer Science from the University of Central Punjab and is currently employed as a Machine Learning Developer at Arbisoft.

Speaking to local news, Ikram Ali described how he decided one day to research the work being done on Urdu in the field of Natural Language Processing. He found only a few organizations building applications for Urdu and was disappointed to see that the work was all commercial. As a result, he set out to change that by building a full-fledged Urdu library.

Speaking about UrduHack, Mujadad said, “Our plan is to achieve the maximum possible heights with UrduHack. We want to make it a full-fledged Urdu NLP library which people can use to make thousands of interesting applications for desktop, mobile or web.”

So far, the duo has developed two core modules of the library, Normalization and Tokenization, which are essential for cleaning data and converting it from a cluttered form to a standard form. According to the duo, the library is still very much a work in progress, and they plan to use TensorFlow v2 in upcoming modules later this month.
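To give a rough idea of what normalization and tokenization mean for Urdu text, here is a minimal sketch using only the Python standard library. It is not UrduHack's code or API: the names `CHAR_MAP`, `normalize`, and `tokenize` are hypothetical, and the character mapping and the decision to strip diacritics are assumptions made purely for illustration.

```python
import re
import unicodedata

# Hypothetical mapping (an assumption, not UrduHack's table): Arabic code
# points that often appear in Urdu text, mapped to the forms usually used
# for Urdu.
CHAR_MAP = str.maketrans({
    "\u064a": "\u06cc",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06a9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
})

# Urdu full stop, comma, and question mark, plus common ASCII punctuation.
PUNCT = "\u06d4\u060c\u061f.,!?"
TOKEN_RE = re.compile(rf"[^\s{re.escape(PUNCT)}]+|[{re.escape(PUNCT)}]")


def normalize(text: str) -> str:
    """Bring cluttered text into one standard form."""
    text = unicodedata.normalize("NFC", text)  # compose canonical sequences
    text = text.translate(CHAR_MAP)            # unify look-alike letters
    # Drop remaining combining marks (diacritics); an illustrative choice.
    text = "".join(ch for ch in text if unicodedata.category(ch) != "Mn")
    return " ".join(text.split())              # collapse runs of whitespace


def tokenize(text: str) -> list:
    """Split normalized text into word and punctuation tokens."""
    return TOKEN_RE.findall(normalize(text))


if __name__ == "__main__":
    # Three words typed with Arabic Kaf and Yeh, a double space, and an
    # Urdu full stop at the end.
    raw = "\u0643\u062a\u0627\u0628  \u064a\u06c1 \u06c1\u06d2\u06d4"
    print(tokenize(raw))
```

In this sketch, normalization runs before tokenization so that the same word typed on different keyboards ends up as the same token.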

The journey has not been without its challenges. While developing the library, the duo faced a number of technical difficulties, such as the use of Unicode for the Urdu script.

However, they were able to overcome this challenge by contacting the Unicode Consortium and demanding a separate, fixed Unicode block for Urdu.

The second challenge is finding reliable and authentic data in Urdu. The UrduHack team is actively looking for Urdu data available in digital form, and anyone who has access to such data can contact mujadad.ali@arbisoft.com.

 

Reference links: propakistani.pk

Related Topics
  • Ikram Ali
  • Mujadad Rao
  • Open source
  • python
  • UrduHack
CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.
