CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Global Insights

MIT And KAUST Researchers Release MathNet, The World’s Largest Collection Of Olympiad-Level Mathematics Problems Spanning 47 Countries And 17 Languages

  • April 22, 2026
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Researchers at the Massachusetts Institute of Technology’s Computer Science and Artificial Intelligence Laboratory, King Abdullah University of Science and Technology, and artificial intelligence research organisation HUMAIN have released what is being described as the world’s largest and most diverse collection of Olympiad-level mathematics problems, opening up a resource that has significant implications for both artificial intelligence research and mathematics education globally. The dataset, named MathNet, will be presented at the International Conference on Learning Representations in Brazil later this month and is freely available to the public through MIT’s Computer Science and Artificial Intelligence Laboratory.

MathNet comprises more than 30,000 expert-authored problems and solutions spanning 47 countries, 17 languages, and 143 competitions, making it five times larger than the next biggest dataset of its kind. The scale alone sets it apart, but what distinguishes MathNet more fundamentally from previous Olympiad-level datasets is its geographic and linguistic breadth. Previous Olympiad-level datasets draw almost exclusively from competitions in the United States and China, whereas MathNet spans dozens of countries across six continents, covers 17 languages, includes both text and image-based problems and solutions, and spans four decades of competition mathematics. Building the dataset required tracking down 1,595 PDF volumes totalling more than 25,000 pages, including decades-old scans in more than a dozen languages. A significant portion of that archive came from Navid Safaei, a longtime International Mathematical Olympiad community figure and co-author who had been collecting and scanning those national competition booklets by hand since 2006, and whose personal archive formed the backbone of the dataset. The solutions contained in those booklets are expert-written and peer-reviewed, often running to multiple pages with authors walking through several distinct approaches to the same problem, giving artificial intelligence models a far richer training signal than the shorter, informal solutions typical of community-sourced datasets.

Testing on MathNet reveals that even the most capable frontier models struggle meaningfully at this level of mathematical reasoning: GPT-5, the top-performing model tested, averaged around 69.3 percent on MathNet’s main benchmark of 6,400 problems, failing nearly one in three Olympiad-level problems, and when problems include figures, performance drops significantly across the board, exposing visual reasoning as a consistent weak point. Several open-source models scored zero percent on Mongolian-language problems, highlighting the degree to which current artificial intelligence systems remain brittle when confronted with less common languages despite their overall capabilities. Beyond problem-solving, MathNet introduces a retrieval benchmark that asks whether models can recognise when two problems share the same underlying mathematical structure, testing eight state-of-the-art embedding models and finding that even the strongest identified the correct match only about 5 percent of the time on the first attempt, with models frequently ranking structurally unrelated problems as more similar than mathematically equivalent ones. For the broader mathematics community, the dataset also addresses a longstanding gap: Olympiad problem booklets shared between national delegations had never been systematically collected and made publicly accessible, leaving students in many countries to train for these competitions largely in isolation. Lead author Shaden Alshammari, an MIT doctoral student, noted that for many students the Olympiad preparation experience had always been an individual effort with no communal resource to draw from, a gap that MathNet is now designed to close.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem.

Share
Tweet
Share
Share
Share
Related Topics
  • AI Mathematical Reasoning
  • Artificial Intelligence
  • International Mathematical Olympiad
  • KAUST
  • large language models
  • Math Dataset
  • MathNet
  • MIT CSAIL
  • Olympiad Mathematics
Previous Article
  • Global Insights

Australian Care Facility Builds World’s First Permanent Virtual Reality Rail Carriage To Take Elderly Residents On Immersive Journeys Across Ten Countries

  • April 22, 2026
Read More
Next Article
  • Digital Pakistan

National IT Board Signs MoU With Excise And Taxation Department Islamabad To Bring Services Onto Digital Economy Enhancement Project Platform

  • April 22, 2026
Read More
You May Also Like
Read More
  • Global Insights

Australian Care Facility Builds World’s First Permanent Virtual Reality Rail Carriage To Take Elderly Residents On Immersive Journeys Across Ten Countries

  • Press Desk
  • April 22, 2026
Read More
  • Global Insights

Sam Altman’s World ID Expands Iris Scan Verification To Major Platforms With 18 Million Users Across 160 Countries

  • Press Desk
  • April 22, 2026
Read More
  • Global Insights

Apple CEO Transition John Ternus To Lead As Tim Cook Moves To Executive Chairman

  • Press Desk
  • April 21, 2026
Read More
  • Global Insights

stc Netflix Partnership Saudi Arabia Streaming Bundled Services stc TV Baity Fiber

  • Press Desk
  • April 21, 2026
Read More
  • Global Insights

Bangladesh Faces Imminent Telecom Shutdowns As Middle East Fuel Crisis Cuts Off Diesel Supply

  • Press Desk
  • April 21, 2026
Read More
  • Global Insights

Google And Marvell Two Chip TPU Plan Targets AI Inference Efficiency And ASIC Market Shift

  • Press Desk
  • April 20, 2026
Read More
  • Global Insights

Humanoid Robot Breaks Half Marathon Record In Beijing Highlighting AI And Robotics Advancements

  • Press Desk
  • April 20, 2026
Read More
  • Global Insights

France Criminalizes Planned Obsolescence Under Anti-Waste Law

  • Press Desk
  • April 20, 2026
Trending Posts
  • NADRA e Sahulat Expansion 173 Franchises Lahore Digital Identity Services Pakistan
    • April 22, 2026
  • Mobilink Bank WIN Incubator 18 Women Startups Pakistan DEI Digital Entrepreneurship
    • April 22, 2026
  • DIB Pakistan Pocket Money USD Inflows Freelancers Remittances Digital Payments Pakistan
    • April 22, 2026
  • Punjab E Learn She Earn Programme Digital Skills Women Online Earning Pakistan
    • April 22, 2026
  • Supreme Court Of Pakistan Advances Digital Justice With Multi-City Video Link Hearings And Nationwide E-Court System Rollout By August 2026
    • April 22, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.