CW Pakistan

xAI’s Grok-2 and Grok-2 Mini Get Major Speed Boost, Grok-2 Claims #2 Spot in AI Model Performance

  • August 27, 2024

Elon Musk’s xAI has sharply improved the performance of its Grok-2 large language model (LLM) chatbot. In just three days, xAI developers rewrote the inference stack using SGLang, an open-source system designed for efficient execution of complex language model programs. The rewrite produced a substantial speed increase for both the full Grok-2 model and the smaller Grok-2 Mini.

The improvements were announced by xAI developer Igor Babuschkin on the social network X. He said that Grok-2 Mini is now twice as fast as it was previously, and credited Lianmin Zheng and Saeed Maleki, who rewrote the inference code using SGLang.

The rewrite also lets xAI serve the larger Grok-2 model, which requires multi-host inference, at a significantly faster pace. In addition, both models showed a slight increase in accuracy alongside the speed boost.

SGLang was developed by a team from Stanford University, UC Berkeley, Texas A&M, and Shanghai Jiao Tong University. It supports a wide range of models, including Llama, Mistral, and LLaVA, and works with both open-weight models and API-based models such as OpenAI’s GPT-4. The system’s strength lies in its ability to optimize execution through automatic cache reuse and program-level parallelism.
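To make the idea of automatic cache reuse more concrete, here is a minimal Python sketch. It is a toy illustration under stated assumptions, not SGLang's code or API: the `PrefixCache` class and its `match_and_insert` method are hypothetical names invented for this example, and trie nodes merely stand in for the per-token state a real serving system would cache. The point it shows is that when two requests share a prompt prefix, the work for that prefix only needs to be done once.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Maps the next token to the node holding its (stand-in) cached state.
    children: dict = field(default_factory=dict)


class PrefixCache:
    """Hypothetical toy cache: a trie keyed by token sequences."""

    def __init__(self):
        self.root = Node()

    def match_and_insert(self, tokens):
        """Return how many leading tokens were already cached,
        then add the remaining tokens to the cache."""
        node = self.root
        reused = 0
        for tok in tokens:
            child = node.children.get(tok)
            if child is None:
                # Cache miss: everything from here on is new work.
                child = Node()
                node.children[tok] = child
            else:
                # Cache hit: this token's state can be reused.
                reused += 1
            node = child
        return reused


cache = PrefixCache()
system_prompt = "You are a helpful assistant .".split()  # 6 tokens

first = cache.match_and_insert(system_prompt + ["What", "is", "SGLang", "?"])
second = cache.match_and_insert(system_prompt + ["What", "is", "Grok-2", "?"])

print(first)   # 0: nothing was cached yet
print(second)  # 8: the 6 system-prompt tokens plus "What" and "is"
```

In this sketch the second request reuses eight of its ten tokens, so only the last two would need fresh computation. A production system would additionally need eviction and memory management, which are left out here.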

The speed-up coincides with strong results on the third-party LMSYS Chatbot Arena leaderboard, which ranks AI models by performance. The full Grok-2 model has secured the number two spot with an Arena Score of 1293, placing it alongside Google’s Gemini-1.5 Pro and just behind OpenAI’s latest ChatGPT-4o.

Grok-2 Mini, which also benefited from the recent enhancements, climbed to the number five position with an Arena Score of 1268, just behind GPT-4o mini and Claude 3.5 Sonnet. Both Grok-2 and Grok-2 Mini are proprietary models developed by xAI.

Grok-2 is particularly strong in mathematical tasks, where it currently holds the top spot. The model also ranks near the top in other categories, including Hard Prompts, Coding, and Instruction-Following. Overall, it now outranks prominent models such as OpenAI’s GPT-4o (May 2024), which sits at number four.

According to Babuschkin, the primary advantage of Grok-2 Mini is its speed. He added that further work is underway to make it even faster, which should appeal to users who want high performance without significant computational resources.

The arrival of Grok-2 and Grok-2 Mini on the LMSYS Chatbot Arena leaderboard, and their rankings there, have drawn significant attention within the AI community. As xAI continues to refine its models, further improvements in speed, accuracy, and overall performance are expected.

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2025. Read Privacy Policy.
