CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • TechAdvisor

xAI’s Grok-2 and Grok-2 Mini Get Major Speed Boost, Grok-2 Claims #2 Spot in AI Model Performance

  • August 27, 2024
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Elon Musk’s xAI has taken significant strides in improving the performance of its Grok-2 large language model (LLM) chatbot. In just three days, xAI developers completely rewrote the inference code stack using SGLang, an open-source system designed for efficient execution of complex language models. This resulted in a dramatic speed increase for both the full Grok-2 model and the streamlined Grok-2 Mini.

The improvements were announced by xAI developer Igor Babuschkin on the social network X. He revealed that Grok-2 Mini is now twice as fast as it was previously. This impressive feat is attributed to the collaborative effort of Lianmin Zheng and Saeed Maleki, who rewrote the inference code using SGLang.

SGLang’s efficiency extends beyond speed. It also enables xAI to serve the larger Grok-2 model, which requires multi-host inference, at a significantly faster pace. Additionally, both models experienced a slight increase in accuracy alongside the speed boost.

Developed by a team from Stanford University, UC Berkeley, Texas A&M, and Shanghai Jiao Tong University, SGLang offers a versatile platform. It supports a wide range of models, including Llama, Mistral, and LLaVA, and is compatible with open-weight and API-based models like OpenAI’s GPT-4. The system’s strength lies in its ability to optimize execution through automatic cache reuse and program-level parallelism.

This performance boost is accompanied by impressive results on the third-party Lmsys Chatbot Arena leaderboard, which benchmarks AI model performance. The full Grok-2 model has secured the number two spot with an Arena Score of 1293, placing it alongside Google’s Gemini-1.5 Pro and just behind OpenAI’s latest ChatGPT-4o.

Grok-2 Mini, also benefiting from the recent enhancements, climbed to the number five position with an Arena Score of 1268, trailing only GPT-4o mini and Claude 3.5 Sonnet. Notably, both Grok-2 and Grok-2 Mini are proprietary models developed by xAI, showcasing the company’s dedication to pushing the boundaries of AI technology.

Grok-2 has established itself as a leader, particularly in mathematical tasks, where it currently holds the top spot. The model also demonstrates strong performance across various categories, including Hard Prompts, Coding, and Instruction-Following, consistently ranking near the top. This performance surpasses prominent models like OpenAI’s GPT-4o (May 2024), which now sits at number four.

According to Babuschkin, the primary advantage of Grok-2 Mini lies in its enhanced speed. However, he assures further advancements are underway to make it even faster, catering to users who prioritize high performance without significant computational resources.

The addition of Grok-2 and Grok-2 Mini to the Lmsys Chatbot Arena leaderboard and their subsequent performance have garnered significant attention within the AI community. These achievements highlight xAI’s ongoing commitment to innovation and its relentless pursuit of pushing the boundaries of AI capabilities. As xAI continues to refine its models, we can expect further improvements in speed, accuracy, and overall performance, ensuring Grok-2 and Grok-2 Mini remain at the forefront of AI development.

Share
Tweet
Share
Share
Share
Previous Article
  • Ignite

Ultimate Guide to Finding Investors: VC Firms, Angel Investors, and More 

  • August 27, 2024
Read More
Next Article
  • Ignite

Pakistani Startups Make Forbes’ Most Innovative Companies List

  • August 27, 2024
Read More
You May Also Like
Read More
  • TechAdvisor

Apple Launches Creator Studio Subscription for iPhone, iPad, and Mac Users

  • webdesk
  • January 17, 2026
Read More
  • TechAdvisor

Apple Partners With Google Gemini AI To Power Revamped Siri

  • webdesk
  • January 17, 2026
Read More
  • TechAdvisor

OpenAI Acquires Health Startup Torch For $100 Million To Enhance ChatGPT Health

  • webdesk
  • January 17, 2026
Read More
  • Digital Pakistan
  • TechAdvisor

Pakistan to Host Indus AI Week 2026 With National and Global AI Engagement

  • Press Desk
  • January 17, 2026
Read More
  • TechAdvisor

Samsung Prepares Advanced AI-Powered Bixby Launch With One UI 8.5

  • webdesk
  • January 17, 2026
Read More
  • TechAdvisor

Google Refreshes Snapseed With Modern Android Redesign After Years Of Silence

  • Press Desk
  • January 16, 2026
Read More
  • TechAdvisor

Disney+ To Launch Vertical Video Content To Boost Daily Engagement

  • Press Desk
  • January 16, 2026
Read More
  • TechAdvisor

US And Taiwan Sign $250 Billion Deal To Expand Semiconductor Production Stateside

  • webdesk
  • January 16, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Trending Posts
  • Most Fixed Broadband Operators Meet PTA Quality Standards In Q4 2025
    • January 17, 2026
  • PEC Chairman Outlines Vision For Graduate Engineer Trainee Placement Program
    • January 17, 2026
  • Mari Energies Launches Sovereign Cloud And AI Platform In Pakistan
    • January 17, 2026
  • Ahson Bin Saeed Takes Charge As CEO Of Raast Payments Pakistan
    • January 17, 2026
  • OpenAI Launches ChatGPT Health Amid Rising Debate Over AI In Healthcare
    • January 17, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.