CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • TechAdvisor

xAI’s Grok-2 and Grok-2 Mini Get Major Speed Boost, Grok-2 Claims #2 Spot in AI Model Performance

  • August 27, 2024
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Elon Musk’s xAI has taken significant strides in improving the performance of its Grok-2 large language model (LLM) chatbot. In just three days, xAI developers completely rewrote the inference code stack using SGLang, an open-source system designed for efficient execution of complex language models. This resulted in a dramatic speed increase for both the full Grok-2 model and the streamlined Grok-2 Mini.

The improvements were announced by xAI developer Igor Babuschkin on the social network X. He revealed that Grok-2 Mini is now twice as fast as it was previously. This impressive feat is attributed to the collaborative effort of Lianmin Zheng and Saeed Maleki, who rewrote the inference code using SGLang.

SGLang’s efficiency extends beyond speed. It also enables xAI to serve the larger Grok-2 model, which requires multi-host inference, at a significantly faster pace. Additionally, both models experienced a slight increase in accuracy alongside the speed boost.

Developed by a team from Stanford University, UC Berkeley, Texas A&M, and Shanghai Jiao Tong University, SGLang offers a versatile platform. It supports a wide range of models, including Llama, Mistral, and LLaVA, and is compatible with open-weight and API-based models like OpenAI’s GPT-4. The system’s strength lies in its ability to optimize execution through automatic cache reuse and program-level parallelism.

This performance boost is accompanied by impressive results on the third-party Lmsys Chatbot Arena leaderboard, which benchmarks AI model performance. The full Grok-2 model has secured the number two spot with an Arena Score of 1293, placing it alongside Google’s Gemini-1.5 Pro and just behind OpenAI’s latest ChatGPT-4o.

Grok-2 Mini, also benefiting from the recent enhancements, climbed to the number five position with an Arena Score of 1268, trailing only GPT-4o mini and Claude 3.5 Sonnet. Notably, both Grok-2 and Grok-2 Mini are proprietary models developed by xAI, showcasing the company’s dedication to pushing the boundaries of AI technology.

Grok-2 has established itself as a leader, particularly in mathematical tasks, where it currently holds the top spot. The model also demonstrates strong performance across various categories, including Hard Prompts, Coding, and Instruction-Following, consistently ranking near the top. This performance surpasses prominent models like OpenAI’s GPT-4o (May 2024), which now sits at number four.

According to Babuschkin, the primary advantage of Grok-2 Mini lies in its enhanced speed. However, he assures further advancements are underway to make it even faster, catering to users who prioritize high performance without significant computational resources.

The addition of Grok-2 and Grok-2 Mini to the Lmsys Chatbot Arena leaderboard and their subsequent performance have garnered significant attention within the AI community. These achievements highlight xAI’s ongoing commitment to innovation and its relentless pursuit of pushing the boundaries of AI capabilities. As xAI continues to refine its models, we can expect further improvements in speed, accuracy, and overall performance, ensuring Grok-2 and Grok-2 Mini remain at the forefront of AI development.

Share
Tweet
Share
Share
Share
Previous Article
  • Ignite

Ultimate Guide to Finding Investors: VC Firms, Angel Investors, and More 

  • August 27, 2024
Read More
Next Article
  • Ignite

Pakistani Startups Make Forbes’ Most Innovative Companies List

  • August 27, 2024
Read More
You May Also Like
Read More
  • TechAdvisor

OnePlus 15R Gets Surprise 16GB RAM Variant Despite Memory Shortage

  • Press Desk
  • June 19, 2026
Read More
  • TechAdvisor

WhatsApp Tests View Once Feature for Regular Text Messages

  • Press Desk
  • June 19, 2026
Read More
  • TechAdvisor

OpenAI Launches Scheduled Tasks Hub for ChatGPT

  • Press Desk
  • June 18, 2026
Read More
  • TechAdvisor

Google Discontinues Nest Mini and Nest Audio Smart Speakers

  • Press Desk
  • June 18, 2026
Read More
  • TechAdvisor

Google Launches Android 17 and Wear OS 7 With Gemini Omni and Bubble Bar

  • Press Desk
  • June 18, 2026
Read More
  • TechAdvisor

Microsoft Launches Surface Pro 12 and Surface Laptop With Snapdragon X2

  • Press Desk
  • June 17, 2026
Read More
  • TechAdvisor

OnePlus N6 Pakistan Release Expected Soon

  • Press Desk
  • June 16, 2026
Read More
  • TechAdvisor

WhatsApp Web Beta Adds Group Voice and Video Calls for Up to 32 Participants

  • Press Desk
  • June 16, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Trending Posts
  • Clarification on the Amendment Pakistan Telecom Reorganisation Act
    • June 19, 2026
  • Saudi Arabia NHC Innovation Signs AI Smart City Deals With Huawei Lenovo and ByteDance
    • June 19, 2026
  • CCP And Pakistan Digital Authority Explore Collaboration On Digital Markets
    • June 19, 2026
  • UAE Becomes First Arab Nation To Ban Social Media For Under 15s
    • June 19, 2026
  • NITB And MCI Launch Digital Birth And Death Registration Through Pak App
    • June 19, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.