CW Pakistan

xAI’s Grok-2 and Grok-2 Mini Get Major Speed Boost, Grok-2 Claims #2 Spot in AI Model Performance

  • August 27, 2024

Elon Musk’s xAI has sharply improved the speed of its Grok-2 large language model (LLM) chatbot. In just three days, xAI developers completely rewrote the inference stack using SGLang, an open-source system designed for efficient execution of complex language model programs. The rewrite made both the full Grok-2 model and the smaller Grok-2 Mini considerably faster.

xAI developer Igor Babuschkin announced the improvements on the social network X, saying that Grok-2 Mini is now twice as fast as it was previously. He credited Lianmin Zheng and Saeed Maleki, who rewrote the inference code using SGLang.

The benefits are not limited to the smaller model. The new stack also lets xAI serve the larger Grok-2 model, which requires multi-host inference, at a significantly faster pace. In addition, both models became slightly more accurate.

SGLang was developed by a team from Stanford University, UC Berkeley, Texas A&M, and Shanghai Jiao Tong University. It supports open-weight models such as Llama, Mistral, and LLaVA, and it also works with API-based models like OpenAI’s GPT-4. The system’s strength lies in optimizing execution through automatic cache reuse and program-level parallelism.
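The idea behind automatic cache reuse can be illustrated with a toy example. The sketch below is not SGLang code and does not use its API; the names `PrefixCache` and `serve` are hypothetical, and a whitespace split stands in for a real tokenizer. It only shows the general principle: when two requests share a leading run of tokens (for instance, the same system prompt), work done for that prefix on the first request does not have to be repeated on the second.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Maps the next token to the child node that extends this prefix.
    children: dict = field(default_factory=dict)


class PrefixCache:
    """Toy trie recording which token prefixes have already been processed."""

    def __init__(self):
        self.root = Node()

    def match_prefix(self, tokens):
        """Return how many leading tokens are already in the cache."""
        node, matched = self.root, 0
        for tok in tokens:
            child = node.children.get(tok)
            if child is None:
                break
            node = child
            matched += 1
        return matched

    def insert(self, tokens):
        """Record the token sequence (and thus all of its prefixes)."""
        node = self.root
        for tok in tokens:
            node = node.children.setdefault(tok, Node())


def serve(cache, prompt):
    """Pretend to run one request; return (reused, computed) token counts."""
    tokens = prompt.split()  # assumption: whitespace split, not a real tokenizer
    reused = cache.match_prefix(tokens)
    cache.insert(tokens)
    return reused, len(tokens) - reused


cache = PrefixCache()
system = "You are a helpful assistant ."
first = serve(cache, system + " What is 2 + 2 ?")
second = serve(cache, system + " Name a prime number .")
print(first, second)
```

In this toy run the second request finds the six shared system-prompt tokens already cached and only has to process its own five new tokens. A real inference engine caches the model's intermediate attention state rather than bare tokens, but the bookkeeping follows the same shape.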

The speed boost comes alongside strong results on the third-party LMSYS Chatbot Arena leaderboard, which ranks AI models by performance. The full Grok-2 model has secured the number two spot with an Arena Score of 1293, placing it alongside Google’s Gemini-1.5 Pro and just behind OpenAI’s latest ChatGPT-4o.

Grok-2 Mini, which also benefited from the recent enhancements, climbed to the number five position with an Arena Score of 1268, just behind GPT-4o mini and Claude 3.5 Sonnet. Both Grok-2 and Grok-2 Mini are proprietary models developed by xAI.
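To put the two scores in perspective, a gap in Arena Score can be translated into a rough head-to-head win rate. The sketch below assumes the scores sit on a conventional Elo-style scale (base 10, 400-point divisor); that scale is an assumption made here for illustration, not something stated in the announcement, and the function name `expected_win_rate` is hypothetical.

```python
def expected_win_rate(score_a, score_b):
    """Expected share of head-to-head wins for A over B on an Elo-style scale."""
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / 400.0))


# Arena Scores quoted in the article: Grok-2 (1293) vs. Grok-2 Mini (1268).
p = expected_win_rate(1293, 1268)
print(f"Grok-2 expected win rate vs. Grok-2 Mini: {p:.3f}")
```

Under that assumption, the 25-point gap corresponds to Grok-2 being preferred in a little over half of direct comparisons, which suggests that neighboring leaderboard positions reflect fairly small differences in user preference.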

Grok-2 is particularly strong in mathematical tasks, where it currently holds the top spot. It also ranks near the top in other categories, including Hard Prompts, Coding, and Instruction-Following. Overall, it places ahead of prominent models such as OpenAI’s GPT-4o (May 2024), which now sits at number four.

According to Babuschkin, the primary advantage of Grok-2 Mini is its speed. He says further work is underway to make it even faster, which should suit users who want high performance without significant computational resources.

The addition of Grok-2 and Grok-2 Mini to the LMSYS Chatbot Arena leaderboard, and their rankings there, have drawn significant attention within the AI community. As xAI continues to refine its models, further improvements in speed, accuracy, and overall performance can be expected.

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.