CW Pakistan

DeepSeek V4 Pro Arrives With 1.6 Trillion Parameters At 98 Percent Lower Cost Than GPT-5.5 Pro, Hours After OpenAI’s Latest Launch

  • April 25, 2026

DeepSeek has released preview versions of DeepSeek-V4-Pro and DeepSeek-V4-Flash, its most capable models to date. The release came within hours of OpenAI’s GPT-5.5 launch, timing that the artificial intelligence community has interpreted as a pointed statement from a Chinese lab that has spent three years operating under United States chip export restrictions. V4-Pro carries 1.6 trillion total parameters but activates only 49 billion of them per token through the Mixture-of-Experts architecture DeepSeek has refined since V3, making it the largest open-weight model currently available. V4-Flash carries 284 billion total parameters with 13 billion active and is designed for speed and cost efficiency. Both models support one-million-token context windows as standard and are available on Hugging Face under the MIT licence, free for anyone capable of running them locally.
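To make the Mixture-of-Experts figures concrete, the short sketch below works out what share of each model's parameters is active at a time. It uses only the parameter counts quoted above; the dictionary layout and the function name `active_fraction` are illustrative choices, not anything published by DeepSeek.

```python
# Total and active parameter counts in billions, as quoted in the article.
MODELS = {
    "DeepSeek-V4-Pro": {"total_b": 1600, "active_b": 49},
    "DeepSeek-V4-Flash": {"total_b": 284, "active_b": 13},
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of a Mixture-of-Experts model's parameters that is active."""
    return active_b / total_b


for name, params in MODELS.items():
    print(f"{name}: {active_fraction(**params):.1%} of parameters active")
# DeepSeek-V4-Pro: 3.1% of parameters active
# DeepSeek-V4-Flash: 4.6% of parameters active
```

On these numbers, roughly 3 percent of V4-Pro's weights and under 5 percent of V4-Flash's take part in any single step, which is why a very large total parameter count does not translate directly into per-token compute.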

The pricing gap with Western competitors is striking. V4-Pro costs USD 1.74 per million input tokens and USD 3.48 per million output tokens, compared with USD 30 and USD 180 respectively for GPT-5.5 Pro, a difference of about 98 percent on output and about 94 percent on input. V4-Flash goes further at USD 0.14 input and USD 0.28 output per million tokens, undercutting every comparable budget model from major frontier labs. Cline Chief Executive Officer Saoud Rizwan noted that if Uber had used DeepSeek instead of Claude, its 2026 artificial intelligence budget, which reportedly covered only four months of usage, would have stretched to seven years. DeepSeek trained V4 partly on Huawei Ascend chips, directly circumventing United States export restrictions on Nvidia graphics processing units, and has indicated that once 950 new Huawei Ascend 950 supernodes come online later in 2026, the already-low pricing on V4-Pro will fall further.
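The percentages follow directly from the list prices. The sketch below reproduces them and prices one sample job; the prices are those quoted in the article, while the workload of 10 million input tokens and 2 million output tokens is a hypothetical figure chosen only for illustration, as are the helper names `saving` and `job_cost`.

```python
# Prices in USD per million tokens (input, output), as quoted in the article.
PRICES = {
    "DeepSeek-V4-Pro": (1.74, 3.48),
    "DeepSeek-V4-Flash": (0.14, 0.28),
    "GPT-5.5 Pro": (30.00, 180.00),
}


def saving(cheaper: float, pricier: float) -> float:
    """Fractional saving of the cheaper price relative to the pricier one."""
    return 1 - cheaper / pricier


def job_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one job at the model's list prices."""
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


pro_in, pro_out = PRICES["DeepSeek-V4-Pro"]
gpt_in, gpt_out = PRICES["GPT-5.5 Pro"]
print(f"Output saving: {saving(pro_out, gpt_out):.1%}")  # Output saving: 98.1%
print(f"Input saving:  {saving(pro_in, gpt_in):.1%}")    # Input saving:  94.2%

# Hypothetical workload: 10M input tokens and 2M output tokens.
for model in PRICES:
    print(f"{model}: USD {job_cost(model, 10_000_000, 2_000_000):.2f}")
```

For that sample workload, V4-Pro comes to USD 24.36 against USD 660.00 for GPT-5.5 Pro. The overall saving on a real job depends on its mix of input and output tokens, since the gap is wider on output than on input.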

The efficiency gains behind this pricing are architectural. DeepSeek developed two new attention mechanisms. Compressed Sparse Attention compresses groups of tokens and then uses a Lightning Indexer to select only the most relevant entries. Heavily Compressed Attention collapses every 128 tokens into a single entry, giving an extremely cheap global view of long contexts. As a result, at one million tokens V4-Pro uses only 27 percent of the compute its predecessor V3.2 required, while key-value cache memory drops to just 10 percent of V3.2’s. On benchmarks, V4-Pro-Max scored 90.2 percent on Apex Shortlist against Claude Opus 4.6’s 85.9 percent and matched Claude Opus 4.6 at 80.6 percent on SWE-Verified, which tests the resolution of real GitHub issues. It also ranked first among open-weight models on GDPval-AA, an agentic real-world work benchmark covering finance, legal, and research tasks, with 1,554 Elo against Claude Opus 4.6’s 1,619. The models are text-only for now, with multimodal capabilities still in development, and the existing DeepSeek API endpoints will be retired on July 24, 2026.
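The two ideas described here, compressing groups of tokens and then keeping only the most relevant compressed entries, can be illustrated with a toy sketch. This is not DeepSeek's implementation: mean pooling stands in for the learned compression, a plain dot-product score stands in for the Lightning Indexer, and the group size of 4 and top-k of 8 are hypothetical values. Only the 128-token block size comes from the article.

```python
import math


def compress(keys, block):
    """Mean-pool each consecutive group of `block` key vectors into one entry.

    Assumption: mean pooling is a stand-in for a learned compression step.
    """
    entries = []
    for start in range(0, len(keys), block):
        group = keys[start:start + block]
        dim = len(group[0])
        entries.append([sum(v[d] for v in group) / len(group) for d in range(dim)])
    return entries


def top_k_entries(query, entries, k):
    """Return the indices (in order) of the k entries scoring highest on the query.

    Assumption: a dot product stands in for the indexer's relevance score.
    """
    scores = [sum(q * e for q, e in zip(query, entry)) for entry in entries]
    ranked = sorted(range(len(entries)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Toy sequence of 1,024 two-dimensional key vectors and one query.
keys = [[math.sin(i / 50), math.cos(i / 50)] for i in range(1024)]
query = [1.0, 0.0]

sparse_view = compress(keys, block=4)             # hypothetical group size
selected = top_k_entries(query, sparse_view, k=8) # hypothetical top-k
global_view = compress(keys, block=128)           # 128 tokens per entry

print(len(sparse_view), len(selected), len(global_view))  # 256 8 8
print(math.ceil(1_000_000 / 128))                         # 7813
```

In this toy setting the query attends to 8 selected entries plus 8 global entries instead of 1,024 raw keys. At the 128-to-1 ratio, a one-million-token context reduces to 7,813 global entries, which gives a sense of where the compute and cache savings reported above could come from.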

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.
