CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • TechAdvisor

DeepSeek V4 Pro Arrives With 1.6 Trillion Parameters At 98 Percent Less Than GPT-5.5 Pro, Hours After OpenAI’s Latest Launch

  • April 25, 2026
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

DeepSeek has released preview versions of DeepSeek-V4-Pro and DeepSeek-V4-Flash, its most capable models to date, arriving within hours of OpenAI’s GPT-5.5 launch in what the artificial intelligence community has interpreted as a pointed statement of timing from a Chinese lab that has spent three years operating under United States chip export restrictions. V4-Pro carries 1.6 trillion total parameters but activates only 49 billion per inference pass through the Mixture-of-Experts architecture DeepSeek has refined since V3, making it the largest open-weight model currently available, while V4-Flash carries 284 billion total parameters with 13 billion active, designed for speed and cost efficiency. Both models support one million token context windows as a standard feature and are available under a Massachusetts Institute of Technology licence on Hugging Face, free for anyone capable of running them locally.

The pricing gap with Western competitors is striking: V4-Pro costs USD 1.74 per million input tokens and USD 3.48 per million output tokens, compared to GPT-5.5 Pro at USD 30 input and USD 180 output per million tokens — a 98 percent cost difference on output. V4-Flash goes further at USD 0.14 input and USD 0.28 output, undercutting every comparable budget model from major frontier labs. Cline Chief Executive Officer Saoud Rizwan noted that if Uber had used DeepSeek instead of Claude, its 2026 artificial intelligence budget reportedly sized for four months of usage would have stretched to seven years. DeepSeek trained V4 partly on Huawei Ascend chips, directly circumventing United States export restrictions on Nvidia graphics processing units, and has indicated that once 950 new Huawei Ascend 950 supernodes come online later in 2026, the already-low pricing on V4-Pro will fall further.

The efficiency gains behind this pricing are architectural. DeepSeek developed two new attention mechanisms: Compressed Sparse Attention, which compresses groups of tokens then selects only the most relevant entries using a Lightning Indexer; and Heavily Compressed Attention, which collapses every 128 tokens into a single entry for an extremely cheap global view of long contexts. The result is that at one million tokens, V4-Pro uses only 27 percent of the compute its predecessor V3.2 required, while key-value cache memory drops to just 10 percent of V3.2. On benchmarks, V4-Pro-Max scored 90.2 percent on Apex Shortlist against Claude Opus 4.6’s 85.9 percent, matched Claude Opus 4.6 on SWE-Verified at 80.6 percent for resolving real GitHub issues, and ranked first among all open-weight models on GDPval-AA, an agentic real-world work benchmark covering finance, legal, and research tasks, scoring 1,554 Elo against Claude Opus 4.6’s 1,619. The models are text-only for now, with multimodal capabilities still in development, and the existing DeepSeek API endpoints will be retired on July 24, 2026.

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem.

Share
Tweet
Share
Share
Share
Related Topics
  • AI Model Pricing
  • China AI
  • DeepSeek V4
  • DeepSeek V4 Flash
  • DeepSeek V4 Pro
  • Huawei Ascend AI
  • LLM 2026
  • Mixture of Experts
  • Open Source AI
  • OpenAI GPT-5.5
Previous Article
  • PASHA News

PASHA Chairman Meets Punjab’s AI Office Advisor Ali Dar To Explore Collaboration On AI Training And Youth Employability Across The Province

  • April 25, 2026
Read More
Next Article
  • PayTech

LUMS CHISEL Lab Signs MoU With Allied Bank To Explore Robotics And Human-Robot Interaction In Banking Services

  • April 25, 2026
Read More
You May Also Like
Read More
  • TechAdvisor

Samsung Galaxy Book6 Edge Surfaces Online With Snapdragon X2 Elite Chip, OLED Display And 22-Hour Battery Life

  • Press Desk
  • April 25, 2026
Read More
  • TechAdvisor

How To Secure Your Laptop Data In 25 Minutes With Full Encryption And Privacy Controls

  • Press Desk
  • April 24, 2026
Read More
  • TechAdvisor

Samsung Galaxy S27 Rumours Specs Release Date Price Exynos 2700 Camera Upgrade TechAdvisor Report

  • Press Desk
  • April 23, 2026
Read More
  • TechAdvisor

DJI Osmo Pocket 4 Review Specs Features 4K 240fps Vlogging Camera

  • Press Desk
  • April 23, 2026
Read More
  • TechAdvisor

Humane AI Device Custom Assistant Context Aware Wearable AI Technology

  • Press Desk
  • April 23, 2026
Read More
  • TechAdvisor

Google Photos Face Editing Tools AI Portrait Editing Features Android Update

  • Press Desk
  • April 23, 2026
Read More
  • TechAdvisor

OpenAI Codex Chronicle Uses Screen Monitoring To Improve AI Context On Mac

  • Press Desk
  • April 21, 2026
Read More
  • TechAdvisor

Google AI Studio Usage Limits Increased For AI Pro And Ultra Subscribers With Gemini Models

  • Press Desk
  • April 21, 2026
Trending Posts
  • Pakistan Successfully Launches Indigenous EO-3 Electro-Optical Satellite From Taiyuan Launch Center In China
    • April 25, 2026
  • Iran-Linked Tasnim News Agency Maps Gulf Undersea Internet Cables In What Analysts Describe As A Strategic Signal To Arab Neighbours
    • April 25, 2026
  • PTA Publishes Mobile Network Experience Benchmarking Report For Q1 2026 In Collaboration With Opensignal Covering 15 Cities
    • April 25, 2026
  • PTA And ConnectHear Partner On International Girls In ICT Day 2026 To Advance Digital Inclusion For Women With Hearing Impairments
    • April 25, 2026
  • ITU Academy And UNDP Open Applications For Free Online Course On Data Governance For Inclusive Digital And AI Futures With May 31 Deadline
    • April 25, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.