CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Global Insights

GSI Technology’s Associative Processing Unit Challenges Nvidia’s AI GPU Leadership

  • October 29, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

GSI Technology is positioning its associative processing unit, or APU, as a potential alternative to traditional GPUs in artificial intelligence processing. The approach moves computation directly into memory, a shift that could improve speed and efficiency across AI workloads. The concept was explored in a new Cornell University study, published in the ACM journal and presented at the Micro ’25 conference, which analyzed how GSI’s Gemini-I APU performed against conventional CPUs and GPUs, including Nvidia’s A6000, on retrieval-augmented generation workloads.

The Cornell team tested datasets ranging from 10 to 200GB to simulate realistic AI inference scenarios. The results indicated that by embedding computation within static RAM, the APU can significantly reduce the back-and-forth data transfer between processor and memory — one of the biggest contributors to power consumption and latency in GPU-based architectures. This architectural difference allowed the APU to deliver comparable throughput to high-end GPUs while consuming dramatically less energy. According to GSI, the APU used up to 98 percent less energy than a standard GPU and completed retrieval operations up to 80 percent faster than high-end CPUs. These results highlight its potential for edge applications such as drones, robotics, IoT systems, and defense environments where energy efficiency and thermal constraints are critical.

GSI’s compute-in-memory technology has been under development for several years, but this independent academic validation provides new data points for the broader AI hardware community. While the technology promises major efficiency gains, experts note that it faces challenges in scaling to compete with the well-established GPU ecosystem. GPUs from vendors like Nvidia benefit from mature software frameworks, developer tools, and deep integration with AI platforms such as TensorFlow and PyTorch. In contrast, compute-in-memory devices still require extensive optimization work, and programming environments are not yet standardized, which could delay adoption in large-scale data centers and enterprise settings.

GSI Technology, however, remains confident about the scalability and future of its architecture. The company has already introduced a next-generation model, Gemini-II, which it claims delivers ten times higher throughput and lower latency compared to the first generation. In parallel, GSI is developing another design, known as Plato, aimed at embedded and edge systems requiring even faster compute performance under strict power budgets. Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology, said that Cornell’s findings validate the company’s long-standing vision for compute-in-memory. He emphasized that the APU delivers GPU-class performance at a fraction of the power cost, making it an attractive choice for memory-intensive AI inference workloads. Shu added that Gemini-II’s silicon demonstrates roughly ten times faster throughput and reduced latency, positioning the technology for a growing share of the global AI inference market, estimated at over $100 billion.

With further refinement and ecosystem development, compute-in-memory devices like the APU could play a meaningful role in reshaping how AI workloads are processed, balancing high performance with efficiency across emerging applications in both edge and enterprise computing.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI efficiency
  • AI hardware
  • APU
  • compute-in-memory
  • Cornell University
  • data centers
  • edge computing
  • Gemini-I
  • Gemini-II
  • GPU
  • GSI Technology
  • NVIDIA
  • Plato
Previous Article
  • Global Insights

US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks

  • October 29, 2025
Read More
Next Article
  • Wired

Meta Launches Instagram Teen Accounts In Pakistan: A Step Towards Safer Digital Spaces

  • October 29, 2025
Read More
You May Also Like
Read More
  • Global Insights

Faz Australia’s MedTalk AI Secures ACT Health Pilot Contract For AI-Powered Medical Scribe Platform

  • Press Desk
  • April 8, 2026
Read More
  • Global Insights

Iran Threatens To Strike $30 Billion Stargate AI Data Center Backed By OpenAI And Nvidia In The UAE

  • Press Desk
  • April 8, 2026
Read More
  • Global Insights

CIA Uses Ghost Murmur AI Powered Technology To Detect Heartbeats And Rescue Downed Airman

  • Press Desk
  • April 8, 2026
Read More
  • Global Insights

Iran University Claims US Israel Attack Targeted AI Research And Scientific Progress

  • Press Desk
  • April 8, 2026
Read More
  • Global Insights

UAE Launches Commercial Upper 6GHz Ecosystem At SAMENA Leaders’ Summit 2026

  • Press Desk
  • April 7, 2026
Read More
  • Global Insights

US And UAE Deepen AI Partnership To Strengthen Global Tech Leadership Amid Regional Tensions

  • Press Desk
  • April 7, 2026
Read More
  • Global Insights

Chinese Robotics Firm UBTECH Offers Up To $18 Million Annual Salary To Recruit Chief Scientist

  • Press Desk
  • April 6, 2026
Read More
  • Global Insights

Operation Epic Fury: How The Pentagon’s Project Maven AI System Is Reshaping Modern Warfare Against Iran

  • Press Desk
  • April 6, 2026
Trending Posts
  • KP Government To Launch Electric Scooter Scheme For Women And Female Students
    • April 9, 2026
  • HEC And Huawei Reaffirm Partnership To Advance AI, Cloud Infrastructure And Digital Education In Pakistan
    • April 9, 2026
  • Realme 16 Pro Series 5G Launches In Pakistan With 200MP Camera, Snapdragon 7 Gen 4 And 7000mAh Battery
    • April 9, 2026
  • Pakistan Conference At Harvard 2026: Speakers, Agenda, And Everything You Need To Know
    • April 9, 2026
  • PM Fuel Subsidy Scheme Available On PakApp For Motorcyclists With Digital Registration
    • April 9, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.