CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Global Insights

GSI Technology’s Associative Processing Unit Challenges Nvidia’s AI GPU Leadership

  • October 29, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

GSI Technology is positioning its associative processing unit, or APU, as a potential alternative to traditional GPUs in artificial intelligence processing. The approach moves computation directly into memory, a shift that could improve speed and efficiency across AI workloads. The concept was explored in a new Cornell University study, published in the ACM journal and presented at the Micro ’25 conference, which analyzed how GSI’s Gemini-I APU performed against conventional CPUs and GPUs, including Nvidia’s A6000, on retrieval-augmented generation workloads.

The Cornell team tested datasets ranging from 10 to 200GB to simulate realistic AI inference scenarios. The results indicated that by embedding computation within static RAM, the APU can significantly reduce the back-and-forth data transfer between processor and memory — one of the biggest contributors to power consumption and latency in GPU-based architectures. This architectural difference allowed the APU to deliver comparable throughput to high-end GPUs while consuming dramatically less energy. According to GSI, the APU used up to 98 percent less energy than a standard GPU and completed retrieval operations up to 80 percent faster than high-end CPUs. These results highlight its potential for edge applications such as drones, robotics, IoT systems, and defense environments where energy efficiency and thermal constraints are critical.

GSI’s compute-in-memory technology has been under development for several years, but this independent academic validation provides new data points for the broader AI hardware community. While the technology promises major efficiency gains, experts note that it faces challenges in scaling to compete with the well-established GPU ecosystem. GPUs from vendors like Nvidia benefit from mature software frameworks, developer tools, and deep integration with AI platforms such as TensorFlow and PyTorch. In contrast, compute-in-memory devices still require extensive optimization work, and programming environments are not yet standardized, which could delay adoption in large-scale data centers and enterprise settings.

GSI Technology, however, remains confident about the scalability and future of its architecture. The company has already introduced a next-generation model, Gemini-II, which it claims delivers ten times higher throughput and lower latency compared to the first generation. In parallel, GSI is developing another design, known as Plato, aimed at embedded and edge systems requiring even faster compute performance under strict power budgets. Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology, said that Cornell’s findings validate the company’s long-standing vision for compute-in-memory. He emphasized that the APU delivers GPU-class performance at a fraction of the power cost, making it an attractive choice for memory-intensive AI inference workloads. Shu added that Gemini-II’s silicon demonstrates roughly ten times faster throughput and reduced latency, positioning the technology for a growing share of the global AI inference market, estimated at over $100 billion.

With further refinement and ecosystem development, compute-in-memory devices like the APU could play a meaningful role in reshaping how AI workloads are processed, balancing high performance with efficiency across emerging applications in both edge and enterprise computing.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI efficiency
  • AI hardware
  • APU
  • compute-in-memory
  • Cornell University
  • data centers
  • edge computing
  • Gemini-I
  • Gemini-II
  • GPU
  • GSI Technology
  • NVIDIA
  • Plato
Previous Article
  • Global Insights

US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks

  • October 29, 2025
Read More
Next Article
  • Wired

Meta Launches Instagram Teen Accounts In Pakistan: A Step Towards Safer Digital Spaces

  • October 29, 2025
Read More
You May Also Like
Read More
  • Global Insights

AI Data Centres Raising Ground Temperatures by Up to 9 Degrees Study Finds

  • Press Desk
  • June 11, 2026
Read More
  • Global Insights

FIFA World Cup 2026 Deploys AI Smart Ball 3D Player Avatars and Referee View

  • Press Desk
  • June 11, 2026
Read More
  • Global Insights

US Adds Alibaba Baidu BYD and Nio to Pentagon Military Company List

  • Press Desk
  • June 10, 2026
Read More
  • Global Insights

Bahrain Migrates Islamiyat App Services to MyGov Platform

  • Press Desk
  • June 9, 2026
Read More
  • Global Insights

Anthropic Calls on AI Labs to Build Framework for Pausing AI Development

  • Press Desk
  • June 9, 2026
Read More
  • Global Insights

Uber and Wayve to Launch London First Robotaxis With AI Self-Driving Technology

  • Press Desk
  • June 9, 2026
Read More
  • Global Insights

Nvidia CEO Jensen Huang Signs Major Deals With SK Hynix Naver and Doosan in South Korea

  • Press Desk
  • June 9, 2026
Read More
  • Global Insights

NASA Reverses ISS Evacuation Order After Russia Seals Air Leak

  • Press Desk
  • June 7, 2026
Trending Posts
  • LUMS and EnablifyAI Sign MOU for AI Industry Academia Collaboration
    • June 12, 2026
  • Pakistan Digital Authority and PPRA Form Working Group to Digitise Public Procurement
    • June 12, 2026
  • CITADEL Pakistan and AIgilityX Host Sovereign AI Executive Briefing on June 15
    • June 12, 2026
  • Punjab Ends Free Public Transport Subsidy as Regular Fares Resume June 13
    • June 12, 2026
  • PAFLA Online Session on Safety Net for Pakistani Freelancers and Entrepreneurs
    • June 12, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.