CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • DFDI
  • PSEB
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Global Insights

GSI Technology’s Associative Processing Unit Challenges Nvidia’s AI GPU Leadership

  • October 29, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

GSI Technology is positioning its associative processing unit, or APU, as a potential alternative to traditional GPUs in artificial intelligence processing. The approach moves computation directly into memory, a shift that could improve speed and efficiency across AI workloads. The concept was explored in a new Cornell University study, published in the ACM journal and presented at the Micro ’25 conference, which analyzed how GSI’s Gemini-I APU performed against conventional CPUs and GPUs, including Nvidia’s A6000, on retrieval-augmented generation workloads.

The Cornell team tested datasets ranging from 10 to 200GB to simulate realistic AI inference scenarios. The results indicated that by embedding computation within static RAM, the APU can significantly reduce the back-and-forth data transfer between processor and memory — one of the biggest contributors to power consumption and latency in GPU-based architectures. This architectural difference allowed the APU to deliver comparable throughput to high-end GPUs while consuming dramatically less energy. According to GSI, the APU used up to 98 percent less energy than a standard GPU and completed retrieval operations up to 80 percent faster than high-end CPUs. These results highlight its potential for edge applications such as drones, robotics, IoT systems, and defense environments where energy efficiency and thermal constraints are critical.

GSI’s compute-in-memory technology has been under development for several years, but this independent academic validation provides new data points for the broader AI hardware community. While the technology promises major efficiency gains, experts note that it faces challenges in scaling to compete with the well-established GPU ecosystem. GPUs from vendors like Nvidia benefit from mature software frameworks, developer tools, and deep integration with AI platforms such as TensorFlow and PyTorch. In contrast, compute-in-memory devices still require extensive optimization work, and programming environments are not yet standardized, which could delay adoption in large-scale data centers and enterprise settings.

GSI Technology, however, remains confident about the scalability and future of its architecture. The company has already introduced a next-generation model, Gemini-II, which it claims delivers ten times higher throughput and lower latency compared to the first generation. In parallel, GSI is developing another design, known as Plato, aimed at embedded and edge systems requiring even faster compute performance under strict power budgets. Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology, said that Cornell’s findings validate the company’s long-standing vision for compute-in-memory. He emphasized that the APU delivers GPU-class performance at a fraction of the power cost, making it an attractive choice for memory-intensive AI inference workloads. Shu added that Gemini-II’s silicon demonstrates roughly ten times faster throughput and reduced latency, positioning the technology for a growing share of the global AI inference market, estimated at over $100 billion.

With further refinement and ecosystem development, compute-in-memory devices like the APU could play a meaningful role in reshaping how AI workloads are processed, balancing high performance with efficiency across emerging applications in both edge and enterprise computing.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI efficiency
  • AI hardware
  • APU
  • compute-in-memory
  • Cornell University
  • data centers
  • edge computing
  • Gemini-I
  • Gemini-II
  • GPU
  • GSI Technology
  • NVIDIA
  • Plato
Previous Article
  • Global Insights

US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks

  • October 29, 2025
Read More
You May Also Like
Read More
  • Global Insights

US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks

  • Press Desk
  • October 29, 2025
Read More
  • Global Insights

Google Achieves Quantum Computing Breakthrough With Algorithm Surpassing Supercomputer Capabilities

  • Press Desk
  • October 26, 2025
Read More
  • Global Insights

Trump Grants Presidential Pardon To Binance Founder Changpeng Zhao

  • Press Desk
  • October 25, 2025
Read More
  • Global Insights

China And India Dominate Global Telecom Rankings With Over 2.6 Billion Subscribers

  • Press Desk
  • October 22, 2025
Read More
  • Global Insights

China Launches Pakistan Remote-Sensing Satellite PRSS-2 Into Orbit With Lijian-1 Rocket

  • Press Desk
  • October 22, 2025
Read More
  • Global Insights

Global Internet Disruption as Major AWS Outage Knocks Out Popular Apps

  • webdesk
  • October 20, 2025
Read More
  • Global Insights

Türkiye Identifies Second-Largest Rare Earth Deposit in Eskisehir Province with 694 Million Tons of Reserves

  • Press Desk
  • October 18, 2025
Read More
  • Global Insights

Google Announces $15 Billion Investment To Establish India’s Largest AI And Data Hub

  • Press Desk
  • October 17, 2025
Trending Posts
  • US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks
    • October 29, 2025
  • Systems Limited Reports 46% Growth In Nine-Month Profit For FY25
    • October 29, 2025
  • PIA Transfers Precision Engineering Complex To Pakistan Air Force Entity
    • October 29, 2025
  • SMEDA Introduces AI Legal Support Platform For Pakistan’s SMEs In Collaboration With PakLawAssist
    • October 29, 2025
  • Microsoft Teams Wi-Fi Attendance Feature Set to Influence Hybrid Work Culture in Pakistan
    • October 29, 2025
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2025. Read Privacy Policy.

Input your search keywords and press Enter.