CW Pakistan
  • Global Insights

GSI Technology’s Associative Processing Unit Challenges Nvidia’s AI GPU Leadership

  • October 29, 2025

GSI Technology is positioning its associative processing unit (APU) as a potential alternative to traditional GPUs for artificial intelligence processing. The approach moves computation directly into memory, a shift that could improve speed and efficiency across AI workloads. A new Cornell University study, published by ACM and presented at the MICRO ’25 conference, examined the concept by comparing GSI’s Gemini-I APU against conventional CPUs and GPUs, including Nvidia’s A6000, on retrieval-augmented generation (RAG) workloads.
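For readers unfamiliar with the workload, the retrieval step in retrieval-augmented generation is typically a similarity search over stored embedding vectors. The following is a minimal sketch of that idea in plain Python, not the Cornell benchmark code; the function names, document ids, and toy vectors are all hypothetical. It shows why the step is dominated by memory traffic: each query is scored against every stored vector.

```python
import heapq
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query, corpus, k=2):
    """Return the ids of the k stored vectors most similar to the query.

    Every stored vector is read once per query, which is the
    processor-to-memory traffic that compute-in-memory aims to avoid.
    """
    scored = ((cosine(query, vec), doc_id) for doc_id, vec in corpus.items())
    return [doc_id for _, doc_id in heapq.nlargest(k, scored)]


# Toy embeddings (hypothetical); real corpora hold millions of vectors.
corpus = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}
result = top_k([1.0, 0.0, 0.0], corpus, k=2)
print(result)
```

On a conventional processor, the vectors in `corpus` must travel from memory to the compute units for every query. A compute-in-memory design performs the comparison where the vectors are stored.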

The Cornell team tested datasets ranging from 10 GB to 200 GB to simulate realistic AI inference scenarios. The results indicated that by embedding computation within static RAM (SRAM), the APU can significantly reduce the back-and-forth data transfer between processor and memory, one of the biggest contributors to power consumption and latency in GPU-based architectures. This architectural difference allowed the APU to deliver throughput comparable to high-end GPUs while consuming far less energy. According to GSI, the APU used up to 98 percent less energy than a standard GPU and completed retrieval operations up to 80 percent faster than high-end CPUs. These results point to its potential for edge applications such as drones, robotics, IoT systems, and defense environments, where energy efficiency and thermal constraints are critical.
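The percentages quoted by GSI are easier to compare when converted to ratios. The short sketch below does that arithmetic; the helper name `reduction_to_ratio` is hypothetical, and the two readings of "80 percent faster" are assumptions, since the article does not say which one is meant.

```python
def reduction_to_ratio(percent_less):
    """Convert "uses X percent less" into "the baseline uses N times as much"."""
    if not 0 <= percent_less < 100:
        raise ValueError("percent_less must be in the range [0, 100)")
    return 100 / (100 - percent_less)


# "Up to 98 percent less energy": the APU uses 2 percent of the GPU's
# energy, so the GPU uses 50 times as much for the same work.
energy_ratio = reduction_to_ratio(98)

# "Up to 80 percent faster" can be read two ways (both are assumptions):
# (a) 80 percent less time per retrieval, which is a 5x speedup
time_cut_speedup = reduction_to_ratio(80)
# (b) 80 percent more retrievals per unit time, which is a 1.8x speedup
throughput_speedup = 1 + 80 / 100

print(energy_ratio, time_cut_speedup, throughput_speedup)
```

Both figures are upper bounds ("up to"), so the ratios describe the best reported case, not typical behavior.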

GSI’s compute-in-memory technology has been under development for several years, but this independent academic validation provides new data points for the broader AI hardware community. While the technology promises major efficiency gains, experts note that it faces challenges in scaling to compete with the well-established GPU ecosystem. GPUs from vendors like Nvidia benefit from mature software frameworks, developer tools, and deep integration with AI platforms such as TensorFlow and PyTorch. In contrast, compute-in-memory devices still require extensive optimization work, and programming environments are not yet standardized, which could delay adoption in large-scale data centers and enterprise settings.

GSI Technology, however, remains confident about the scalability and future of its architecture. The company has already introduced a next-generation model, Gemini-II, which it claims delivers roughly ten times the throughput of the first generation at lower latency. In parallel, GSI is developing another design, known as Plato, aimed at embedded and edge systems that need even faster compute performance under strict power budgets.

Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology, said that Cornell’s findings validate the company’s long-standing vision for compute-in-memory. He emphasized that the APU delivers GPU-class performance at a fraction of the power cost, making it an attractive choice for memory-intensive AI inference workloads. Shu added that the Gemini-II silicon positions the technology for a growing share of the global AI inference market, estimated at over $100 billion.

With further refinement and ecosystem development, compute-in-memory devices like the APU could play a meaningful role in reshaping how AI workloads are processed, balancing high performance with efficiency across emerging applications in both edge and enterprise computing.

Related Topics
  • AI efficiency
  • AI hardware
  • APU
  • compute-in-memory
  • Cornell University
  • data centers
  • edge computing
  • Gemini-I
  • Gemini-II
  • GPU
  • GSI Technology
  • NVIDIA
  • Plato
CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.
