CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Global Insights

GSI Technology’s Associative Processing Unit Challenges Nvidia’s AI GPU Leadership

  • October 29, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

GSI Technology is positioning its associative processing unit, or APU, as a potential alternative to traditional GPUs in artificial intelligence processing. The approach moves computation directly into memory, a shift that could improve speed and efficiency across AI workloads. The concept was explored in a new Cornell University study, published in the ACM journal and presented at the Micro ’25 conference, which analyzed how GSI’s Gemini-I APU performed against conventional CPUs and GPUs, including Nvidia’s A6000, on retrieval-augmented generation workloads.

The Cornell team tested datasets ranging from 10 to 200GB to simulate realistic AI inference scenarios. The results indicated that by embedding computation within static RAM, the APU can significantly reduce the back-and-forth data transfer between processor and memory — one of the biggest contributors to power consumption and latency in GPU-based architectures. This architectural difference allowed the APU to deliver comparable throughput to high-end GPUs while consuming dramatically less energy. According to GSI, the APU used up to 98 percent less energy than a standard GPU and completed retrieval operations up to 80 percent faster than high-end CPUs. These results highlight its potential for edge applications such as drones, robotics, IoT systems, and defense environments where energy efficiency and thermal constraints are critical.

GSI’s compute-in-memory technology has been under development for several years, but this independent academic validation provides new data points for the broader AI hardware community. While the technology promises major efficiency gains, experts note that it faces challenges in scaling to compete with the well-established GPU ecosystem. GPUs from vendors like Nvidia benefit from mature software frameworks, developer tools, and deep integration with AI platforms such as TensorFlow and PyTorch. In contrast, compute-in-memory devices still require extensive optimization work, and programming environments are not yet standardized, which could delay adoption in large-scale data centers and enterprise settings.

GSI Technology, however, remains confident about the scalability and future of its architecture. The company has already introduced a next-generation model, Gemini-II, which it claims delivers ten times higher throughput and lower latency compared to the first generation. In parallel, GSI is developing another design, known as Plato, aimed at embedded and edge systems requiring even faster compute performance under strict power budgets. Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology, said that Cornell’s findings validate the company’s long-standing vision for compute-in-memory. He emphasized that the APU delivers GPU-class performance at a fraction of the power cost, making it an attractive choice for memory-intensive AI inference workloads. Shu added that Gemini-II’s silicon demonstrates roughly ten times faster throughput and reduced latency, positioning the technology for a growing share of the global AI inference market, estimated at over $100 billion.

With further refinement and ecosystem development, compute-in-memory devices like the APU could play a meaningful role in reshaping how AI workloads are processed, balancing high performance with efficiency across emerging applications in both edge and enterprise computing.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI efficiency
  • AI hardware
  • APU
  • compute-in-memory
  • Cornell University
  • data centers
  • edge computing
  • Gemini-I
  • Gemini-II
  • GPU
  • GSI Technology
  • NVIDIA
  • Plato
Previous Article
  • Global Insights

US And Japan Secure Rare Earths Supply Deal Ahead Of Trump-Xi Talks

  • October 29, 2025
Read More
Next Article
  • Wired

Meta Launches Instagram Teen Accounts In Pakistan: A Step Towards Safer Digital Spaces

  • October 29, 2025
Read More
You May Also Like
Read More
  • Global Insights

KERNO Launches UAE First Enterprise IT Hardware Manufacturing Plant

  • Press Desk
  • June 30, 2026
Read More
  • Global Insights

WIPO 2026 Innovation Cluster Rankings Coming In September

  • Press Desk
  • June 30, 2026
Read More
  • Global Insights

Australia Doubles Social Media Ban Penalty to AUD 99 Million Amid Compliance Failures

  • Press Desk
  • June 29, 2026
Read More
  • Global Insights

Alef Education Promotes Digital Learning During Summer Break

  • Press Desk
  • June 26, 2026
Read More
  • Global Insights

SpaceX Shares Recover After Post-IPO Selloff Wiped $600 Billion

  • Press Desk
  • June 26, 2026
Read More
  • Global Insights

Forbes 2026 Top Creators List Reveals Over $1 Billion In Combined Creator Earnings

  • webdesk
  • June 25, 2026
Read More
  • Global Insights

China LineShine Supercomputer Tops TOP500 Rankings With 2.198 Exaflops And Domestic Chips

  • Press Desk
  • June 25, 2026
Read More
  • Global Insights

Alibaba Sues US Department Of Defense Over Chinese Military Company Label

  • Press Desk
  • June 25, 2026
Trending Posts
  • Pakistan 5G Economy
    PTCL Names Mohammad Nadeem Khan Interim CEO For Two Weeks
    • July 2, 2026
  • Pakistan Expands EV Infrastructure With Ultra-Fast Charging Stations On Major Motorway
    • July 2, 2026
  • PTA Announces Temporary DIRBS Shutdown For System Upgrade
    • July 2, 2026
  • Ignite Celebrates Graduation Of 30 Engineers From IC Design And Verification Programme
    • July 2, 2026
  • KP Introduces 24/7 Integrated Emergency Helpline For Faster Public Response
    • July 2, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.