Tuesday, December 16, 2025
25.4 F
New York

GSI’s bold new technology merges compute and memory

GSI Gemini-I APU
(Image credit: TechPowerUp)

  • GSI Gemini-I APU reduces constant data shuffling between the processor and memory systems
  • Completes retrieval tasks up to 80% faster than comparable CPUs
  • GSI Gemini-II APU will deliver ten times higher throughput

GSI Technology is promoting a new approach to artificial intelligence processing that places computation directly within memory.

A new study by Cornell University draws attention to this design, known as the associative processing unit (APU).

It aims to overcome long-standing performance and efficiency limits, suggesting it could challenge the dominance of the best GPUs currently used in AI tools and data centers.

A new contender in AI hardware

Published in the ACM journal and presented at the recent Micro ’25 conference, the Cornell research evaluated GSI’s Gemini-I APU against leading CPUs and GPUs, including Nvidia’s A6000, using retrieval-augmented generation (RAG) workloads.

The tests spanned datasets from 10 to 200GB, representing realistic AI inference conditions.

By performing computation within static RAM, the APU reduces the constant data shuffling between the processor and memory.

This is a key source of energy loss and latency in conventional GPU architectures.

Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

The results showed the APU could achieve GPU-class throughput while consuming far less power.

GSI reported its APU used up to 98% less energy than a standard GPU and completed retrieval tasks up to 80% faster than comparable CPUs.

Such efficiency could make it appealing for edge devices such as drones, IoT systems, and robotics, as well as for defense and aerospace use, where energy and cooling limits are strict.

Despite these findings, it remains unclear whether compute-in-memory technology can scale to the same level of maturity and support enjoyed by the best GPU platforms.

GPUs currently benefit from well-developed software ecosystems that allow seamless integration with major AI tools.

For compute-in-memory devices, optimization and programming remain emerging areas that could slow broader adoption, especially in large data center operations.

GSI Technology says it is continuing to refine its hardware, with the Gemini-II generation expected to deliver ten times higher throughput and lower latency.

Another design, named Plato, is in development to further extend compute performance for embedded edge systems.

“Cornell’s independent validation confirms what we’ve long believed, compute-in-memory has the potential to disrupt the $100 billion AI inference market,” said Lee-Lean Shu, Chairman and Chief Executive Officer of GSI Technology.

“The APU delivers GPU-class performance at a fraction of the energy cost, thanks to its highly efficient memory-centric architecture. Our recently released second-generation APU silicon, Gemini-II, can deliver roughly 10x faster throughput and even lower latency for memory-intensive AI workloads.”

Via TechPowerUp


Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

Efosa has been writing about technology for over 7 years, initially driven by curiosity but now fueled by a strong passion for the field. He holds both a Master’s and a PhD in sciences, which provided him with a solid foundation in analytical thinking.

Hot this week

Stop avoiding your bank balance and other ways to manage your money better

BBC We've all looked at our bank account and wondered...

Railways: Firms develop new tech to electrify trains

'This is the big one' - tech firms bet...

UK targets 420m at sky high industry energy bills

£420m bill cut for heavy industry as union attacks...

Apple claims ‘tremendous’ global uptake of latest iPhones

Danielle KayeBusiness reporter Reuters Apple boss Tim Cook holds an iPhone...

Trump hails ‘amazing’ meeting with Xi in South Korea

Trump hails 'amazing' meeting with China's Xi but no...

Topics

Stop avoiding your bank balance and other ways to manage your money better

BBC We've all looked at our bank account and wondered...

Railways: Firms develop new tech to electrify trains

'This is the big one' - tech firms bet...

UK targets 420m at sky high industry energy bills

£420m bill cut for heavy industry as union attacks...

Apple claims ‘tremendous’ global uptake of latest iPhones

Danielle KayeBusiness reporter Reuters Apple boss Tim Cook holds an iPhone...

Trump hails ‘amazing’ meeting with Xi in South Korea

Trump hails 'amazing' meeting with China's Xi but no...

Ofcom slams O2 over unexpected mobile phone contract price rise

Imran Rahman-JonesTechnology reporter The UK's media regulator has criticised O2...

Virgin cleared to challenge Eurostar on Channel Tunnel route

Charlotte EdwardsBusiness reporter Virgin Trains has moved closer to being...

US and China’s different reports of their trade meeting

Skip to content British Broadcasting Corporation Home News Sport Business Innovation Culture Arts Travel Earth Audio Video Live More on this story. 23 hours...

Related Articles

Popular Categories