CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • TechAdvisor

Google EmbeddingGemma Leads Small Parameter Embedding Models With On-Device AI

  • September 6, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Google has expanded its Gemma family of models with the launch of EmbeddingGemma, an open-source embedding model designed for on-device use across smartphones, laptops, and desktops. Based on the Gemma 3 architecture, EmbeddingGemma is a 308 million parameter model trained on more than 100 languages and tailored to deliver efficient, private, and high-quality embeddings. According to Google DeepMind’s Min Choi, product manager, and Sahil Dua, lead research engineer, the model is built to integrate seamlessly with widely used tools such as Ollama, llama.cpp, MLX, LiteRT, LMStudio, LangChain, LlamaIndex, and Cloudflare, making it highly adaptable for developers seeking to deploy AI applications locally.

EmbeddingGemma has demonstrated strong results on the Massive Text Embedding Benchmark (MTEB) multilingual v2, where it ranked as the top-performing model under 500 million parameters. This performance underscores Google’s focus on delivering models that can run natively on personal hardware without requiring cloud dependency. The model’s design also supports customizable output dimensions and allows developers to apply it for a range of use cases, including Retrieval Augmented Generation (RAG) and semantic search. These capabilities position EmbeddingGemma as a tool that enables efficient AI-powered applications directly on user devices, ensuring privacy and functionality even in offline environments.

One of the most significant applications of EmbeddingGemma is its role in enabling mobile RAG pipelines. Traditionally, RAG systems rely on cloud or on-premises infrastructure to process embeddings and generate context-aware responses. By shifting this capability to devices like laptops and smartphones, enterprises can empower employees to access and query information directly through their local hardware. This approach allows for faster, more secure interactions with data, while reducing reliance on internet connectivity. Choi and Dua emphasized that the quality of the initial retrieval step is crucial in such pipelines, noting that poor embeddings can lead to irrelevant or inaccurate answers. EmbeddingGemma addresses this challenge with its high-quality representations, which enhance the reliability of on-device RAG systems.

To achieve this flexibility, Google introduced a method called Matryoshka Representation Learning within EmbeddingGemma. This allows developers to choose between different embedding vector sizes depending on their needs. For instance, developers may use the full 768-dimension vector for detailed tasks or opt for smaller dimensions to prioritize speed and efficiency. This adaptability makes the model suitable for diverse scenarios, from advanced enterprise applications to lightweight mobile solutions. The release also reflects growing interest in the embedding model space, where Google faces competition from Cohere’s Embed 4, Mistral’s Codestral Embed, OpenAI’s Text Embedding 3 Large, and Qodo’s Qodo-Embed-1-1.5B.

As interest in running AI applications natively on mobile devices continues to expand, hardware makers like Apple, Samsung, and Qualcomm are also working on ways to support models without compromising device performance or battery life. The arrival of EmbeddingGemma illustrates how embedding models are increasingly becoming a core component of enterprise AI strategies, with developers and organizations showing enthusiasm for integrating them into local workflows. Google’s emphasis on multilingual training, flexibility, and compatibility with popular AI frameworks positions EmbeddingGemma as an important entry in the embedding model market, particularly for developers seeking practical and private on-device solutions.

Source

Follow the SPIN IDG WhatsApp Channel for updates across the Smart Pakistan Insights Network covering all of Pakistan’s technology ecosystem. 

Share
Tweet
Share
Share
Share
Related Topics
  • AI models
  • DeepMind
  • EmbeddingGemma
  • embeddings
  • Gemma 3
  • Google
  • MLX
  • mobile AI
  • Ollama
  • on-device AI
  • RAG
  • semantic search
Previous Article
  • Business

Pakistan’s Services Exports Jump 18.3% In July Driven By IT And Telecom Growth

  • September 6, 2025
Read More
Next Article
  • Cellcos

PTCL Confirms Submarine Cable Damage May Slow Internet Speeds In Pakistan

  • September 6, 2025
Read More
You May Also Like
Read More
  • TechAdvisor

OpenAI Updates ChatGPT’s Default Model To GPT-5.5 Instant With Fewer Hallucinations And New Memory Sources Feature

  • Press Desk
  • May 7, 2026
Read More
  • TechAdvisor

Google Pixel 9 And Pixel 10 Users Report Persistent eSIM Connectivity Issues Requiring Frequent Phone Resets

  • Press Desk
  • May 7, 2026
Read More
  • TechAdvisor

Meta Introduces AI-Powered Age Detection And Visual Analysis To Protect Teenagers Across Its Platforms In Pakistan

  • Press Desk
  • May 7, 2026
Read More
  • TechAdvisor

StormFiber Launches Storm Social+ CDN Bundle Offering Unlimited Speeds On Netflix, YouTube, TikTok And More

  • Press Desk
  • May 6, 2026
Read More
  • TechAdvisor

Apple’s iOS 26.5 To Introduce End-To-End Encryption For RCS Messaging Between iPhone And Android Devices

  • Press Desk
  • May 6, 2026
Read More
  • TechAdvisor

Samsung Galaxy S27 Ultra Rumoured To Feature 200MP Camera With Variable Aperture

  • Press Desk
  • May 5, 2026
Read More
  • TechAdvisor

Ask.com Has Shut Down, Marking The Official Farewell To The Internet’s Favorite Butler

  • Press Desk
  • May 4, 2026
Read More
  • TechAdvisor

Motorola Launches Eight New Phones Including Razr 70 Ultra Foldable And Moto G87 With 200MP Camera

  • Press Desk
  • May 4, 2026
Trending Posts
  • OpenAI Updates ChatGPT’s Default Model To GPT-5.5 Instant With Fewer Hallucinations And New Memory Sources Feature
    • May 7, 2026
  • Nintendo Surprise Reveals Star Fox Remake For Switch 2 With James McCloud Prologue And 4v4 Star Wolf Battle Mode Launching June 25
    • May 7, 2026
  • Google Pixel 9 And Pixel 10 Users Report Persistent eSIM Connectivity Issues Requiring Frequent Phone Resets
    • May 7, 2026
  • Ministry Of IT Pakistan Advertises Director And Joint Director Positions For National AI Advancement Initiative With May 17 Deadline
    • May 7, 2026
  • BankIslami And Aik Partner With Paklaunch At UNConference 26 To Support Pakistan’s Startup And Fintech Ecosystem
    • May 7, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.