Abstract
EmbeddingGemma, a lightweight text embedding model based on Gemma 3, achieves state-of-the-art performance among models with fewer than 500M parameters through encoder-decoder initialization, geometric embedding distillation, and spread-out regularization.
We introduce EmbeddingGemma, a new lightweight, open text embedding model based on the Gemma 3 language model family. Our innovative training recipe strategically captures knowledge from larger models via encoder-decoder initialization and geometric embedding distillation. We improve model robustness and expressiveness with a spread-out regularizer, and ensure generalizability by merging checkpoints from varied, optimized mixtures. Evaluated on the Massive Text Embedding Benchmark (MTEB) across multilingual, English, and code domains, EmbeddingGemma (300M) achieves state-of-the-art results. Notably, it outperforms prior top models, both proprietary and open, with fewer than 500M parameters, and provides performance comparable to models double its size, offering an exceptional performance-to-cost ratio. Remarkably, this lead persists when quantizing model weights or truncating embedding outputs. This makes EmbeddingGemma particularly well-suited for low-latency and high-throughput use cases such as on-device applications. We provide ablation studies exploring our key design choices. We release EmbeddingGemma to the community to promote further research.
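The abstract names two key ingredients of the training recipe, geometric embedding distillation and a spread-out regularizer, without spelling out their formulations here. As a rough, non-authoritative illustration, the PyTorch sketch below shows one common way such terms can be implemented: the distillation term matches the student's in-batch cosine-similarity matrix to the teacher's, and the spread-out term pushes non-matching in-batch embeddings toward orthogonality. The function names, loss combination, and weighting coefficients are assumptions made for this sketch, not the paper's actual implementation.

```python
import torch
import torch.nn.functional as F


def geometric_distillation_loss(student_emb: torch.Tensor, teacher_emb: torch.Tensor) -> torch.Tensor:
    """Match the pairwise-similarity geometry of a larger teacher's embedding space.

    student_emb: (B, d_student) embeddings from the small model.
    teacher_emb: (B, d_teacher) embeddings from the teacher; the dimensions may
    differ, so we compare in-batch similarity matrices rather than raw vectors.
    """
    s = F.normalize(student_emb, dim=-1)
    t = F.normalize(teacher_emb, dim=-1)
    return F.mse_loss(s @ s.T, t @ t.T)


def spread_out_regularizer(emb: torch.Tensor) -> torch.Tensor:
    """Penalize squared cosine similarity between distinct in-batch embeddings,
    pushing them toward orthogonality so the embedding space stays spread out."""
    e = F.normalize(emb, dim=-1)
    sim = e @ e.T  # (B, B) cosine similarities
    off_diag = sim[~torch.eye(emb.size(0), dtype=torch.bool, device=emb.device)]
    return (off_diag ** 2).mean()


# Illustrative combination with a standard contrastive retrieval loss
# (the coefficients below are invented for this sketch):
# loss = contrastive_loss \
#        + 1.0 * geometric_distillation_loss(student_q, teacher_q) \
#        + 0.1 * spread_out_regularizer(student_q)
```

In this kind of setup, comparing similarity matrices rather than embedding vectors lets a 300M-parameter student learn from a teacher with a different embedding dimension, while the regularizer discourages embedding collapse, consistent with the robustness and expressiveness goals described in the abstract.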
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API:
- Granite Embedding R2 Models (2025)
- Training LLMs to be Better Text Embedders through Bidirectional Reconstruction (2025)
- Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings (2025)
- Causal2Vec: Improving Decoder-only LLMs as Versatile Embedding Models (2025)
- On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey (2025)
- Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment (2025)
- MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction (2025)