[DeepMind] Introducing Gemma 3 270M: The Compact Model fo...

The past few months have been an exciting time for the Gemma family of open models. We introduced Gemma 3 and Gemma 3 QAT, delivering state-of-the-art performance for single cloud and desktop accelerators. Then, we announced the full release of Gemma 3n, a mobile-first architecture bringing powerful, real-time multimodal AI directly to edge devices.

Today, we’re adding a new, highly specialized tool to the Gemma 3 toolkit: Gemma 3 270M, a compact model with 270 million parameters designed for task-specific fine-tuning, featuring strong instruction-following and text structuring capabilities.

Core Capabilities of Gemma 3 270M

Compact and Capable Architecture: The model has a total of 270 million parameters, with 170 million embedding parameters due to a large vocabulary size and 100 million for transformer blocks. The large vocabulary of 256k tokens allows it to handle specific and rare tokens, making it a strong base model for further fine-tuning.
Extreme Energy Efficiency: One key advantage is its low power consumption. Internal tests on a Pixel 9 Pro SoC show that the INT4-quantized model used only 0.75% of the battery for 25 conversations.
Instruction Following: An instruction-tuned model is released alongside a pre-trained checkpoint, capable of following general instructions right out of the box.
Production-Ready Quantization: Quantization-Aware Trained (QAT) checkpoints are available, enabling the model to run at INT4 precision with minimal performance degradation.

When to Choose Gemma 3 270M

You have a high-volume, well-defined task.
You need to iterate and deploy quickly.
You need to ensure user privacy.
You want a fleet of specialized task models.

We aim to make it easy to turn Gemma 3 270M into your custom solution. Built on the same architecture as other Gemma 3 models, it includes guides for quick setup. Downloading, trying, and fine-tuning the model is straightforward.

With the release of Gemma 3 270M, we’re empowering developers to build smarter, faster, and more efficient AI solutions. We can’t wait to see the specialized models you create!

Blogger's Review: The release of Gemma 3 270M signifies an ideal balance between compact, specialized AI models and high efficiency, providing developers with a potent tool, especially in resource-constrained environments. Its robust instruction-following capabilities and flexible fine-tuning options will undoubtedly drive the realization of more innovative applications.

[DeepMind] Introducing Gemma 3 270M: The Compact Model for Hyper-Efficient AI

Core Capabilities of Gemma 3 270M

When to Choose Gemma 3 270M