Adaptive ML trains Gemma 3 for exceptional multilingual results
Adaptive ML helps SK Telecom create a version of Gemma that moderates customer support conversations at a fraction of the size, latency, and cost of larger models.
Adaptive ML is the team behind Adaptive Engine, a reinforcement learning platform that enables enterprises to fine-tune, evaluate, and serve small, specialized models. South Korean telecom giant SK Telecom chose Adaptive ML to train a multilingual customer service moderation LLM to support SKT's more than 23 million subscribers, who speak a mix of English and Korean.
After training with Adaptive Engine, Adaptive ML and SK Telecom found that Gemma 3 4B could meet or exceed the performance of both proprietary and open models at significantly lower cost, making it a compelling choice for their customer service use case.
The challenge
LLMs often struggle to stay compliant with a business’s unique content policies. In online customer service, those policies are what keep the environment safe and respectful for customers and employees alike. Models are used to identify and respond to harmful written content across customer service chats and emails, and every company has its own criteria and policies for such language. Teaching those nuances to off-the-shelf proprietary models is hard, so they often miss content that should be marked as adult, harmful, biased, or illegal.
That challenge is compounded for businesses like SK Telecom that need a multilingual solution, because many of the leading open and proprietary LLMs focus on Western languages like English and don’t perform as well in languages like Korean.
While some larger models might meet the bar for identifying harmful content, their parameter count drives up inference cost and latency. This led the Adaptive ML team to create Adaptive Engine, which gives enterprises more control and flexibility when training smaller models while also lowering infrastructure costs and latency.
SK Telecom’s evaluation, which demonstrates Gemma 3 4B’s strong Korean performance relative to its size in identifying harmful content.
For each model, the ability to recognize toxic content is measured across seven dimensions of “unsafe language,” and an aggregate score is calculated (0-1 scale, higher is better).
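To make the chart’s aggregate metric concrete, here is a minimal sketch of averaging per-dimension scores into a single 0-1 number. The seven dimension names and the values are illustrative placeholders, not SK Telecom’s published evaluation, and the averaging itself is an assumption about how such an aggregate could be computed.

```python
# Illustrative only: aggregate a model's moderation quality as the mean of
# seven per-dimension scores (0-1 scale, higher is better). Dimension names
# and values below are placeholders, not SK Telecom's actual evaluation.
from statistics import mean

dimension_scores = {
    "adult": 0.82,
    "harmful": 0.79,
    "biased": 0.77,
    "illegal": 0.81,
    "harassment": 0.80,
    "self_harm": 0.78,
    "profanity": 0.83,
}

aggregate = mean(dimension_scores.values())
print(f"Aggregate score: {aggregate:.2f}")  # e.g. 0.80
```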
The solution
Adaptive ML selected Gemma 3 4B for its size and performance, along with multiple other open models, to fine-tune for SK Telecom with Adaptive Engine. The models were first fine-tuned with supervised fine-tuning (SFT) on 8K Korean samples, then trained further with proximal policy optimization (PPO). A similarly sized English dataset for harmful content detection was used to train and assess the models’ ability to identify toxicity across both languages.
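The broad shape of that two-stage recipe can be sketched with the open-source TRL library; this is not Adaptive Engine’s implementation, and the dataset path below is a placeholder for the Korean moderation samples.

```python
# Minimal sketch of the two-stage recipe described above, using the open-source
# TRL library -- not Adaptive Engine's implementation. The dataset path is a
# placeholder for ~8K Korean moderation samples in chat ("messages") format.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Each record: {"messages": [{"role": "user", ...}, {"role": "assistant", ...}]}
dataset = load_dataset("json", data_files="korean_moderation_sft.jsonl", split="train")

# Stage 1: supervised fine-tuning (SFT) of Gemma 3 4B on the labeled samples.
trainer = SFTTrainer(
    model="google/gemma-3-4b-it",
    train_dataset=dataset,
    args=SFTConfig(output_dir="gemma3-4b-moderation-sft"),
)
trainer.train()

# Stage 2 (not shown): reinforcement learning with proximal policy optimization,
# e.g. trl.PPOTrainer, using a reward signal that scores moderation decisions.
```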
Working with Gemma 3 was a positive experience for the team. “We downloaded the model directly from Hugging Face and it was easy to convert to Adaptive ML’s internal format thanks to Gemma’s detailed documentation,” said Alessandro Cappelli, Co-Founder and Research Scientist at Adaptive ML. “The game-changer for our use case was the model's strong multilingual capabilities and long context windows.”
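For readers who want to start from the same checkpoint, here is a minimal sketch of pulling the instruction-tuned Gemma 3 4B model from Hugging Face and running a single text-only prompt with the transformers library. The moderation instruction and the Korean example message are illustrative placeholders, not SK Telecom’s policy or data, and this does not reflect Adaptive ML’s internal format.

```python
# Minimal sketch (assumes a recent transformers release with Gemma 3 support):
# load google/gemma-3-4b-it from Hugging Face and run one text-only prompt.
# The instruction and example message below are illustrative placeholders.
import torch
from transformers import pipeline

pipe = pipeline(
    "image-text-to-text",              # Gemma 3 4B checkpoints are multimodal
    model="google/gemma-3-4b-it",
    device_map="auto",
    torch_dtype=torch.bfloat16,
)

messages = [
    {
        "role": "system",
        "content": [{"type": "text", "text": "Label the customer message as SAFE or UNSAFE "
                                             "(adult, harmful, biased, or illegal)."}],
    },
    {
        "role": "user",
        # Placeholder Korean customer message: "Hello, I'd like to ask about a refund."
        "content": [{"type": "text", "text": "안녕하세요, 환불 문의 드립니다."}],
    },
]

output = pipe(text=messages, max_new_tokens=16)
print(output[0]["generated_text"][-1]["content"])
```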
The impact
After training with Adaptive Engine, Adaptive ML evaluated the models using content moderation tests conducted in both Korean and English. And thanks to the SFT and PPO training, Gemma 3 4B now performs in Korean as well as it does in English, delivering the multilingual performance required to meet SK Telecom’s customer service needs.
Effective content moderation in Korean demands an understanding of cultural nuance where standard APIs fail. We were impressed that by using advanced reinforcement learning, a small, open 4B model could achieve precision that outperforms even large proprietary systems in both Korean and English. This provides a highly accurate, low-latency solution, giving us greater strategic control while keeping our customer data secure.
Eric Davis, Vice President of the AI Tech Collaboration Group at SK Telecom
In both English and Korean, Gemma 3 4B demonstrated the best performance relative to its size, beating open and proprietary models twice its size. For instance, in the Korean evaluation, Gemma 3 4B trained with PPO achieved a 0.80 aggregate score, exceeding GPT-4o and Claude 3.7 Sonnet (0.76 and 0.77, respectively) and almost matching a Llama 8B model trained with SFT alone.
“Gemma 3 4B was able to exceed frontier performance at a fraction of the size, offering a lower-latency, lower-cost alternative for customer support moderation,” said Cappelli.
What’s next
Reflecting on his experience with Gemma, Cappelli feels the team spent “too much time debating between the 12B and 27B versions, when, in the end, Gemma 4B worked surprisingly well for our use case,” and recommends that other developers “give small LLMs the opportunity; they’re likely to surprise you.”
Going forward, the team plans to keep working with Gemma. “These results are very exciting and validate the use of Gemma 4B for more use cases, particularly other customer support workflows,” concludes Cappelli.