The first Gemma 4 12B finetunes are ready
Summary
The first finetuned variants of the Gemma 4 12B model are now available on Hugging Face, offered in GGUF format by multiple developers.
Similar Articles
G4-Meromero-31B-Uncensored-Heretic Is Out Now, a Finetune of Gemma 4 31B It Designed for Creative Tasks, With Kld of 0.0100 and 15/100 Refusals!
G4-Meromero-31B-Uncensored-Heretic is a finetune of Gemma 4 31B that reduces refusal rate to 15/100 while keeping KL divergence at 0.01, preserving model quality. It is designed for creative tasks and available as GGUF quantizations on Hugging Face.
@LyalinDotCom: If you're waiting Gemma 4 12b through @ollama, its here: gemma4:12b gemma4:12b-it-q4_K_M gemma4:12b-it-q8_0 gemma4:12b-…
Gemma 4 12b models are now available on Ollama, offering various quantized versions for local AI inference.
More Gemma 4 models incoming
Google announces that more Gemma 4 models are coming, potentially including a 120B parameter model.
Gemma 4 26B-A4B GGUF Benchmarks
Unsloth has released KL Divergence benchmarks for Gemma 4 26B-A4B GGUF quantizations, showing Unsloth GGUFs top 21 of 22 sizes on the Pareto frontier. They also introduced a new UD-IQ4_NL_XL quant fitting in 16GB VRAM and updated Q6_K and MLX quants for both Gemma 4 and Qwen3.6.
@_philschmid: We just launched a Gemma 4 12B! Our first mid-sized model with native audio inputs. Gemma 4 12 B is a unified, encoder-…
We just launched Gemma 4 12B, a mid-sized multimodal model with native audio inputs, requiring only 16GB memory and released under Apache 2.0.