Gemma4 12B update

Reddit r/LocalLLaMA 06/03/26, 11:45 PM Models

Summary

Google's Gemma4-12B model weights on HuggingFace have been silently updated; the reason for the update is unclear but may involve fixes.

A couple hours ago, the full content of the Gemma4-12B HuggingFace repos; including models weights, have been "updated". I can't find information about what was the reason behind this update, does anyone know what's up with that? Do we need updated quants to fix some issue? [https://huggingface.co/google/gemma-4-12B-it/commit/66bc78a7534d523aa32004652cb02cc2e6354c62](https://huggingface.co/google/gemma-4-12B-it/commit/66bc78a7534d523aa32004652cb02cc2e6354c62)

Original Article

Similar Articles

More Gemma 4 models incoming

Reddit r/LocalLLaMA

Google announces that more Gemma 4 models are coming, potentially including a 120B parameter model.

google/gemma-4-26B-A4B-it

Hugging Face Models Trending

Google DeepMind releases Gemma 4, a family of open-weight multimodal models ranging from 2.3B to 31B parameters with support for text, image, video, and audio inputs. The models feature 256K context windows, MoE and dense architectures, enhanced reasoning capabilities, and are optimized for deployment across devices from mobile to servers.

google/gemma-4-31B-it-assistant

Hugging Face Models Trending

Google DeepMind releases Gemma 4, a family of open-weights multimodal models featuring Multi-Token Prediction (MTP) for up to 2x decoding speedups, supporting text, image, video, and audio with enhanced reasoning and coding capabilities.

@lmstudio: Gemma 4 12B is here! Dense, mid-sized Gemma that fits right on your laptop - released by @google under Apache 2.0 Avail…

X AI KOLs Timeline

Google released Gemma 4 12B, a dense mid-sized model that runs on laptops, under Apache 2.0, now available in LM Studio.

Introducing Gemma 3

Google DeepMind Blog

Google introduces Gemma 3, a collection of lightweight open models (1B, 4B, 12B, 27B) designed to run on single GPUs or TPUs, featuring support for 140+ languages, 128k context window, and multimodal capabilities. The models outperform larger competitors like Llama 3 and DeepSeek-V3 while maintaining efficiency for on-device deployment.