Google's quantization aware trained Gemma checkpoints enabling mobile device inference just dropped on HF

Reddit r/singularity Models

Summary

Google released quantization-aware trained Gemma 4 checkpoints on HuggingFace, optimized for mobile device inference and available in QAT Mobile and Q4_0 variants.

Release Blog Post: [Gemma 4 with quantization-aware training](https://blog.google/innovation-and-ai/technology/developers-tools/quantization-aware-training-gemma-4/) HuggingFace for mobile: [Gemma 4 QAT Mobile - a google Collection](https://huggingface.co/collections/google/gemma-4-qat-mobile) HuggingFace for Q4\_0: [Gemma 4 QAT Q4\_0 - a google Collection](https://huggingface.co/collections/google/gemma-4-qat-q4-0)
Original Article

Similar Articles

google/gemma-4-12B-it-qat-q4_0-gguf

Hugging Face Models Trending

Google DeepMind releases Gemma 4 models optimized with Quantization-Aware Training (QAT) in multiple formats including GGUF, enabling high quality with reduced memory requirements.