@TheAhmadOsman: Great news Google just released the QAT (4bit) of their Gemma 4 model series including the 31B Dense and the 26B MoE An…
Summary
Google released QAT (4-bit) versions of their Gemma 4 model series, including the 31B Dense and 26B MoE models, furthering open-source AI.
View Cached Full Text
Cached at: 06/06/26, 01:22 AM
Great news
Google just released the QAT (4bit) of their Gemma 4 model series including the 31B Dense and the 26B MoE
Another W for Opensource AI this week https://t.co/SRHWknleOP
Similar Articles
@_philschmid: Weights: https://huggingface.co/collections/google/gemma-4-qat-q4-0… Blog: https://blog.google/innovation-and-ai/techno…
Google released Gemma 4 models with quantization-aware training (QAT) at Q4_0 precision on Hugging Face, offering efficient variants from 5B to 33B parameters.
Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency
Google releases Gemma 4 models optimized with Quantization-Aware Training (QAT) to improve efficiency for mobile and laptop deployment, reducing memory footprint to 1GB for the E2B model while preserving quality.
google/gemma-4-12B-it-qat-q4_0-gguf
Google DeepMind releases Gemma 4 models optimized with Quantization-Aware Training (QAT) in multiple formats including GGUF, enabling high quality with reduced memory requirements.
Gemma 4: Byte for byte, the most capable open models
Google DeepMind introduces Gemma 4, its most capable family of open models to date, designed for advanced reasoning and agentic workflows with high intelligence-per-parameter efficiency across multiple sizes.
google/gemma-4-26B-A4B-it
Google DeepMind releases Gemma 4, a family of open-weight multimodal models ranging from 2.3B to 31B parameters with support for text, image, video, and audio inputs. The models feature 256K context windows, MoE and dense architectures, enhanced reasoning capabilities, and are optimized for deployment across devices from mobile to servers.