4GB "Gemini Nano" model GGUF anyone?
Summary
A user asks which ~4GB AI model (likely Gemini Nano) Chrome silently downloaded to power its on-device features, and requests a GGUF version so the model can be run locally with llama.cpp.
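If a GGUF export of the model were available, running it locally would look like the standard llama.cpp CLI invocation. A minimal sketch, assuming llama.cpp is built and on PATH; the model filename `gemini-nano.gguf` is hypothetical, since no such GGUF has been released:

```shell
# Hypothetical model file; no official Gemini Nano GGUF exists
MODEL=gemini-nano.gguf

if command -v llama-cli >/dev/null 2>&1; then
  # -m: model path, -p: prompt, -n: max tokens to generate
  llama-cli -m "$MODEL" -p "Hello, who are you?" -n 128
else
  echo "llama-cli not found; build llama.cpp from github.com/ggml-org/llama.cpp"
fi
```

The same file would also load in any llama.cpp frontend (server mode, bindings), which is why the poster asks for GGUF specifically.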
Similar Articles
Jiunsong/supergemma4-26b-uncensored-gguf-v2
SuperGemma4-26B-Uncensored-Fast GGUF v2 is a quantized, locally-runnable variant of Google's Gemma-4-26B model optimized for Apple Silicon, offering faster inference speeds and less-censored chat behavior while maintaining practical performance on general tasks.
guess what? if you are a chrome user, technically you are a localllama member!
Google Chrome is silently installing a 4 GB Gemini Nano AI model on user devices without explicit consent or opt-out UI, raising significant privacy, legal, and environmental concerns.
Jiunsong/supergemma4-26b-uncensored-mlx-4bit-v2
SuperGemma4-26B-Uncensored-MLX-4bit-v2 is a fine-tuned and quantized variant of Google's Gemma 4 26B optimized for Apple Silicon, offering improved performance on code, reasoning, and tool-use tasks while maintaining faster inference speeds compared to the stock baseline.
unsloth/gemma-4-26B-A4B-it-GGUF
Unsloth releases GGUF-quantized versions of Google DeepMind's Gemma 4 26B A4B instruction-tuned model, enabling efficient local inference with support for tool-calling and fine-tuning via Unsloth Studio. Gemma 4 is a multimodal MoE model with a 256K context window, supporting text, image, video, and audio inputs.
Chrome’s AI features may be hogging 4GB of your computer storage
Google Chrome is automatically downloading a 4GB Gemini Nano model weights file to users' devices to power on-device AI features like scam detection and writing assistance, often without clear notification about storage requirements. Users can disable the On-Device AI toggle in Chrome settings to remove the file and prevent re-downloads.