diffusiongemma

DiffusionGemma 4 on 4x7900xtx

Reddit r/LocalLLaMA · 6h ago

The poster reports running DiffusionGemma 26B on four AMD 7900 XTX GPUs with vLLM, reaching 100 t/s generation speed and 45-60 t/s overall, and shares performance metrics and setup commands.

DiffusionGemma 26B A4B results on my 5090

Reddit r/LocalLLaMA · 6h ago

This post presents benchmark results and tuning parameters for running DiffusionGemma 26B A4B GGUF models on an RTX 5090 GPU, showing up to 44% speedup via optimized temperature settings and quantization choices.

@HuggingPapers: NVIDIA just released an NVFP4-quantized DiffusionGemma on Hugging Face A 26B MoE multimodal model generating text via p…

X AI KOLs Following · yesterday

NVIDIA released a 26B MoE multimodal model called DiffusionGemma on Hugging Face, using NVFP4 quantization and achieving over 1,100 tokens per second on Hopper hardware.

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

NVIDIA Blog · yesterday

NVIDIA optimizes Google DeepMind's DiffusionGemma, an open model that generates text in parallel 256-token blocks, achieving up to 4x faster performance on local RTX GPUs, DGX Spark, and DGX Station systems.

DiffusionGemma: The Developer Guide - Google Developers Blog

Reddit r/LocalLLaMA · yesterday

DiffusionGemma is a new experimental model from Google DeepMind that uses parallel generation on a 256-token canvas, achieving up to 4x faster token generation on GPUs. This developer guide explains its architecture, bidirectional context, and includes a fine-tuning recipe for solving Sudoku.

unsloth/diffusiongemma-26B-A4B-it-GGUF

Hugging Face Models Trending · yesterday

Unsloth releases GGUF quantizations of Google DeepMind's DiffusionGemma (26B-A4B), a new block-diffusion architecture for faster text generation, ready for llama.cpp.
