DiffusionGemma 26b on a 4090 at up to 475t/s... and some thoughts...

Reddit r/LocalLLaMA ↗ · 6d ago

A user shares their experience running DiffusionGemma 26B on a 4090 GPU via vLLM. It reaches up to 475 t/s, but the user notes drawbacks: a single-user limitation, lower accuracy, and short context. Their conclusion is that it is not worth using over the regular 26B model.
