Tag
A benchmark comparison of local open-weight LLMs on a single H100 (FP8) shows DiffusionGemma is 4x faster but makes 6x more mistakes than Gemma4 26B A4B, highlighting trade-offs between speed and accuracy in diffusion versus autoregressive models.