diffusion-large-language-models

#diffusion-large-language-models

FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models

arXiv cs.LG ↗ · 2026-06-08 Cached

This paper proposes FAIR-Calib, a two-stage post-training quantization framework for diffusion large language models that addresses the instability of token commitments during iterative refinement. It achieves state-of-the-art results on LLaDA and Dream models under low-bit quantization.

0 favorites 0 likes

#diffusion-large-language-models

dMoE: dLLMs with Learnable Block Experts

Hugging Face Daily Papers ↗ · 2026-05-29 Cached

This paper proposes dMoE, a block-level mixture-of-experts framework for diffusion large language models that aggregates token-level expert distributions into block-level routing, reducing activated experts and memory usage while maintaining performance.

0 favorites 0 likes

diffusion-large-language-models

FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models

dMoE: dLLMs with Learnable Block Experts

Submit Feedback