confidence-profiling

Fast-dLLM++: Fréchet Profile Decoding for Faster Diffusion LLM Inference

arXiv cs.CL · 2026-06-03

Fast-dLLM++ introduces Fréchet profile decoding for diffusion LLMs, a training-free method that selects parallel commit sets based on heterogeneous confidence profiles, achieving up to 37% higher throughput at comparable accuracy on benchmarks with LLaDA-8B.
