Tag
CopT introduces a contrastive on-policy thinking framework for LLMs that generates draft answers first, then uses contrastive verification and dynamic thinking to improve accuracy while reducing token consumption, achieving up to 23% higher accuracy and 57% lower token usage on math, coding, and agentic reasoning tasks.
ELF proposes a continuous diffusion model for language that uses embedding space and flow matching, outperforming existing discrete and continuous diffusion language models with fewer sampling steps.