reasoning-tokens

Tag

Cards List
#reasoning-tokens

ConFu: Contemplate the Future for Better Speculative Sampling

arXiv cs.CL · 2026-04-20 Cached

ConFu introduces a novel speculative decoding framework that enables draft models to anticipate future generation directions through contemplate tokens and soft prompts, achieving 8-20% improvements in token acceptance rates and generation speed over EAGLE-3 across multiple LLM models.

0 favorites 0 likes
← Back to home

Submit Feedback