attention-state-adaptive-generation

@sheriyuo: This paper proposes ASAG, Attention-State Adaptive Generation, a training-free, plug-and-play stopping framework for re…

X AI KOLs Timeline · yesterday

ASAG uses attention entropy to detect when reasoning is unproductive, stopping early to improve accuracy and reduce token generation. Experiments on Qwen3-8B show a 4.4% accuracy gain and over 40% fewer generated tokens.
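To make the idea of an entropy-based stopping signal concrete, here is a minimal sketch. It is not the ASAG algorithm: the summary above does not say how the entropy is aggregated or which direction counts as unproductive. The function names, the threshold, the patience window, and the choice to treat persistently high entropy as the stop signal are all assumptions made for illustration.

```python
import math
from typing import Sequence


def attention_entropy(weights: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a single attention distribution.

    The weights are normalized to sum to 1 before the entropy is computed.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("attention weights must have a positive sum")
    entropy = 0.0
    for w in weights:
        p = w / total
        if p > 0:  # zero-probability entries contribute nothing
            entropy -= p * math.log(p)
    return entropy


def should_stop(entropies: Sequence[float], threshold: float, patience: int) -> bool:
    """Hypothetical stopping rule (an assumption, not taken from the paper).

    Returns True when the most recent `patience` entropy values all exceed
    `threshold`, i.e. attention has stayed diffuse for several steps in a row.
    """
    if patience < 1:
        raise ValueError("patience must be at least 1")
    if len(entropies) < patience:
        return False
    return all(h > threshold for h in entropies[-patience:])
```

In a real decoding loop, one entropy value per generated token would be appended to a running list and `should_stop` checked after each step. The threshold and patience would need tuning per model, and the actual criterion should be taken from the paper.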
