entropy-guided-search

#entropy-guided-search

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

Hugging Face Daily Papers ↗ · 4d ago Cached

This paper introduces Confident Decoding, a training-free decoding strategy that dynamically selects the most reliable intermediate layer in LLMs using entropy-guided search, mitigating the alignment tax and improving reasoning performance on benchmarks like GPQA-Diamond and Omni-MATH with negligible overhead.

0 favorites 0 likes

entropy-guided-search

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

Submit Feedback