early-decoding

#early-decoding

When is Your LLM Steerable?

Hugging Face Daily Papers ↗ · 2026-06-10 Cached

This paper introduces a method to predict activation steering effectiveness in language models from early decoding states using a Gradient Boosting Decision Trees (GBDT) classifier, enabling efficient steering strength optimization without full rollouts.

0 favorites 0 likes

#early-decoding

Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper investigates how large language models perform arithmetic operations by analyzing internal mechanisms through early decoding, revealing that proficient models exhibit a clear division of labor between attention and MLP modules in reasoning tasks.

0 favorites 0 likes

early-decoding

When is Your LLM Steerable?

Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms

Submit Feedback