norm-decomposition

#norm-decomposition

A Geometric Account of Activation Steering through Angle-Norm Decomposition

arXiv cs.AI ↗ · 4d ago Cached

This paper analyzes linear activation steering in language models by decomposing interventions into angular and radial components. It finds that concepts are primarily encoded in angular structure, but norm adjustments are crucial for stability, supporting spherical steering methods while showing that additive coefficients conflate geometry.

0 favorites 0 likes

norm-decomposition

A Geometric Account of Activation Steering through Angle-Norm Decomposition

Submit Feedback