Tag
Presents an iterative data generation pipeline to isolate cascading linear features responsible for sycophancy in language models, enabling detection, scoring, and steering with lower computational cost than baselines.