batchtopk

#batchtopk

Size Doesn't Matter: Cosine-Scored Sparse Autoencoders

arXiv cs.LG ↗ · 2026-06-16 Cached

This paper proposes replacing the inner product scoring in sparse autoencoders with a learned combination of cosine similarity and input magnitude, showing that the resulting features are more interpretable and concept-aligned, with the optimizer consistently preferring cosine over inner product.

0 favorites 0 likes

batchtopk

Size Doesn't Matter: Cosine-Scored Sparse Autoencoders

Submit Feedback