two-layer-networks

#two-layer-networks

Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking

arXiv cs.LG ↗ · 2026-05-12 Cached

This empirical study validates theoretical findings on feature repulsion and spectral lock-in during the grokking phenomenon in two-layer neural networks, demonstrating how activation functions influence the transition from memorization to generalization.

0 favorites 0 likes

two-layer-networks

Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking

Submit Feedback