implicit-bias

#implicit-bias

Edge of Stability Selectively Shapes Learning Across the Data Distribution

arXiv cs.LG ↗ · 2026-06-04 Cached

MIT researchers show that the edge of stability (EoS) in neural network training is not merely a global optimization phenomenon but selectively redistributes learning across subsets of the training distribution, amplifying progress on some data groups while suppressing others. They identify two key conditions governing this allocation: gradient alignment with the top Hessian eigenvector and sustained non-vanishing gradient magnitude.

0 favorites 0 likes

#implicit-bias

The Implicit Bias of Depth: From Neural Collapse to Softmax Codes

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper studies how depth alone induces an implicit low-rank bias in deep unconstrained feature models trained without regularization, shifting the optimal solution from neural collapse to softmax codes, and provides the first asymptotic and dynamic characterization of this bias under gradient descent with cross-entropy loss.

0 favorites 0 likes

#implicit-bias

Deep double descent

OpenAI Blog ↗ · 2019-12-05 Cached

OpenAI research reveals the 'double descent' phenomenon where test error exhibits a non-monotonic pattern as both model size and training steps increase, challenging traditional understanding of the bias-variance tradeoff in deep learning.

0 favorites 0 likes

implicit-bias

Edge of Stability Selectively Shapes Learning Across the Data Distribution

The Implicit Bias of Depth: From Neural Collapse to Softmax Codes

Deep double descent

Submit Feedback