bregman-divergence

Tag

Cards List
#bregman-divergence

Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity

arXiv cs.LG · 2026-06-11 Cached

This paper reveals that Mirror Descent with non-quadratic regularizers can be exponentially more sensitive to initialization than Gradient Descent, even under well-conditioned settings, which has implications for reproducibility in RL and LLM post-training.

0 favorites 0 likes
← Back to home

Submit Feedback