smoothness

#smoothness

Sharp First-Order Lower Bounds for Higher-Order Smooth Nonconvex Optimization

arXiv cs.LG ↗ · 2026-06-05 Cached

This paper proves sharp dimension-free first-order lower bounds for finding epsilon-stationary points in higher-order smooth nonconvex optimization, resolving open problems for Hessian-Lipschitz and third-order smooth cases.

0 favorites 0 likes

#smoothness

Convergence of Steepest Descent and Adam under Non-Uniform Smoothness

arXiv cs.LG ↗ · 2026-06-01 Cached

This paper generalizes non-uniform smoothness assumptions to objectives whose curvature is affine in the objective value, proving convergence rates for steepest descent and diagonal variants of RMSProp and Adam, with applications to logistic regression and neural networks.

0 favorites 0 likes

#smoothness

The quadratic sandwich

Hacker News Top ↗ · 2026-05-20 Cached

An article explaining the concepts of strong convexity and L-smoothness in optimization, known as the quadratic sandwich, and their role in gradient descent performance.

0 favorites 0 likes

#smoothness

Fitting Is Not Enough: Smoothness in Extremely Quantized LLMs

arXiv cs.CL ↗ · 2026-05-12 Cached

This paper investigates smoothness degradation in extremely quantized Large Language Models, arguing that preserving smoothness is crucial for maintaining performance beyond numerical accuracy.

0 favorites 0 likes

smoothness

Sharp First-Order Lower Bounds for Higher-Order Smooth Nonconvex Optimization

Convergence of Steepest Descent and Adam under Non-Uniform Smoothness

The quadratic sandwich

Fitting Is Not Enough: Smoothness in Extremely Quantized LLMs

Submit Feedback