stochastic-gradient-descent

Tag

Cards List
#stochastic-gradient-descent

A Link between Shock-wave Theory and Symmetry-reduced Stochastic Gradient Descent for Artificial Neural Networks

arXiv cs.LG · 2026-06-18 Cached

This paper establishes a mathematically rigorous connection between shock-wave theory and symmetry-quotiented learning dynamics of stochastic gradient descent, showing that after symmetry reduction and coarse-graining, the dynamics satisfy viscous Hamilton-Jacobi and Burgers-type equations with shock formation times controlled by loss curvature.

0 favorites 0 likes
#stochastic-gradient-descent

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

arXiv cs.LG · 2026-06-08 Cached

This paper analyzes generalization error, uniform stability, and uniform argument stability of gradient descent (GD) and stochastic gradient descent (SGD) over discrete parameter spaces with deterministic or stochastic rounding, showing that rounding degrades generalization for GD and introduces dimension-dependent errors for stochastic rounding.

0 favorites 0 likes
← Back to home

Submit Feedback