Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations

4 Pith papers cite this work. Polarity classification is still indexing.

abstract

We develop the mathematical foundations of the stochastic modified equations (SME) framework for analyzing the dynamics of stochastic gradient algorithms, where the latter are approximated by a class of stochastic differential equations with small noise parameters. We prove that this approximation can be understood mathematically as a weak approximation, which leads to a number of precise and useful results on the approximations of stochastic gradient descent (SGD), momentum SGD and stochastic Nesterov's accelerated gradient method in the general setting of stochastic objectives. We also demonstrate through explicit calculations that this continuous-time approach can uncover important analytical insights into the stochastic gradient algorithms under consideration that may not be easy to obtain in a purely discrete-time setting.
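As a rough illustration of what "weak approximation" means here, the sketch below compares SGD with an SDE of the commonly used first-order form dX = -∇f(X) dt + sqrt(η Σ(X)) dW on a toy problem. Everything specific is an assumption made for illustration and is not taken from the abstract: the objective f_γ(x) = ½(x − γ)² with γ ~ N(0, σ²), the test function g(x) = x², and the helper names `sgd_second_moment`, `sme_second_moment` and `weak_error`. For this toy problem both second moments are available in closed form, so no sampling is needed.

```python
import math


def sgd_second_moment(eta, T, x0=1.0, sigma=1.0):
    """Exact E[x_k^2] after k = round(T / eta) SGD steps on the toy objective.

    Assumed toy problem: f_gamma(x) = 0.5 * (x - gamma)^2, gamma ~ N(0, sigma^2),
    so one SGD step is x_{k+1} = (1 - eta) * x_k + eta * gamma_k.
    """
    steps = round(T / eta)
    second = x0 * x0
    for _ in range(steps):
        # gamma_k has mean zero and is independent of x_k, so the cross term vanishes.
        second = (1.0 - eta) ** 2 * second + eta ** 2 * sigma ** 2
    return second


def sme_second_moment(eta, T, x0=1.0, sigma=1.0):
    """Exact E[X_T^2] for the Ornstein-Uhlenbeck SDE dX = -X dt + sqrt(eta) * sigma dW."""
    decay = math.exp(-2.0 * T)
    return decay * x0 * x0 + 0.5 * eta * sigma ** 2 * (1.0 - decay)


def weak_error(eta, T=1.0):
    """Gap |E g(x_k) - E g(X_T)| for the test function g(x) = x^2."""
    return abs(sgd_second_moment(eta, T) - sme_second_moment(eta, T))


if __name__ == "__main__":
    for eta in (0.1, 0.05, 0.025):
        print(f"eta={eta:<6} weak error={weak_error(eta):.6f}")
```

Halving the step size should roughly halve the reported gap, which is the behavior one would expect from a first-order weak approximation. The comparison is in expectation of a test function only; it says nothing about closeness of individual sample paths.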

years

2026 3 2025 1

verdicts

UNVERDICTED 4

roles

background 1

polarities

background 1

representative citing papers

Thermodynamic Irreversibility of Training Algorithms

cond-mat.stat-mech · 2026-05-21 · unverdicted · novelty 6.0

Four characterizations of irreversibility in training algorithms are equivalent to leading order in step size and produce an emergent force that breaks reparametrization symmetries while favoring minimum entropy production trajectories.
