pith. sign in

arxiv: 2602.05504 · v3 · pith:UMGMH243new · submitted 2026-02-05 · 🧮 math.OC

Continuized Nesterov Momentum Achieves the O(varepsilon^(-7/4)) Complexity without Additional Mechanisms

classification 🧮 math.OC
keywords mechanismsmomentumcomplexityvarepsiloncontinuizedwithoutaccumulatedachieve
0
0 comments X
read the original abstract

For first-order optimization of non-convex functions with Lipschitz continuous gradient and Hessian, the best known complexity for reaching an $\varepsilon$-approximation of a stationary point is $\mathcal{O}(\varepsilon^{-7/4})$. Existing algorithms achieving this bound are based on momentum, but are always complemented with safeguard mechanisms that erase the accumulated momentum if a certain condition is violated. Whether such resetting-momentum mechanisms are fundamentally necessary has remained an open question. We show that randomizing the parameters enable to achieve this complexity with high probability when using momentum, without any of such mechanisms. From an analysis perspective, we do so leveraging the continuized method, that interprets the algorithm as a realization of a continuous-time stochastic differential equation involving a Poisson process.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.