Linear convergence rate in convex setup is possible! gradient descent method variants under (l\_0, l\_1) -smoothness

Aleksandr Lobanov, Alexander Gasnikov, Eduard Gorbunov · 2024 · arXiv 2412.17050

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Avoiding Bias in Clipped SGD for Overparameterized Models under Generalized Smoothness

math.OC · 2026-05-14 · unverdicted · novelty 7.0

Clipped and normalized SGD converge without bias in overparameterized interpolating models under (L0,L1)-smoothness, with improved rates and extensions to heavy-tailed noise and weaker smoothness.

citing papers explorer

Showing 1 of 1 citing paper.

Avoiding Bias in Clipped SGD for Overparameterized Models under Generalized Smoothness math.OC · 2026-05-14 · unverdicted · none · ref 41
Clipped and normalized SGD converge without bias in overparameterized interpolating models under (L0,L1)-smoothness, with improved rates and extensions to heavy-tailed noise and weaker smoothness.

Linear convergence rate in convex setup is possible! gradient descent method variants under (l\_0, l\_1) -smoothness

fields

years

verdicts

representative citing papers

citing papers explorer