pith. sign in

arxiv: 2602.10022 · v2 · pith:LHJZ5TETnew · submitted 2026-02-10 · 🧮 math.OC

Acceleration for Polyak-{L}ojasiewicz Functions with a Gradient Aiming Condition

classification 🧮 math.OC
keywords functionsaccelerationdescentgradientmomentumaimingboundscondition
0
0 comments X
read the original abstract

It is known that when minimizing smooth Polyak-{\L}ojasiewicz (PL) functions, momentum algorithms cannot significantly improve the convergence bound of gradient descent, contrasting with the acceleration phenomenon occurring in the strongly convex case. To bridge this gap, the literature has proposed strongly quasar-convex functions as an intermediate non-convex class, for which accelerated bounds have been suggested to persist. We show that this is not true in general: the additional structure of strong quasar-convexity does not suffice to guaranty better worst-case bounds for momentum compared to gradient descent. As an alternative, we study PL functions under an aiming condition that measures how well the descent direction points toward a minimizer. This perspective clarifies the geometric ingredient enabling provable acceleration by momentum when minimizing PL functions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.