Journal of Machine Learning Research , volume=

An optimal algorithm for bandit, zero-order convex optimization with two-point feedback , author=

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Unveiling High-Probability Generalization in Decentralized SGD

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.

Learning Dynamics of Zeroth-Order Optimization: A Kernel Perspective

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

Zeroth-order SGD learning dynamics are governed by a random low-dimensional projection of the empirical NTK whose approximation error scales with model output dimension, not parameter count.

Stability and Generalization for Decentralized Markov SGD

cs.LG · 2026-05-03 · unverdicted · novelty 6.0

Decentralized SGD and SGDA under Markovian sampling admit non-asymptotic generalization bounds that incorporate network topology, Markov mixing rates, and primal-dual dynamics.

citing papers explorer

Showing 3 of 3 citing papers.

Unveiling High-Probability Generalization in Decentralized SGD cs.LG · 2026-05-11 · unverdicted · none · ref 40
High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.
Learning Dynamics of Zeroth-Order Optimization: A Kernel Perspective cs.LG · 2026-05-05 · unverdicted · none · ref 40
Zeroth-order SGD learning dynamics are governed by a random low-dimensional projection of the empirical NTK whose approximation error scales with model output dimension, not parameter count.
Stability and Generalization for Decentralized Markov SGD cs.LG · 2026-05-03 · unverdicted · none · ref 60
Decentralized SGD and SGDA under Markovian sampling admit non-asymptotic generalization bounds that incorporate network topology, Markov mixing rates, and primal-dual dynamics.

Journal of Machine Learning Research , volume=

fields

years

verdicts

representative citing papers

citing papers explorer