Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming

Guanghui Lan; Saeed Ghadimi

arxiv: 1309.5549 · v1 · pith:UB3MTVNLnew · submitted 2013-09-22 · 🧮 math.OC · cs.CC · stat.ML

keywords stochastic, method, algorithm, programming, class, methods, nonconvex, nonlinear

In this paper, we introduce a new stochastic approximation (SA) type algorithm, namely the randomized stochastic gradient (RSG) method, for solving an important class of nonlinear (possibly nonconvex) stochastic programming (SP) problems. We establish the complexity of this method for computing an approximate stationary point of a nonlinear programming problem. We also show that this method possesses a nearly optimal rate of convergence if the problem is convex. We discuss a variant of the algorithm that applies a post-optimization phase to evaluate a short list of solutions generated by several independent runs of the RSG method, and show that this modification significantly improves the large-deviation properties of the algorithm. These methods are then specialized for solving a class of simulation-based optimization problems in which only stochastic zeroth-order information is available.
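The abstract describes the RSG method and its post-optimization variant only at a high level, so the following is a minimal sketch under stated assumptions rather than the authors' algorithm. It assumes a constant step size, a stopping iteration drawn uniformly at random, and a post-optimization phase that picks the candidate with the smallest estimated gradient norm. All function names, the toy objective, and the parameter values are hypothetical and chosen for illustration.

```python
import math
import random


def rsg(stoch_grad, x0, step_size, max_iters, rng):
    """One RSG-style run: SGD stopped at a randomly drawn iteration count."""
    # Assumption: constant step size, with the stopping index drawn uniformly
    # from 1..max_iters. The abstract does not specify this distribution.
    stop = rng.randint(1, max_iters)
    x = list(x0)
    for _ in range(stop):
        g = stoch_grad(x, rng)
        x = [xi - step_size * gi for xi, gi in zip(x, g)]
    return x


def estimated_grad_norm(stoch_grad, x, samples, rng):
    """Norm of the average of `samples` fresh stochastic gradients at x."""
    total = [0.0] * len(x)
    for _ in range(samples):
        g = stoch_grad(x, rng)
        total = [t + gi for t, gi in zip(total, g)]
    return math.sqrt(sum((t / samples) ** 2 for t in total))


def two_phase_rsg(stoch_grad, x0, step_size, max_iters, runs, samples, rng):
    """Run RSG several times, then keep the candidate whose estimated
    gradient norm is smallest (an assumed post-optimization criterion)."""
    candidates = [rsg(stoch_grad, x0, step_size, max_iters, rng) for _ in range(runs)]
    scored = [(estimated_grad_norm(stoch_grad, c, samples, rng), c) for c in candidates]
    return min(scored, key=lambda pair: pair[0])


def toy_stoch_grad(x, rng, noise=0.1):
    """Hypothetical oracle: gradient of the smooth nonconvex function
    sum(x_i^2 / (1 + x_i^2)), perturbed by Gaussian noise."""
    return [2.0 * xi / (1.0 + xi * xi) ** 2 + rng.gauss(0.0, noise) for xi in x]


rng = random.Random(0)
norm, point = two_phase_rsg(toy_stoch_grad, [2.0, -1.5], 0.1, 500, 5, 200, rng)
print(norm, point)
```

Returning a randomly chosen iterate rather than the last one is what gives the method its name. Selecting among several independent runs is one plausible way to reduce the chance that a single unlucky run is reported, which is the large-deviation concern the abstract mentions.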

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Parameter Aggregation: Semantic Consensus for Federated Fine-Tuning of LLMs
cs.LG · 2026-05 · unverdicted · novelty 7.0

Semantic consensus on model outputs for public prompts enables federated LLM fine-tuning that matches parameter-aggregation baselines with orders-of-magnitude lower communication.