
arXiv: 1810.12273 · v1 · submitted 2018-10-29 · stat.ML · cs.LG · math.OC

Recognition: unknown

Kalman Gradient Descent: Adaptive Variance Reduction in Stochastic Optimization

James Vuckovic

classification: stat.ML, cs.LG, math.OC
keywords: gradient, algorithm, descent, kalman, optimization, stochastic, filtering, variance

We introduce Kalman Gradient Descent, a stochastic optimization algorithm that uses Kalman filtering to adaptively reduce gradient variance in stochastic gradient descent by filtering the gradient estimates. We present both a theoretical analysis of convergence in a non-convex setting and experimental results which demonstrate improved performance on a variety of machine learning areas including neural networks and black box variational inference. We also present a distributed version of our algorithm that enables large-dimensional optimization, and we extend our algorithm to SGD with momentum and RMSProp.
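The idea of filtering gradient estimates before taking a descent step can be illustrated with a minimal sketch. This is not the paper's formulation: it assumes an independent scalar Kalman filter per coordinate with a random-walk model for the true gradient, and the names `ScalarKalmanGradientFilter`, `kalman_gd`, `process_var`, and `obs_var` are hypothetical, chosen only for this illustration.

```python
import numpy as np


class ScalarKalmanGradientFilter:
    """Per-coordinate Kalman filter for noisy gradients (illustrative only).

    Assumed model, not taken from the paper:
      true gradient:  g_t = g_{t-1} + w_t,  w_t ~ N(0, process_var)
      observation:    y_t = g_t + v_t,      v_t ~ N(0, obs_var)
    """

    def __init__(self, dim, process_var=1e-3, obs_var=1.0):
        self.g_hat = np.zeros(dim)  # filtered gradient estimate
        self.p = np.ones(dim)       # variance of the estimate, per coordinate
        self.q = process_var
        self.r = obs_var

    def update(self, noisy_grad):
        # Predict: under a random walk the mean is unchanged, uncertainty grows.
        self.p = self.p + self.q
        # Correct: blend the prediction with the new noisy observation.
        gain = self.p / (self.p + self.r)
        self.g_hat = self.g_hat + gain * (noisy_grad - self.g_hat)
        self.p = (1.0 - gain) * self.p
        return self.g_hat


def kalman_gd(noisy_grad_fn, x0, lr=0.1, steps=500, **filter_kwargs):
    """Gradient descent that steps along the filtered gradient estimate."""
    x = np.asarray(x0, dtype=float).copy()
    filt = ScalarKalmanGradientFilter(x.size, **filter_kwargs)
    for _ in range(steps):
        x = x - lr * filt.update(noisy_grad_fn(x))
    return x


rng = np.random.default_rng(0)

# 1) Filtering alone: a constant true gradient of 1.0 observed in unit noise.
filt = ScalarKalmanGradientFilter(dim=1)
raw = 1.0 + rng.normal(size=2000)
filtered = np.array([filt.update(np.array([y]))[0] for y in raw])

# 2) Descent on f(x) = 0.5 * ||x||^2, whose exact gradient is x.
x0 = np.full(5, 3.0)
x_final = kalman_gd(lambda x: x + rng.normal(size=x.shape), x0)
```

In this toy setup the filtered sequence fluctuates far less than the raw observations, at the cost of some lag in tracking a gradient that changes. The ratio `process_var / obs_var` controls that trade-off, and the values used here are arbitrary defaults rather than recommendations from the paper.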

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. FIBER: A Differentially Private Optimizer with Filter-Aware Innovation Bias Correction

    cs.LG · 2026-05 · unverdicted · novelty 7.0

    FiBeR adds a closed-form filter-aware correction A(ω)σ_w² to the second-moment term for temporally filtered DP gradients, improving adaptive optimization performance.