Kalman Gradient Descent: Adaptive Variance Reduction in Stochastic Optimization

James Vuckovic

classification stat.ML cs.LG math.OC

keywords gradient algorithm descent kalman optimization stochastic filtering variance


We introduce Kalman Gradient Descent, a stochastic optimization algorithm that uses Kalman filtering to adaptively reduce gradient variance in stochastic gradient descent by filtering the gradient estimates. We present both a theoretical analysis of convergence in a non-convex setting and experimental results that demonstrate improved performance in a variety of machine learning areas, including neural networks and black box variational inference. We also present a distributed version of our algorithm that enables large-dimensional optimization, and we extend our algorithm to SGD with momentum and RMSProp.
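To make the idea of filtering gradient estimates concrete, here is a minimal sketch and not the paper's algorithm. The abstract does not state the state-space model, so the sketch assumes the simplest possible one: a scalar gradient that follows a random walk with process variance `q` and is observed with noise variance `r`. The names `kalman_step`, `filtered_vs_raw_mse`, and `filtered_sgd`, along with all default parameter values, are hypothetical and chosen only for illustration.

```python
import random


def kalman_step(g_obs, g_est, p_est, q=1e-2, r=1.0):
    """One scalar Kalman update for a gradient estimate.

    Assumed model (not taken from the paper): the true gradient is a
    random walk with process variance q, observed with noise variance r.
    """
    # Predict: the estimate is carried over, its uncertainty grows by q.
    p_pred = p_est + q
    # Update: the Kalman gain weighs the noisy observation against the prediction.
    gain = p_pred / (p_pred + r)
    g_new = g_est + gain * (g_obs - g_est)
    p_new = (1.0 - gain) * p_pred
    return g_new, p_new


def filtered_vs_raw_mse(true_grad=2.0, noise_std=1.0, steps=1000, seed=0):
    """Compare mean squared error of raw and filtered estimates of a fixed gradient."""
    rng = random.Random(seed)
    g_est, p_est = 0.0, 1.0
    raw_se = 0.0
    filt_se = 0.0
    for _ in range(steps):
        g_obs = true_grad + rng.gauss(0.0, noise_std)
        g_est, p_est = kalman_step(g_obs, g_est, p_est, r=noise_std ** 2)
        raw_se += (g_obs - true_grad) ** 2
        filt_se += (g_est - true_grad) ** 2
    return raw_se / steps, filt_se / steps


def filtered_sgd(x0=5.0, lr=0.1, noise_std=1.0, steps=500, seed=0):
    """Gradient descent on f(x) = 0.5 * x**2 using the filtered gradient."""
    rng = random.Random(seed)
    x, g_est, p_est = x0, 0.0, 1.0
    for _ in range(steps):
        g_obs = x + rng.gauss(0.0, noise_std)  # noisy gradient of f at x
        g_est, p_est = kalman_step(g_obs, g_est, p_est, r=noise_std ** 2)
        x -= lr * g_est  # step along the filtered estimate, not the raw one
    return x
```

In this toy model, `q` and `r` control the trade-off: a small `q` relative to `r` gives a small gain, so the estimate is smoother but lags behind changes in the true gradient. `filtered_vs_raw_mse` only illustrates variance reduction for a fixed gradient; it makes no claim about the convergence behaviour analysed in the paper.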



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FIBER: A Differentially Private Optimizer with Filter-Aware Innovation Bias Correction
cs.LG 2026-05 unverdicted novelty 7.0

FiBeR adds a closed-form filter-aware correction A(ω)σ_w² to the second-moment term for temporally filtered DP gradients, improving adaptive optimization performance.