pith. machine review for the scientific record. sign in

arxiv: 1703.07370 · v4 · submitted 2017-03-21 · 💻 cs.LG · stat.ML

Recognition: unknown

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Authors on Pith no claims yet
classification 💻 cs.LG stat.ML
keywords gradientdiscreteestimateslow-variancerelaxationvarianceapproachescontinuous
0
0 comments X
read the original abstract

Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al. 2016, Maddison et al. 2016) has taken a different approach, introducing a continuous relaxation of discrete variables to produce low-variance, but biased, gradient estimates. In this work, we combine the two approaches through a novel control variate that produces low-variance, \emph{unbiased} gradient estimates. Then, we introduce a modification to the continuous relaxation and show that the tightness of the relaxation can be adapted online, removing it as a hyperparameter. We show state-of-the-art variance reduction on several benchmark generative modeling tasks, generally leading to faster convergence to a better final log-likelihood.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Theorems of Dr. David Blackwell and Their Contributions to Artificial Intelligence

    cs.GL 2026-04 unverdicted novelty 2.0

    Blackwell's Rao-Blackwell, Approachability, and Informativeness theorems provide frameworks for variance reduction, sequential decisions under uncertainty, and comparing information sources that remain relevant to AI.