pith. machine review for the scientific record. sign in

arxiv: 1206.7051 · v3 · submitted 2012-06-29 · 📊 stat.ML · cs.AI· stat.CO· stat.ME

Recognition: unknown

Stochastic Variational Inference

Chong Wang, David M. Blei, John Paisley, Matt Hoffman

Authors on Pith no claims yet
classification 📊 stat.ML cs.AIstat.COstat.ME
keywords inferencestochasticvariationalarticlesmodelstopicbayesiandata
0
0 comments X
read the original abstract

We develop stochastic variational inference, a scalable algorithm for approximating posterior distributions. We develop this technique for a large class of probabilistic models and we demonstrate it with two probabilistic topic models, latent Dirichlet allocation and the hierarchical Dirichlet process topic model. Using stochastic variational inference, we analyze several large collections of documents: 300K articles from Nature, 1.8M articles from The New York Times, and 3.8M articles from Wikipedia. Stochastic inference can easily handle data sets of this size and outperforms traditional variational inference, which can only handle a smaller subset. (We also show that the Bayesian nonparametric topic model outperforms its parametric counterpart.) Stochastic variational inference lets us apply complex Bayesian models to massive data sets.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. High-dimensional inference for the $\gamma$-ray sky with differentiable programming

    astro-ph.HE 2026-04 unverdicted novelty 7.0

    A differentiable forward model and likelihood enable probabilistic inference over many spatial morphologies for the Galactic Center gamma-ray Excess using variational methods on GPUs.

  2. Bayesian Modeling and Prediction of Generalized Contact Matrices

    stat.ME 2026-05 unverdicted novelty 6.0

    A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/Germa...