pith. sign in

arxiv: 2202.12176 · v1 · pith:2KHGD3GVnew · submitted 2022-02-24 · 💻 cs.LG

Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum Likelihood

classification 💻 cs.LG
keywords mcmc-basedtheoreticalalgorithmbehindcontrastivedivergenceebmsinterpretation
0
0 comments X
read the original abstract

The Energy-Based Model (EBM) framework is a very general approach to generative modeling that tries to learn and exploit probability distributions only defined though unnormalized scores. It has risen in popularity recently thanks to the impressive results obtained in image generation by parameterizing the distribution with Convolutional Neural Networks (CNN). However, the motivation and theoretical foundations behind modern EBMs are often absent from recent papers and this sometimes results in some confusion. In particular, the theoretical justifications behind the popular MCMC-based learning algorithm Contrastive Divergence (CD) are often glossed over and we find that this leads to theoretical errors in recent influential papers (Du & Mordatch, 2019; Du et al., 2020). After offering a first-principles introduction of MCMC-based training, we argue that the learning algorithm they use can in fact not be described as CD and reinterpret theirs methods in light of a new interpretation. Finally, we discuss the implications of our new interpretation and provide some illustrative experiments.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Contrastive Regularization of Machine Learning Potentials

    physics.chem-ph 2026-06 conditional novelty 7.0

    Contrastive Regularized MSE (CRMSE) corrects distribution-level failures of MSE-trained ML interatomic potentials on ethanol and aspirin from MD17 by penalizing spurious minima sampled from the model's own Langevin chains.