Taming VAEs

Danilo Jimenez Rezende; Fabio Viola

arxiv: 1810.00597 · v1 · pith:4GB5JKBInew · submitted 2018-10-01 · 📊 stat.ML · cs.LG



keywords: constraints, geco, model, vaes, combination, constrained, desired, often


In spite of remarkable progress in deep latent variable generative modeling, training remains a challenge due to a combination of optimization and generalization issues. In practice, a combination of heuristic algorithms (such as hand-crafted annealing of KL-terms) is often used to achieve the desired results, but such solutions are not robust to changes in model architecture or dataset. The best settings can vary dramatically from one problem to another, which requires expensive parameter sweeps for each new case. Here we build on the idea of training VAEs with additional constraints as a way to control their behaviour. We first present a detailed theoretical analysis of constrained VAEs, expanding our understanding of how these models work. We then introduce and analyze a practical algorithm termed Generalized ELBO with Constrained Optimization, GECO. The main advantage of GECO for the machine learning practitioner is a more intuitive, yet principled, process of tuning the loss. This involves defining a set of constraints, which typically have an explicit relation to the desired model performance, in contrast to tweaking abstract hyper-parameters which implicitly affect the model behavior. Encouraging experimental results on several standard datasets indicate that GECO is a very robust and effective tool to balance reconstruction and compression constraints.
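The contrast the abstract draws, between setting an explicit constraint and tuning an abstract loss weight, can be illustrated with a toy sketch. This is not the paper's GECO implementation: it assumes a generic Lagrange-multiplier formulation, and the scalar stand-ins `kl_cost` and `recon_error`, the tolerance, and the step sizes are all hypothetical choices made for illustration. The practitioner specifies only the reconstruction tolerance; the multiplier that weights the constraint is adapted automatically.

```python
import math

# Hypothetical scalar stand-ins for the two competing terms of a VAE loss.
def kl_cost(theta):
    """Toy 'compression' cost: smallest at theta = 0."""
    return theta ** 2

def recon_error(theta):
    """Toy 'reconstruction' error: smallest at theta = 2."""
    return (theta - 2.0) ** 2

def constrained_fit(tolerance=0.25, steps=2000, lr_theta=0.05, lr_lambda=0.05):
    """Minimize kl_cost subject to recon_error(theta) <= tolerance.

    Assumed scheme (not taken from the paper): gradient descent on the
    Lagrangian kl_cost + lam * (recon_error - tolerance) in theta, with a
    multiplicative update that keeps the multiplier lam positive.
    """
    theta, lam = 0.0, 1.0
    for _ in range(steps):
        # Positive when the constraint is violated, negative when satisfied.
        constraint = recon_error(theta) - tolerance
        # Analytic gradient of the Lagrangian with respect to theta.
        grad_theta = 2.0 * theta + lam * 2.0 * (theta - 2.0)
        theta -= lr_theta * grad_theta
        # lam grows while the constraint is violated and shrinks otherwise.
        lam *= math.exp(lr_lambda * constraint)
    return theta, lam

theta, lam = constrained_fit()
print(f"theta={theta:.4f}  lambda={lam:.4f}  recon_error={recon_error(theta):.4f}")
```

In this toy problem the only user-facing knob is `tolerance`, which is stated directly in units of reconstruction error; the relative weight of the two terms is found by the optimization rather than by a parameter sweep.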


Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Generative Robust Optimisation
cs.LG 2026-06 unverdicted novelty 7.0

Generative Robust Optimisation defines uncertainty sets via neural network decoders over latent spaces and evaluates them with a five-point framework, validated on planning problems using Wasserstein autoencoders.

Diffusion-Driven State Space Models
stat.ML 2026-06 unverdicted novelty 7.0

DDSSM replaces Gaussian transitions in SSMs with diffusion models to jointly train autoencoders and diffusion on sequential data, outperforming standard deep SSMs on simulated multimodal time series.

Advances in Scientific Machine Learning for Coupled Fluid Flow and Transport
cs.LG 2026-06 unverdicted novelty 4.0

Reviews linear and nonlinear SciML surrogates for coupled fluid flow and transport, with new PINN modeling of turbidity currents and β-VAE mode extraction from Rayleigh-Bénard convection.