Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.
Simple statistical gradient-following algorithms for connectionist reinforcement learning
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
method 1polarities
use method 1representative citing papers
DreamerV3 uses world models and robustness techniques to solve over 150 tasks across domains with a single configuration, including Minecraft diamond collection from scratch.
A framework for baseline-corrected values in EFGs is introduced, with new baselines including a predictive one that is provably optimal for zero-variance estimates under certain sampling schemes.
citing papers explorer
-
Density estimation using Real NVP
Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.
-
Mastering Diverse Domains through World Models
DreamerV3 uses world models and robustness techniques to solve over 150 tasks across domains with a single configuration, including Minecraft diamond collection from scratch.
-
Low-Variance and Zero-Variance Baselines for Extensive-Form Games
A framework for baseline-corrected values in EFGs is introduced, with new baselines including a predictive one that is provably optimal for zero-variance estimates under certain sampling schemes.