Simple statistical gradient-following algorithms for connectionist reinforcement learning

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: method 1

citation-polarity summary: use method 1

representative citing papers

Density estimation using Real NVP

cs.LG · 2016-05-27 · accept · novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.

Mastering Diverse Domains through World Models

cs.AI · 2023-01-10 · unverdicted · novelty 7.0

DreamerV3 uses world models and robustness techniques to solve over 150 tasks across domains with a single configuration, including Minecraft diamond collection from scratch.

citing papers explorer

Showing 3 of 3 citing papers.

  • Density estimation using Real NVP cs.LG · 2016-05-27 · accept · none · ref 68

    Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.

  • Mastering Diverse Domains through World Models cs.AI · 2023-01-10 · unverdicted · none · ref 31

    DreamerV3 uses world models and robustness techniques to solve over 150 tasks across domains with a single configuration, including Minecraft diamond collection from scratch.

  • Low-Variance and Zero-Variance Baselines for Extensive-Form Games cs.GT · 2019-07-22 · unverdicted · none · ref 29

    Introduces a framework for baseline-corrected values in extensive-form games (EFGs), with new baselines including a predictive one that is provably optimal, yielding zero-variance estimates under certain sampling schemes.