Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model,

· 2020 · arXiv 1907.00953

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Mastering Atari with Discrete World Models

cs.LG · 2020-10-05 · accept · novelty 7.0

DreamerV2 reaches human-level performance on 55 Atari games by learning behaviors inside a separately trained discrete-latent world model.

Dream to Control: Learning Behaviors by Latent Imagination

cs.LG · 2019-12-03 · accept · novelty 7.0

Dreamer learns to control from images by imagining and optimizing behaviors in a learned latent world model, outperforming prior methods on 20 visual tasks in data efficiency and final performance.

Self-Supervised Multisensory Pretraining for Contact-Rich Robot Reinforcement Learning

cs.RO · 2025-11-18 · unverdicted · novelty 6.0

MSDP pretrains a transformer encoder via masked multisensory reconstruction and feeds the embeddings into an asymmetric actor-critic RL setup, yielding faster learning and high real-robot success rates with only 6,000 interactions.

citing papers explorer

Showing 3 of 3 citing papers.

Mastering Atari with Discrete World Models cs.LG · 2020-10-05 · accept · none · ref 36
DreamerV2 reaches human-level performance on 55 Atari games by learning behaviors inside a separately trained discrete-latent world model.
Dream to Control: Learning Behaviors by Latent Imagination cs.LG · 2019-12-03 · accept · none · ref 32
Dreamer learns to control from images by imagining and optimizing behaviors in a learned latent world model, outperforming prior methods on 20 visual tasks in data efficiency and final performance.
Self-Supervised Multisensory Pretraining for Contact-Rich Robot Reinforcement Learning cs.RO · 2025-11-18 · unverdicted · none · ref 30
MSDP pretrains a transformer encoder via masked multisensory reconstruction and feeds the embeddings into an asymmetric actor-critic RL setup, yielding faster learning and high real-robot success rates with only 6,000 interactions.

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model,

fields

years

verdicts

representative citing papers

citing papers explorer