SafeDreamer: Safe reinforcement learning with world models. arXiv preprint arXiv:2307.07176

SafeDreamer: Safe reinforcement learning with world models · 2024 · arXiv 2307.07176

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Latent Chain-of-Thought World Modeling for End-to-End Driving

cs.CV · 2025-12-11 · unverdicted · novelty 7.0

LCDrive unifies chain-of-thought reasoning and action selection for end-to-end driving by interleaving action-proposal tokens and latent world-model tokens that predict action outcomes, yielding faster inference and better trajectories than text-based or non-reasoning baselines.

Human Cognition in Machines: A Unified Perspective of World Models

cs.RO · 2026-04-17 · unverdicted · novelty 6.0

The paper introduces a unified framework for world models that fully incorporates all cognitive functions from Cognitive Architecture Theory, highlights under-researched areas in motivation and meta-cognition, and proposes Epistemic World Models as a new category for scientific discovery agents.

Safety, Security, and Cognitive Risks in World Models

cs.CR · 2026-04-01 · unverdicted · novelty 6.0

World models enable efficient AI planning but create risks from adversarial corruption, goal misgeneralization, and human bias, demonstrated via attacks that amplify errors and reduce rewards on models like RSSM and DreamerV3.

Safe and Generalizable Hierarchical Multi-Agent RL via Constraint Manifold Control

cs.AI · 2026-06-22 · unverdicted · novelty 5.0

Proposes a hierarchical MARL framework that enforces safety via a constraint manifold at the low level, with theoretical guarantees and stationary dynamics for stable training and generalization.

SHAPO: Sharpness-Aware Policy Optimization for Safe Exploration

cs.LG · 2026-06-08 · unverdicted · novelty 5.0

SHAPO adds a sharpness-aware adjustment to policy optimization that reweights gradients to favor conservative behavior in uncertain areas, yielding better safety-performance tradeoffs on continuous control tasks.

citing papers explorer

Showing 5 of 5 citing papers.

Latent Chain-of-Thought World Modeling for End-to-End Driving · cs.CV · 2025-12-11 · unverdicted · none · ref 13
Human Cognition in Machines: A Unified Perspective of World Models · cs.RO · 2026-04-17 · unverdicted · none · ref 69
Safety, Security, and Cognitive Risks in World Models · cs.CR · 2026-04-01 · unverdicted · none · ref 30
Safe and Generalizable Hierarchical Multi-Agent RL via Constraint Manifold Control · cs.AI · 2026-06-22 · unverdicted · none · ref 22
SHAPO: Sharpness-Aware Policy Optimization for Safe Exploration · cs.LG · 2026-06-08 · unverdicted · none · ref 7
