A review of safe reinforcement learning: Methods, theory and applications
4 Pith papers cite this work.
Citing papers
- Data-Driven Synthesis of Probabilistic Controlled Invariant Sets for Linear MDPs
  Data-driven regularized least squares with self-normalized bounds and lattice abstraction yields a certified (N, ε)-PCIS for linear MDPs via conservative backward recursion.
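The "conservative backward recursion" named in this summary can be illustrated generically: on a finite abstraction, iteratively discard safe states from which no action keeps the system inside the current candidate set with high enough probability, until a fixed point remains. This is a minimal sketch of that idea, not the paper's certified procedure; the function name and interface are hypothetical.

```python
import numpy as np

def pcis_backward_recursion(P, safe_mask, horizon, eps):
    """Conservative backward recursion toward an (N, eps)-PCIS on a finite
    abstraction (generic sketch only).

    P         : (n_states, n_actions, n_states) transition probabilities
    safe_mask : boolean vector marking abstract states inside the safe set
    horizon   : N, number of steps the set must remain invariant
    eps       : tolerated per-step probability of leaving the candidate set
    """
    candidate = safe_mask.copy()
    for _ in range(horizon):
        # probability of landing inside the current candidate set, per (x, u)
        stay_prob = P @ candidate.astype(float)      # shape (n_states, n_actions)
        # keep states with at least one action that stays with prob >= 1 - eps
        keep = (stay_prob >= 1.0 - eps).any(axis=1) & safe_mask
        if np.array_equal(keep, candidate):
            break                                    # fixed point reached
        candidate = keep
    return candidate
```

States pruned early can cascade: removing one state may push its predecessors below the 1 - eps threshold on the next pass, which is why the recursion runs to a fixed point rather than a single sweep.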
- TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive Learning
  TwinGate deploys a stateful dual-encoder system with asymmetric contrastive learning to detect decompositional jailbreaks in untraceable LLM traffic at high recall, a low false-positive rate, and negligible latency.
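The "asymmetric contrastive learning" this summary refers to is typically an InfoNCE-style objective where queries and keys come from two different encoders. Below is a generic NumPy sketch of that objective, not TwinGate's actual model or training code; the function name is hypothetical.

```python
import numpy as np

def asymmetric_infonce(q, k, temperature=0.1):
    """InfoNCE loss for a dual-encoder setup where queries q and keys k come
    from two distinct (asymmetric) encoders.

    q, k : (batch, dim) L2-normalized embeddings; q[i] and k[i] form the
           positive pair, every other row in k is a negative for q[i].
    """
    logits = (q @ k.T) / temperature                 # (batch, batch) similarities
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))              # cross-entropy on positives
```

Training drives the diagonal (matched pairs) above the off-diagonal entries, so at detection time a high similarity between the two encoders' embeddings signals a matched decomposition.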
- Learning Control Policies to Provably Satisfy Hard Affine Constraints for Black-Box Hybrid Dynamical Systems
  The authors introduce affine repulsive RL policies that provably satisfy hard affine state constraints for black-box hybrid dynamical systems with affine reset maps, deriving sufficient closed-loop safety conditions and testing them on pendulum and juggler examples.
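A hard affine constraint has the form a·u <= b, and the simplest way to enforce one at runtime is a closed-form Euclidean projection onto that halfspace. This is a minimal sketch of such a safety filter under that assumption; it is not the paper's repulsive-policy construction or its hybrid reset-map analysis, and the function name is hypothetical.

```python
import numpy as np

def halfspace_filter(action, a, b):
    """Project a proposed action onto the halfspace {u : a @ u <= b}."""
    violation = a @ action - b
    if violation <= 0.0:
        return action                        # already satisfies the constraint
    # closed-form Euclidean projection onto a single halfspace
    return action - (violation / (a @ a)) * a
```

With several simultaneous affine constraints the projection is no longer closed-form and generally requires a small quadratic program; the single-halfspace case above is the cheapest instance of the idea.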
- Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI
  CAMCO enforces policy constraints on multi-agent AI systems at deployment time via convex projection, risk-weighted Lagrangian shaping, and bounded-convergence negotiation, yielding zero violations and 92-97% utility in tested enterprise scenarios.
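"Risk-weighted Lagrangian shaping" generally means penalizing reward by a cost term scaled by a multiplier that grows via dual ascent whenever realized cost exceeds a budget. The sketch below shows one such update step under those assumptions; CAMCO's exact shaping rule and negotiation protocol are not reproduced here, and the function name and parameters are hypothetical.

```python
def lagrangian_shaping(reward, cost, lam, budget, lr=0.1, risk_weight=1.0):
    """One step of risk-weighted Lagrangian reward shaping (generic sketch).

    Returns the shaped reward and the updated multiplier lam.
    """
    shaped = reward - lam * risk_weight * cost
    # dual ascent: grow lam when cost exceeds the budget, shrink it otherwise,
    # clipped at zero so the penalty never becomes a bonus
    lam = max(0.0, lam + lr * (cost - budget))
    return shaped, lam
```

Over repeated steps the multiplier settles near a value that holds average cost at the budget, which is how a soft penalty can approximate a hard deployment-time policy constraint.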