NPG-based actor-critic with Lagrangian for model-free chance-constrained LQG, claiming linear convergence, critic convergence via TD(0), and no duality gap.
Learning control barrier functions and their application in reinforcement learning: A survey,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Robust Koopman-CBF SAC learns Koopman predictors from data, tightens lifted CBF constraints with a data-estimated residual margin, and applies a QP safety filter inside SAC, reporting zero constraint violations on CartPole while matching unconstrained returns.
citing papers explorer
-
Robust Koopman Control Barrier Filters for Safe Actor-Critic Reinforcement Learning
Robust Koopman-CBF SAC learns Koopman predictors from data, tightens lifted CBF constraints with a data-estimated residual margin, and applies a QP safety filter inside SAC, reporting zero constraint violations on CartPole while matching unconstrained returns.