Safe Learning Control with Optimality and Stability Guarantees

Hongwei Zhang; Martin Guay; Shimin Wang; Wei Xiao; Xinyang Wang

arxiv: 2501.15373 · v2 · pith:WU2UXNCCnew · submitted 2025-01-26 · 📡 eess.SY · cs.AI · cs.LG · cs.SY · math.OC · nlin.AO

Xinyang Wang, Hongwei Zhang, Shimin Wang, Wei Xiao, Martin Guay

classification 📡 eess.SY cs.AI cs.LG cs.SY math.OC nlin.AO

keywords performance safety control guarantee proposed safe barrier cbfs

abstract

Pursuing performance alone may compromise safety, while a conservative policy for safe exploration degrades performance. Guaranteeing both safety and performance in learning-based control is an interesting yet challenging problem. This paper aims to enhance system performance under a safety guarantee by solving reinforcement learning (RL)-based optimal control problems for nonlinear systems subject to high-relative-degree state constraints and unknown time-varying disturbances/actuator faults. A new type of control barrier function (CBF), termed the high-order reciprocal-based control barrier function, is proposed to handle high-relative-degree constraints; it extends CBF design to enforce robust safety without knowledge of the disturbance bound. The concept of gradient similarity is proposed to quantify the relationship between safety and performance. Finally, gradient manipulation and adaptive mechanisms are introduced into the model-based safe RL framework to enhance performance under a safety guarantee. Two simulation examples illustrate the efficacy of the proposed algorithms.
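The abstract does not define gradient similarity or gradient manipulation, so the following is only an illustrative sketch under stated assumptions, not the paper's method. It assumes that similarity means the cosine of the angle between a performance gradient and a safety gradient, and that manipulation means removing the conflicting component of the performance gradient by projection, in the spirit of projection-based gradient surgery used in multi-task learning. The names `gradient_similarity`, `manipulate_gradient`, `g_perf`, and `g_safe` are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors given as sequences of floats."""
    return sum(a * b for a, b in zip(u, v))


def gradient_similarity(g_perf, g_safe, eps=1e-12):
    """Cosine similarity between the two gradients (an assumed definition).

    Returns a value in [-1, 1]; negative values mean the performance and
    safety objectives pull the parameters in conflicting directions.
    """
    denom = math.sqrt(dot(g_perf, g_perf)) * math.sqrt(dot(g_safe, g_safe))
    if denom < eps:
        return 0.0  # treat a vanishing gradient as "no conflict"
    return dot(g_perf, g_safe) / denom


def manipulate_gradient(g_perf, g_safe, eps=1e-12):
    """Project out the part of g_perf that opposes g_safe (assumed rule).

    If the gradients do not conflict, g_perf is returned unchanged.
    """
    safe_sq = dot(g_safe, g_safe)
    if safe_sq < eps or dot(g_perf, g_safe) >= 0.0:
        return list(g_perf)
    scale = dot(g_perf, g_safe) / safe_sq
    return [p - scale * s for p, s in zip(g_perf, g_safe)]


if __name__ == "__main__":
    g_perf = [1.0, 0.0]   # hypothetical performance gradient
    g_safe = [-1.0, 1.0]  # hypothetical safety gradient
    print(gradient_similarity(g_perf, g_safe))
    print(manipulate_gradient(g_perf, g_safe))
```

Under these assumptions, the manipulated update is orthogonal to the safety gradient whenever the two objectives conflict, so a performance step no longer works against the safety direction to first order.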

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Synthesizing Safety in Infinite-Horizon Optimal Control for Disturbed High-Relative-Degree Systems via Barrier-Regulating Auxiliary Variables
eess.SY · 2026-04 · unverdicted · novelty 5.0

A framework reformulates safety-constrained infinite-horizon optimal control as an unconstrained problem on an extended state space using barrier-Lyapunov functions, auxiliary variables, adaptive excitation, and onlin...