Safe reinforcement learning: A control barrier function optimization approach

被引:123
|
作者
Marvi, Zahra [1 ]
Kiumarsi, Bahare [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, 428 S Shaw Lane,Room 2120, E Lansing, MI 48824 USA
关键词
actor; critic; control barrier function; safety; reinforcement learning; QUADRATIC PROGRAMS; SYSTEMS;
D O I
10.1002/rnc.5132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents a learning-based barrier certified method to learn safe optimal controllers that guarantee operation of safety-critical systems within their safe regions while providing an optimal performance. The cost function that encodes the designer's objectives is augmented with a control barrier function (CBF) to ensure safety and optimality. A damping coefficient is incorporated into the CBF which specifies the trade-off between safety and optimality. The proposed formulation provides a look-ahead and proactive safety planning and results in a smooth transition of states within the feasible set. That is, instead of applying an optimal controller and intervening with it only if the safety constraints are violated, the safety is planned and optimized along with the performance to minimize the intervention with the optimal controller. It is shown that addition of the CBF into the cost function does not affect the stability and optimality of the designed controller within the safe region. This formulation enables us to find the optimal safe solution iteratively. An off-policy reinforcement learning (RL) algorithm is then employed to find a safe optimal policy without requiring the complete knowledge about the system dynamics, while satisfies the safety constraints. The efficacy of the proposed safe RL control design approach is demonstrated on the lane keeping as an automotive control problem.
引用
收藏
页码:1923 / 1940
页数:18
相关论文
共 50 条
  • [1] Research on Safe Reinforcement Controller Using Deep Reinforcement Learning with Control Barrier Function
    Ryu Y.-H.
    Oualid D.
    Lee D.-J.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (11) : 1013 - 1021
  • [2] Constrained reinforcement learning with statewise projection: a control barrier function approach
    Jin, Xinze
    Li, Kuo
    Jia, Qingshan
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [3] Constrained reinforcement learning with statewise projection: a control barrier function approach
    Xinze JIN
    Kuo LI
    Qingshan JIA
    Science China(Information Sciences), 2024, 67 (03) : 136 - 154
  • [4] Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems
    Vu, Thanh Long
    Mukherjee, Sayak
    Huang, Renke
    Huang, Qiuhua
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 3652 - 3657
  • [5] Safe Reinforcement Learning for LiDAR-based Navigation via Control Barrier Function
    Song, Lixing
    Ferderer, Luke
    Wu, Shaoen
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 264 - 269
  • [6] Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots
    Zhang, Xinglong
    Peng, Yaoqian
    Pan, Wei
    Xu, Xin
    Xie, Haibin
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5532 - 5538
  • [7] A Novel Online Safe Reinforcement Learning with Control Barrier Function Technique for Autonomous vehicles
    Jabbari, Fatemeh
    Samsami, Reza
    Arefi, Mohammad Mehdi
    2024 10th International Conference on Control, Instrumentation and Automation, ICCIA 2024, 2024,
  • [8] Control Lyapunov-barrier function-based safe reinforcement learning for nonlinear optimal control
    Wang, Yujia
    Wu, Zhe
    AICHE JOURNAL, 2024, 70 (03)
  • [9] Safe Reinforcement Learning Using Robust Control Barrier Functions
    Emam, Yousef
    Notomista, Gennaro
    Glotfelter, Paul
    Kira, Zsolt
    Egerstedt, Magnus
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2886 - 2893
  • [10] Safe RAN control: A Symbolic Reinforcement Learning Approach
    Nikou, Alexandros
    Mujumdar, Anusha
    Sundararajan, Vaishnavi
    Orlic, Marin
    Feljan, Aneta Vulgarakis
    2022 IEEE 17TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA, 2022, : 332 - 337