Constrained reinforcement learning with statewise projection: a control barrier function approach

被引:0
|
作者
Xinze JIN
Kuo LI
Qingshan JIA
机构
[1] Tsinghua University
[2] Department of Automation
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Safety is a critical issue for reinforcement learning(RL), as it may be risky for some actual applications if the learning process involves unsafe exploration. Instead of formulating constraints as expectationbased in constrained RL, considering statewise safety in constrained RL is more meaningful. This work aims to address the issue of safe projection in RL by introducing a control barrier function that inherently learns a safe policy through a set certificate. We seek to analyze some theoretical properties of safe projection in the learning process, including convergence and performance bound, and extend the discussion into ensembles and guided controllers. Moreover, we approach analytical solutions for deterministic and stochastic system dynamics. Experimental results in different tasks show that the proposed method achieves better effects in terms of both performance and safety.
引用
收藏
页码:136 / 154
页数:19
相关论文
共 50 条
  • [1] Constrained reinforcement learning with statewise projection: a control barrier function approach
    Jin, Xinze
    Li, Kuo
    Jia, Qingshan
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [2] Safe reinforcement learning: A control barrier function optimization approach
    Marvi, Zahra
    Kiumarsi, Bahare
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (06) : 1923 - 1940
  • [3] Model-based Constrained Reinforcement Learning using Generalized Control Barrier Function
    Ma, Haitong
    Chen, Jianyu
    Eben, Shengbo
    Lin, Ziyu
    Guan, Yang
    Ren, Yangang
    Zheng, Sifa
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4552 - 4559
  • [4] Research on Safe Reinforcement Controller Using Deep Reinforcement Learning with Control Barrier Function
    Ryu Y.-H.
    Oualid D.
    Lee D.-J.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (11) : 1013 - 1021
  • [5] Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement Learning Approach
    Yan, Shuchang
    Zhu, Futang
    Wu, Jinlong
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2025, 11 (01): : 2660 - 2673
  • [6] Reinforcement Learning for Continuous Control: A Quantum Normalized Advantage Function Approach
    Liu, Yaofu
    Xu, Chang
    Jin, Siyuan
    2023 IEEE INTERNATIONAL CONFERENCE ON QUANTUM SOFTWARE, QSW, 2023, : 83 - 87
  • [7] Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems
    Vu, Thanh Long
    Mukherjee, Sayak
    Huang, Renke
    Huang, Qiuhua
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 3652 - 3657
  • [8] Safe Reinforcement Learning for LiDAR-based Navigation via Control Barrier Function
    Song, Lixing
    Ferderer, Luke
    Wu, Shaoen
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 264 - 269
  • [9] A Novel Online Safe Reinforcement Learning with Control Barrier Function Technique for Autonomous vehicles
    Jabbari, Fatemeh
    Samsami, Reza
    Arefi, Mohammad Mehdi
    2024 10th International Conference on Control, Instrumentation and Automation, ICCIA 2024, 2024,
  • [10] Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots
    Zhang, Xinglong
    Peng, Yaoqian
    Pan, Wei
    Xu, Xin
    Xie, Haibin
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5532 - 5538