A Review of Safe Reinforcement Learning: Methods, Theories, and Applications

被引:4
|
作者
Gu, Shangding [1 ]
Yang, Long [3 ]
Du, Yali [4 ]
Chen, Guang [5 ]
Walter, Florian [2 ]
Wang, Jun [6 ]
Knoll, Alois [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Tech Univ Munich, Dept Informat, D-85748 Munich, Germany
[3] Peking Univ, Inst AI, Beijing 100871, Peoples R China
[4] Kings Coll London, Dept Informat, London WC1E 6EB, England
[5] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[6] UCL, Dept Comp Sci, London WC1E 6BT, England
基金
中国国家自然科学基金;
关键词
Safe reinforcement learning (RL); safety optimisation; constrained Markov decision processes; safety problems; MARKOV DECISION-PROCESSES; ACTOR-CRITIC ALGORITHM; APPROXIMATION; MODEL; NETWORKS; POLICIES; CHAINS;
D O I
10.1109/TPAMI.2024.3457538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-making tasks. However, safety concerns are raised during deploying RL in real-world applications, leading to a growing demand for safe RL algorithms, such as in autonomous driving and robotics scenarios. While safe control has a long history, the study of safe RL algorithms is still in the early stages. To establish a good foundation for future safe RL research, in this paper, we provide a review of safe RL from the perspectives of methods, theories, and applications. First, we review the progress of safe RL from five dimensions and come up with five crucial problems for safe RL being deployed in real-world applications, coined as "2H3W". Second, we analyze the algorithm and theory progress from the perspectives of answering the "2H3W" problems. Particularly, the sample complexity of safe RL algorithms is reviewed and discussed, followed by an introduction to the applications and benchmarks of safe RL algorithms. Finally, we open the discussion of the challenging problems in safe RL, hoping to inspire future research on this thread. To advance the study of safe RL algorithms, we release an open-sourced repository containing major safe RL algorithms at the link.
引用
收藏
页码:11216 / 11235
页数:20
相关论文
共 50 条
  • [1] Shielded Reinforcement Learning: A review of reactive methods for safe learning
    Odriozola-Olalde, Haritz
    Zamalloa, Maider
    Arana-Arexolaleiba, Nestor
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [2] Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
    Mosavi, Amirhosein
    Faghan, Yaser
    Ghamisi, Pedram
    Puhong Duan
    Ardabili, Sina Faizollahzadeh
    Salwana, Ely
    Band, Shahab S.
    MATHEMATICS, 2020, 8 (10)
  • [3] Explainability in Deep Reinforcement Learning: A Review into Current Methods and Applications
    Hickling, Thomas
    Zenati, Abdelhafid
    Aouf, Nabil
    Spencer, Phillippa
    ACM COMPUTING SURVEYS, 2024, 56 (05)
  • [4] Extending the Capabilities of Reinforcement Learning Through Curriculum: A Review of Methods and Applications
    Kashish Gupta
    Debasmita Mukherjee
    Homayoun Najjaran
    SN Computer Science, 2022, 3 (1)
  • [5] Safe reinforcement learning and its applications in robotics: A survey
    Zhang C.-X.
    Zhang X.-L.
    Xu X.
    Lu Y.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2090 - 2103
  • [6] Safe Reinforcement Learning via Formal Methods: Toward Safe Control Through Proof and Learning
    Fulton, Nathan
    Platzer, Andre
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6485 - 6492
  • [7] A Review of Reinforcement Learning in Financial Applications
    Bai, Yahui
    Gao, Yuhe
    Wan, Runzhe
    Zhang, Sheng
    Song, Rui
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2025, 12 : 209 - 232
  • [8] A review of the applications and hotspots of reinforcement learning
    Hou, Jun
    Li, Hua
    Hu, Jinwen
    Zhao, Chunhui
    Guo, Yaning
    Li, Sijia
    Pan, Quan
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2017, : 506 - 511
  • [9] Formal Methods Assisted Training of Safe Reinforcement Learning Agents
    Murugesan, Anitha
    Moghadamfalahi, Mohammad
    Chattopadhyay, Arunabh
    NASA FORMAL METHODS (NFM 2019), 2019, 11460 : 333 - 340
  • [10] On Normative Reinforcement Learning via Safe Reinforcement Learning
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    PRIMA 2022: PRINCIPLES AND PRACTICE OF MULTI-AGENT SYSTEMS, 2023, 13753 : 72 - 89