Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

被引:0
|
作者
Ji, Jiaming [1 ]
Zhang, Borong [1 ]
Zhou, Jiayi [1 ]
Pan, Xuehai [1 ]
Huang, Weidong [1 ]
Sun, Ruiyang [1 ]
Geng, Yiran [1 ]
Zhong, Yifan [1 ,2 ]
Dai, Juntao [1 ]
Yang, Yaodong [1 ]
机构
[1] Peking Univ, Inst AI, Beijing, Peoples R China
[2] Beijing Inst Gen Artificial Intelligence BIGAI, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial intelligence (AI) systems possess significant potential to drive societal progress. However, their deployment often faces obstacles due to substantial safety concerns. Safe reinforcement learning (SafeRL) emerges as a solution to optimize policies while simultaneously adhering to multiple constraints, thereby addressing the challenge of integrating reinforcement learning in safety-critical scenarios. In this paper, we present an environment suite called Safety-Gymnasium, which encompasses safety-critical tasks in both single and multi-agent scenarios, accepting vector and vision-only input. Additionally, we offer a library of algorithms named Safe Policy Optimization (SafePO), comprising 16 state-of-the-art SafeRL algorithms. This comprehensive library can serve as a validation tool for the research community. By introducing this benchmark, we aim to facilitate the evaluation and comparison of safety performance, thus fostering the development of reinforcement learning for safer, more reliable, and responsible real-world applications. The website of this project can be accessed at https://sites.google.com/view/safety-gymnasium.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics
    Yuan, Zhaocong
    Hall, Adam W.
    Zhou, Siqi
    Brunke, Lukas
    Greeff, Melissa
    Panerati, Jacopo
    Schoellig, Angela P.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04): : 11142 - 11149
  • [2] Reducing Safety Interventions in Provably Safe Reinforcement Learning
    Thumm, Jakob
    Pelat, Guillaume
    Althoff, Matthias
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7515 - 7522
  • [3] On Normative Reinforcement Learning via Safe Reinforcement Learning
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    [J]. PRIMA 2022: PRINCIPLES AND PRACTICE OF MULTI-AGENT SYSTEMS, 2023, 13753 : 72 - 89
  • [4] Safe Reinforcement Learning via a Model-Free Safety Certifier
    Modares, Amir
    Sadati, Nasser
    Esmaeili, Babak
    Yaghmaie, Farnaz Adib
    Modares, Hamidreza
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3302 - 3311
  • [5] COMPOSUITE: A COMPOSITIONAL REINFORCEMENT LEARNING BENCHMARK
    Mendez, Jorge A.
    Hussing, Marcel
    Gummadi, Meghna
    Eaton, Eric
    [J]. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [6] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [7] Safe Reinforcement Learning: A Survey
    Wang, Xue-Song
    Wang, Rong-Rong
    Cheng, Yu-Hu
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1813 - 1835
  • [8] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    [J]. Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [9] Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning
    Kamran, Danial
    Engelgeh, Tizian
    Busch, Marvin
    Fischer, Johannes
    Stiller, Christoph
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1236 - 1243
  • [10] Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
    Gu, Shangding
    Sel, Bilgehan
    Ding, Yuhao
    Wang, Lu
    Lin, Qingwei
    Jin, Ming
    Knoll, Alois
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21099 - 21106