Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics

被引:0
|
作者
Yuan, Zhaocong [1 ,2 ,3 ]
Hall, Adam W. [1 ,2 ,3 ]
Zhou, Siqi [1 ,2 ,3 ]
Brunke, Lukas [1 ,2 ,3 ]
Greeff, Melissa [1 ,2 ,3 ]
Panerati, Jacopo [1 ,2 ,3 ]
Schoellig, Angela P. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Dynam Syst Lab, Toronto, ON M5S 1A1, Canada
[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON M5S 1A1, Canada
[3] Toronto Univ Toronto, Vector Inst Artificial Intelligence, Toronto, ON M5S 1A1, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
Machine learning for robot control; reinforcement learning; robot safety; software tools for benchmarking and reproducibility; PHYSICS;
D O I
10.1109/LRA.2022.3196132
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In recent years, both reinforcement learning and learning-based control-as well as the study of their safety, which is crucial for deployment in real-world robots-have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques. We provide implementations for three dynamic systems-the cart-pole, the 1D, and 2D quadrotor-and two control tasks-stabilization and trajectory tracking. We propose to extend OpenAI's Gym API-the de facto standard in reinforcement learning research-with (i) the ability to specify (and query) symbolic dynamics and (ii) constraints, and (iii) (repeatably) inject simulated disturbances in the control inputs, state measurements, and inertial properties. To demonstrate our proposal and in an attempt to bring research communities closer together, we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the fields of traditional control, learning-based control, and reinforcement learning.
引用
收藏
页码:11142 / 11149
页数:8
相关论文
共 50 条
  • [31] Tracking interval control for urban rail trains based on safe reinforcement learning
    Lin, Junting
    Qiu, Xiaohui
    Li, Maolin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [32] Safe reinforcement learning: A control barrier function optimization approach
    Marvi, Zahra
    Kiumarsi, Bahare
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (06) : 1923 - 1940
  • [33] Safe chance constrained reinforcement learning for batch process control
    Mowbray, M.
    Petsagkourakis, R.
    del Rio-Chanona, E. A.
    Zhang, D.
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2022, 157
  • [34] Safe Reinforcement Learning for Mixed-Autonomy Platoon Control
    Zhou, Jingyuan
    Yan, Longhao
    Yang, Kaidi
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5744 - 5749
  • [35] Safe Building HVAC Control via Batch Reinforcement Learning
    Zhang, Chi
    Kuppannagari, Sanmukh Rao
    Prasanna, Viktor K.
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2022, 7 (04): : 923 - 934
  • [36] Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning
    Lederer, Armin
    Noorani, Erfaun
    Baras, John S.
    Hirche, Sandra
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1040 - 1045
  • [37] Safe deep reinforcement learning in diesel engine emission control
    Norouzi, Armin
    Shahpouri, Saeid
    Gordon, David
    Shahbakhti, Mahdi
    Koch, Charles Robert
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2023, 237 (08) : 1440 - 1453
  • [38] A safe reinforcement learning algorithm for supervisory control of power plants
    Sun, Yixuan
    Khairy, Sami
    Vilim, Richard B.
    Hu, Rui
    Dave, Akshay J.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [39] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
    Du, Desong
    Han, Shaohang
    Qi, Naiming
    Ammar, Haitham Bou
    Wang, Jun
    Pan, Wei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
  • [40] Safe Learning-Based Control of Elastic Joint Robots via Control Barrier Functions L
    Lederer, Armin
    Begzadie, Azra
    Das, Neha
    Hirche, Sandra
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 2250 - 2256