Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics

被引：0

作者：

Yuan, Zhaocong ^{[1
,2
,3
]}

Hall, Adam W. ^{[1
,2
,3
]}

Zhou, Siqi ^{[1
,2
,3
]}

Brunke, Lukas ^{[1
,2
,3
]}

Greeff, Melissa ^{[1
,2
,3
]}

Panerati, Jacopo ^{[1
,2
,3
]}

Schoellig, Angela P. ^{[1
,2
,3
]}

机构：

[1] Univ Toronto, Dynam Syst Lab, Toronto, ON M5S 1A1, Canada

[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON M5S 1A1, Canada

[3] Toronto Univ Toronto, Vector Inst Artificial Intelligence, Toronto, ON M5S 1A1, Canada

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022年 / 7卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Machine learning for robot control; reinforcement learning; robot safety; software tools for benchmarking and reproducibility; PHYSICS;

D O I：

10.1109/LRA.2022.3196132

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In recent years, both reinforcement learning and learning-based control-as well as the study of their safety, which is crucial for deployment in real-world robots-have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques. We provide implementations for three dynamic systems-the cart-pole, the 1D, and 2D quadrotor-and two control tasks-stabilization and trajectory tracking. We propose to extend OpenAI's Gym API-the de facto standard in reinforcement learning research-with (i) the ability to specify (and query) symbolic dynamics and (ii) constraints, and (iii) (repeatably) inject simulated disturbances in the control inputs, state measurements, and inertial properties. To demonstrate our proposal and in an attempt to bring research communities closer together, we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the fields of traditional control, learning-based control, and reinforcement learning.

引用

页码：11142 / 11149

页数：8

共 50 条

[31] Tracking interval control for urban rail trains based on safe reinforcement learning
Lin, Junting
Qiu, Xiaohui
Li, Maolin
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
[32] Safe reinforcement learning: A control barrier function optimization approach
Marvi, Zahra
Kiumarsi, Bahare
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (06) : 1923 - 1940
[33] Safe chance constrained reinforcement learning for batch process control
Mowbray, M.
Petsagkourakis, R.
del Rio-Chanona, E. A.
Zhang, D.
[J]. COMPUTERS & CHEMICAL ENGINEERING, 2022, 157
[34] Safe Reinforcement Learning for Mixed-Autonomy Platoon Control
Zhou, Jingyuan
Yan, Longhao
Yang, Kaidi
[J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5744 - 5749
[35] Safe Building HVAC Control via Batch Reinforcement Learning
Zhang, Chi
Kuppannagari, Sanmukh Rao
Prasanna, Viktor K.
[J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2022, 7 (04): : 923 - 934
[36] Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning
Lederer, Armin
Noorani, Erfaun
Baras, John S.
Hirche, Sandra
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1040 - 1045
[37] Safe deep reinforcement learning in diesel engine emission control
Norouzi, Armin
Shahpouri, Saeid
Gordon, David
Shahbakhti, Mahdi
Koch, Charles Robert
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2023, 237 (08) : 1440 - 1453
[38] A safe reinforcement learning algorithm for supervisory control of power plants
Sun, Yixuan
Khairy, Sami
Vilim, Richard B.
Hu, Rui
Dave, Akshay J.
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 301
[39] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Du, Desong
Han, Shaohang
Qi, Naiming
Ammar, Haitham Bou
Wang, Jun
Pan, Wei
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
[40] Safe Learning-Based Control of Elastic Joint Robots via Control Barrier Functions L
Lederer, Armin
Begzadie, Azra
Das, Neha
Hirche, Sandra
[J]. IFAC PAPERSONLINE, 2023, 56 (02): : 2250 - 2256

← 1 2 3 4 5 →