Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics

被引：0

作者：

Yuan, Zhaocong ^{[1
,2
,3
]}

Hall, Adam W. ^{[1
,2
,3
]}

Zhou, Siqi ^{[1
,2
,3
]}

Brunke, Lukas ^{[1
,2
,3
]}

Greeff, Melissa ^{[1
,2
,3
]}

Panerati, Jacopo ^{[1
,2
,3
]}

Schoellig, Angela P. ^{[1
,2
,3
]}

机构：

[1] Univ Toronto, Dynam Syst Lab, Toronto, ON M5S 1A1, Canada

[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON M5S 1A1, Canada

[3] Toronto Univ Toronto, Vector Inst Artificial Intelligence, Toronto, ON M5S 1A1, Canada

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022年 / 7卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Machine learning for robot control; reinforcement learning; robot safety; software tools for benchmarking and reproducibility; PHYSICS;

D O I：

10.1109/LRA.2022.3196132

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In recent years, both reinforcement learning and learning-based control-as well as the study of their safety, which is crucial for deployment in real-world robots-have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques. We provide implementations for three dynamic systems-the cart-pole, the 1D, and 2D quadrotor-and two control tasks-stabilization and trajectory tracking. We propose to extend OpenAI's Gym API-the de facto standard in reinforcement learning research-with (i) the ability to specify (and query) symbolic dynamics and (ii) constraints, and (iii) (repeatably) inject simulated disturbances in the control inputs, state measurements, and inertial properties. To demonstrate our proposal and in an attempt to bring research communities closer together, we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the fields of traditional control, learning-based control, and reinforcement learning.

引用

页码：11142 / 11149

页数：8

共 50 条

[1] Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Brunke, Lukas
Greeff, Melissa
Hall, Adam W.
Yuan, Zhaocong
Zhou, Siqi
Panerati, Jacopo
Schoellig, Angela P.
[J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 411 - 444
[2] Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Ji, Jiaming
Zhang, Borong
Zhou, Jiayi
Pan, Xuehai
Huang, Weidong
Sun, Ruiyang
Geng, Yiran
Zhong, Yifan
Dai, Juntao
Yang, Yaodong
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[3] Learning-Based Model Predictive Control: Toward Safe Learning in Control
Hewing, Lukas
Wabersich, Kim P.
Menner, Marcel
Zeilinger, Melanie N.
[J]. ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
[4] Safe deep reinforcement learning-based adaptive control for USV interception mission
Du, Bin
Lin, Bin
Zhang, Chenming
Dong, Botao
Zhang, Weidong
[J]. OCEAN ENGINEERING, 2022, 246
[5] Learning-based Model Predictive Control for Safe Exploration
Koller, Torsten
Berkenkamp, Felix
Turchetta, Matteo
Krause, Andreas
[J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 6059 - 6066
[6] Learning a Unified Control Policy for Safe Falling
Kumar, Visak C. V.
Ha, Sehoon
Liu, C. Karen
[J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3940 - 3947
[7] Safe Reinforcement Learning-Based Robust Approximate Optimal Control for Hypersonic Flight Vehicles
Shi, Lei
Wang, Xuesong
Cheng, Yuhu
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11401 - 11414
[8] Reinforcement Learning-based Optimal Control and Software Rejuvenation for Safe and Efficient UAV Navigation
Chen, Angela
Mitsopoulos, Konstantinos
Romagnoli, Raffaele
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7527 - 7532
[9] Safe Reinforcement Learning-Based Balance Control for Multi-Cylinder Hydraulic Press
Jia, Chao
Song, Zijian
Du, Lifeng
Wang, Hongkun
[J]. JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2024, 146 (04):
[10] Learning-based model predictive control for safe path planning and control
Ren, Hongbin
Li, Yunong
Wang, Yang
Chen, Chih-Keng
Yang, Lin
Zhao, Yuzhuang
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024,

← 1 2 3 4 5 →