Safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics

被引:0
|
作者
Yuan, Zhaocong [1 ,2 ,3 ]
Hall, Adam W. [1 ,2 ,3 ]
Zhou, Siqi [1 ,2 ,3 ]
Brunke, Lukas [1 ,2 ,3 ]
Greeff, Melissa [1 ,2 ,3 ]
Panerati, Jacopo [1 ,2 ,3 ]
Schoellig, Angela P. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Dynam Syst Lab, Toronto, ON M5S 1A1, Canada
[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON M5S 1A1, Canada
[3] Toronto Univ Toronto, Vector Inst Artificial Intelligence, Toronto, ON M5S 1A1, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
Machine learning for robot control; reinforcement learning; robot safety; software tools for benchmarking and reproducibility; PHYSICS;
D O I
10.1109/LRA.2022.3196132
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In recent years, both reinforcement learning and learning-based control-as well as the study of their safety, which is crucial for deployment in real-world robots-have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques. We provide implementations for three dynamic systems-the cart-pole, the 1D, and 2D quadrotor-and two control tasks-stabilization and trajectory tracking. We propose to extend OpenAI's Gym API-the de facto standard in reinforcement learning research-with (i) the ability to specify (and query) symbolic dynamics and (ii) constraints, and (iii) (repeatably) inject simulated disturbances in the control inputs, state measurements, and inertial properties. To demonstrate our proposal and in an attempt to bring research communities closer together, we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the fields of traditional control, learning-based control, and reinforcement learning.
引用
收藏
页码:11142 / 11149
页数:8
相关论文
共 50 条
  • [1] Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
    Brunke, Lukas
    Greeff, Melissa
    Hall, Adam W.
    Yuan, Zhaocong
    Zhou, Siqi
    Panerati, Jacopo
    Schoellig, Angela P.
    [J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 411 - 444
  • [2] Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
    Ji, Jiaming
    Zhang, Borong
    Zhou, Jiayi
    Pan, Xuehai
    Huang, Weidong
    Sun, Ruiyang
    Geng, Yiran
    Zhong, Yifan
    Dai, Juntao
    Yang, Yaodong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Learning-Based Model Predictive Control: Toward Safe Learning in Control
    Hewing, Lukas
    Wabersich, Kim P.
    Menner, Marcel
    Zeilinger, Melanie N.
    [J]. ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
  • [4] Safe deep reinforcement learning-based adaptive control for USV interception mission
    Du, Bin
    Lin, Bin
    Zhang, Chenming
    Dong, Botao
    Zhang, Weidong
    [J]. OCEAN ENGINEERING, 2022, 246
  • [5] Learning-based Model Predictive Control for Safe Exploration
    Koller, Torsten
    Berkenkamp, Felix
    Turchetta, Matteo
    Krause, Andreas
    [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 6059 - 6066
  • [6] Learning a Unified Control Policy for Safe Falling
    Kumar, Visak C. V.
    Ha, Sehoon
    Liu, C. Karen
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3940 - 3947
  • [7] Safe Reinforcement Learning-Based Robust Approximate Optimal Control for Hypersonic Flight Vehicles
    Shi, Lei
    Wang, Xuesong
    Cheng, Yuhu
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11401 - 11414
  • [8] Reinforcement Learning-based Optimal Control and Software Rejuvenation for Safe and Efficient UAV Navigation
    Chen, Angela
    Mitsopoulos, Konstantinos
    Romagnoli, Raffaele
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7527 - 7532
  • [9] Safe Reinforcement Learning-Based Balance Control for Multi-Cylinder Hydraulic Press
    Jia, Chao
    Song, Zijian
    Du, Lifeng
    Wang, Hongkun
    [J]. JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2024, 146 (04):
  • [10] Learning-based model predictive control for safe path planning and control
    Ren, Hongbin
    Li, Yunong
    Wang, Yang
    Chen, Chih-Keng
    Yang, Lin
    Zhao, Yuzhuang
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024,