Reachability-Based Trajectory Safeguard (RTS): A Safe and Fast Reinforcement Learning Safety Layer for Continuous Control

被引:23
|
作者
Shao, Yifei Simon [1 ,2 ]
Chen, Chao [2 ]
Kousik, Shreyas [3 ]
Vasudevan, Ram [1 ,2 ]
机构
[1] Univ Michigan, Sch Mech Engn, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Robot Inst, Ann Arbor, MI 48109 USA
[3] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
来源
基金
美国国家科学基金会;
关键词
Reinforcement learning; robot safety; task and motion planning;
D O I
10.1109/LRA.2021.3063989
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement Learning (RL) algorithms have achieved remarkable performance in decision making and control tasks by reasoning about long-term, cumulative reward using trial and error. However, during RL training, applying this trial-and-error approach to real-world robots operating in safety critical environment may lead to collisions. To address this challenge, this letter proposes a Reachability-based Trajectory Safeguard (RTS), which leverages reachability analysis to ensure safety during training and operation. Given a known (but uncertain) model of a robot, RTS precomputes a Forward Reachable Set of the robot tracking a continuum of parameterized trajectories. At runtime, the RL agent selects from this continuum in a receding-horizon way to control the robot; the FRS is used to identify if the agent's choice is safe or not, and to adjust unsafe choices. The efficacy of this method is illustrated in static environments on three nonlinear robot models, including a 12-D quadrotor drone, in simulation and in comparison with state-of-the-art safe motion planning methods.
引用
收藏
页码:3663 / 3670
页数:8
相关论文
共 50 条
  • [1] Reachability-Based Safe Learning with Gaussian Processes
    Akametalu, Anayo K.
    Fisac, Jaime F.
    Gillula, Jeremy H.
    Kaynama, Shahab
    Zeilinger, Melanie N.
    Tomlin, Claire J.
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1424 - 1431
  • [2] Reachability-Based Safe Learning Framework: Experimental Results
    不详
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 164 - 165
  • [3] Hierarchical Reinforcement Learning from Demonstration via Reachability-Based Reward Shaping
    Gao, Xiaozhu
    Liu, Jinhui
    Wan, Bo
    An, Lingling
    [J]. NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [4] Safe, Fast and Explainable Online Reinforcement Learning for Continuous Process Control
    Patel, Kalpesh M.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP 2022), 2022, : 54 - 60
  • [5] Guaranteed Safe Reachability-based Trajectory Design for a High-Fidelity Model of an Autonomous Passenger Vehicle
    Vaskov, Sean
    Sharma, Utkarsh
    Kousik, Shreyas
    Johnson-Roberson, Matthew
    Vasudevan, Ramanarayan
    [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 705 - 710
  • [6] Infusing Reachability-Based Safety into Planning and Control for Multi-agent Interactions
    Wang, Xinrui
    Leung, Karen
    Pavone, Marco
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6252 - 6259
  • [7] Safe Model-Based Reinforcement Learning With an Uncertainty-Aware Reachability Certificate
    Yu, Dongjie
    Zou, Wenjun
    Yang, Yujie
    Ma, Haitong
    Li, Shengbo Eben
    Yin, Yuming
    Chen, Jianyu
    Duan, Jingliang
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, : 1 - 14
  • [8] Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
    Brunke, Lukas
    Greeff, Melissa
    Hall, Adam W.
    Yuan, Zhaocong
    Zhou, Siqi
    Panerati, Jacopo
    Schoellig, Angela P.
    [J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 411 - 444
  • [9] Safe Learning-based Predictive Control from Efficient Reachability
    Cao, Michael E.
    Coogan, Samuel
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 1832 - 1837
  • [10] Learning Collision-freed Trajectory of welding manipulator based on Safe Reinforcement Learning
    Xu, Yintao
    Wang, Tao
    Chen, Chong
    Hu, Bo
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 836 - 841