Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots

被引:0
|
作者
Zhang, RuiXian [1 ]
Han, YiNing [2 ]
Su, Man [3 ]
Lin, ZeFeng [1 ]
Li, HaoWei [1 ]
Zhang, LiXian [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Management, Harbin 150001, Peoples R China
[3] Beijing Inst Tracking & Telecommun Technol, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
motion control; reinforcement learning; robustness; stability;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper addresses the issue of safety in reinforcement learning (RL) with disturbances and its application in the safety-constrained motion control of autonomous robots. To tackle this problem, a robust Lyapunov value function (rLVF) is proposed. The rLVF is obtained by introducing a data-based LVF under the worst-case disturbance of the observed state. Using the rLVF, a uniformly ultimate boundedness criterion is established. This criterion is desired to ensure that the cost function, which serves as a safety criterion, ultimately converges to a range via the policy to be designed. Moreover, to mitigate the drastic variation of the rLVF caused by differences in states, a smoothing regularization of the rLVF is introduced. To train policies with safety guarantees under the worst disturbances of the observed states, an off-policy robust RL algorithm is proposed. The proposed algorithm is applied to motion control tasks of an autonomous vehicle and a cartpole, which involve external disturbances and variations of the model parameters, respectively. The experimental results demonstrate the effectiveness of the theoretical findings and the advantages of the proposed algorithm in terms of robustness and safety.
引用
收藏
页码:172 / 182
页数:11
相关论文
共 50 条
  • [21] A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control
    Kondo, T
    Ito, K
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2004, 46 (02) : 111 - 124
  • [22] CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
    Xu, Tengyu
    Liang, Yingbin
    Lan, Guanghui
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] Safe, efficient, and comfortable velocity control based on reinforcement learning for autonomous driving
    Zhu, Meixin
    Wang, Yinhai
    Pu, Ziyuan
    Hu, Jingyun
    Wang, Xuesong
    Ke, Ruimin
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 117
  • [24] Bayesian learning for the robust verification of autonomous robots
    Xingyu Zhao
    Simos Gerasimou
    Radu Calinescu
    Calum Imrie
    Valentin Robu
    David Flynn
    [J]. Communications Engineering, 3 (1):
  • [25] Robust Control of Quadruped Robots using Reinforcement Learning and Depth Completion Network
    Xu, Ruonan
    Guo, Bin
    Zhao, Kaixing
    Jing, Yao
    Ding, Yasan
    Yu, Zhiwen
    [J]. PROCEEDINGS OF THE 2024 ADAAIOTSYS 2024-WORKSHOP ON ADAPTIVE AIOT SYSTEMS, ADAAIOTSYS 2024, 2024, : 7 - 12
  • [26] Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
    Meindl, Michael
    Lehmann, Dustin
    Seel, Thomas
    [J]. FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [27] Reinforcement learning-based motion control for snake robots in complex environments
    Zhang, Dong
    Ju, Renjie
    Cao, Zhengcai
    [J]. ROBOTICA, 2024, 42 (04) : 947 - 961
  • [28] Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning
    Zhao, Rui
    Chen, Ziguo
    Fan, Yuze
    Li, Yun
    Gao, Fei
    [J]. SENSORS, 2024, 24 (13)
  • [29] Safe Reinforcement Learning Using Robust MPC
    Zanon, Mario
    Gros, Sebastien
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (08) : 3638 - 3652
  • [30] Optimal Motion Design for Autonomous Vehicles With Learning Aided Robust Control
    Lelko, Attila
    Nemeth, Balazs
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (09) : 12638 - 12651