Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots

被引:0
|
作者
Zhang, RuiXian [1 ]
Han, YiNing [2 ]
Su, Man [3 ]
Lin, ZeFeng [1 ]
Li, HaoWei [1 ]
Zhang, LiXian [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Management, Harbin 150001, Peoples R China
[3] Beijing Inst Tracking & Telecommun Technol, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
motion control; reinforcement learning; robustness; stability;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper addresses the issue of safety in reinforcement learning (RL) with disturbances and its application in the safety-constrained motion control of autonomous robots. To tackle this problem, a robust Lyapunov value function (rLVF) is proposed. The rLVF is obtained by introducing a data-based LVF under the worst-case disturbance of the observed state. Using the rLVF, a uniformly ultimate boundedness criterion is established. This criterion is desired to ensure that the cost function, which serves as a safety criterion, ultimately converges to a range via the policy to be designed. Moreover, to mitigate the drastic variation of the rLVF caused by differences in states, a smoothing regularization of the rLVF is introduced. To train policies with safety guarantees under the worst disturbances of the observed states, an off-policy robust RL algorithm is proposed. The proposed algorithm is applied to motion control tasks of an autonomous vehicle and a cartpole, which involve external disturbances and variations of the model parameters, respectively. The experimental results demonstrate the effectiveness of the theoretical findings and the advantages of the proposed algorithm in terms of robustness and safety.
引用
收藏
页码:172 / 182
页数:11
相关论文
共 50 条
  • [1] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    RuiXian Zhang
    YiNing Han
    Man Su
    ZeFeng Lin
    HaoWei Li
    LiXian Zhang
    [J]. Science China Technological Sciences, 2024, 67 : 172 - 182
  • [2] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    Zhang, Ruixian
    Han, Yining
    Su, Man
    Lin, Zefeng
    Li, Haowei
    Zhang, Lixian
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (04) : 1023 - 1039
  • [3] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    ZHANG RuiXian
    HAN YiNing
    SU Man
    LIN ZeFeng
    LI HaoWei
    ZHANG LiXian
    [J]. Science China Technological Sciences, 2024, (01) : 172 - 182
  • [4] Integration of Robust Control with Reinforcement Learning for Safe Autonomous Vehicle Motion
    Lelko, Attila
    Nemeth, Balazs
    Fenyes, Daniel
    Gaspar, Peter
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 1101 - 1106
  • [5] Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles
    Zhang, Lixian
    Zhang, Ruixian
    Wu, Tong
    Weng, Rui
    Han, Minghao
    Zhao, Ye
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5435 - 5444
  • [6] A classifier system for reinforcement learning control of autonomous robots
    Kuroyama, K
    Svinin, MM
    Ueda, K
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 304 - 311
  • [7] A Safe and Self-Recoverable Reinforcement Learning Framework for Autonomous Robots
    Wang, Weiqiang
    Zhou, Xu
    Xu, Benlian
    Lu, Mingli
    Zhang, Yuxin
    Gu, Yuhang
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3878 - 3883
  • [8] Safe Robust Adaptive Motion Control for Underactuated Marine Robots
    Nazmara, G. Reza
    Aguiar, A. Pedro
    [J]. SENSORS, 2024, 24 (12)
  • [9] Safe and Robust Motion Planning for Autonomous Navigation of Quadruped Robots in Cluttered Environments
    Liu, Hongyi
    Yuan, Quan
    [J]. IEEE ACCESS, 2024, 12 : 69728 - 69737
  • [10] A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments
    Zhou, Zhiqian
    Ren, Junkai
    Zeng, Zhiwen
    Xiao, Junhao
    Zhang, Xinglong
    Guo, Xian
    Zhou, Zongtan
    Lu, Huimin
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023,