Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles

被引:72
|
作者
Zhang, Lixian [1 ]
Zhang, Ruixian [1 ]
Wu, Tong [1 ]
Weng, Rui [1 ]
Han, Minghao [1 ]
Zhao, Ye [2 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150080, Peoples R China
[2] Georgia Inst Technol, Woodruff Sch Mech Engn, Atlanta, GA 30332 USA
关键词
Autonomous vehicles; Planning; Safety; Stability criteria; Lyapunov methods; Trajectory; Robot sensing systems; motion planning; safe reinforcement learning (RL); stability guarantee; UNIFORM ULTIMATE BOUNDEDNESS; AVOIDANCE; SYSTEMS; DELAY;
D O I
10.1109/TNNLS.2021.3084685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning with safety constraints is promising for autonomous vehicles, of which various failures may result in disastrous losses. In general, a safe policy is trained by constrained optimization algorithms, in which the average constraint return as a function of states and actions should be lower than a predefined bound. However, most existing safe learning-based algorithms capture states via multiple high-precision sensors, which complicates the hardware systems and is power-consuming. This article is focused on safe motion planning with the stability guarantee for autonomous vehicles with limited size and power. To this end, the risk-identification method and the Lyapunov function are integrated with the well-known soft actor-critic (SAC) algorithm. By borrowing the concept of Lyapunov functions in the control theory, the learned policy can theoretically guarantee that the state trajectory always stays in a safe area. A novel risk-sensitive learning-based algorithm with the stability guarantee is proposed to train policies for the motion planning of autonomous vehicles. The learned policy is implemented on a differential drive vehicle in a simulation environment. The experimental results show that the proposed algorithm achieves a higher success rate than the SAC.
引用
收藏
页码:5435 / 5444
页数:10
相关论文
共 50 条
  • [1] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    RuiXian Zhang
    YiNing Han
    Man Su
    ZeFeng Lin
    HaoWei Li
    LiXian Zhang
    [J]. Science China Technological Sciences, 2024, 67 : 172 - 182
  • [2] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    ZHANG RuiXian
    HAN YiNing
    SU Man
    LIN ZeFeng
    LI HaoWei
    ZHANG LiXian
    [J]. Science China Technological Sciences, 2024, (01) : 172 - 182
  • [3] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    Zhang, Ruixian
    Han, Yining
    Su, Man
    Lin, Zefeng
    Li, Haowei
    Zhang, Lixian
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (04) : 1023 - 1039
  • [4] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    Zhang, RuiXian
    Han, YiNing
    Su, Man
    Lin, ZeFeng
    Li, HaoWei
    Zhang, LiXian
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (01) : 172 - 182
  • [5] Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles
    Aradi, Szilard
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 740 - 759
  • [6] Optimal motion planning by reinforcement learning in autonomous mobile vehicles
    Gomez, M.
    Gonzalez, R. V.
    Martinez-Marin, T.
    Meziat, D.
    Sanchez, S.
    [J]. ROBOTICA, 2012, 30 : 159 - 170
  • [7] Safe Reinforcement Learning on Autonomous Vehicles
    Isele, David
    Nakhaei, Alireza
    Fujimura, Kikuo
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6162 - 6167
  • [8] Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning
    Rezaee, Kasra
    Yadmellat, Peyman
    Chamorro, Simon
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3506 - 3511
  • [9] Hierarchical Reinforcement Learning for Autonomous Decision Making and Motion Planning of Intelligent Vehicles
    Lu, Yang
    Xu, Xin
    Zhang, Xinglong
    Qian, Lilin
    Zhou, Xing
    [J]. IEEE ACCESS, 2020, 8 : 209776 - 209789
  • [10] Reinforcement Learning based Negotiation-aware Motion Planning of Autonomous Vehicles
    Wang, Zhitao
    Zhuang, Yuzheng
    Gu, Qiang
    Chen, Dong
    Zhang, Hongbo
    Liu, Wulong
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4532 - 4537