Hierarchical Reinforcement Learning for Autonomous Decision Making and Motion Planning of Intelligent Vehicles

被引:18
|
作者
Lu, Yang [1 ]
Xu, Xin [1 ]
Zhang, Xinglong [1 ]
Qian, Lilin [2 ]
Zhou, Xing [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] Natl Innovat Inst Def Technol, Unmanned Syst Technol Res Ctr, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
Decision making; Planning; Vehicle dynamics; Dynamics; Reinforcement learning; Heuristic algorithms; Intelligent vehicles; Autonomous driving; hierarchical reinforcement learning; complex dynamic traffics; decision making; motion planning;
D O I
10.1109/ACCESS.2020.3034225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Autonomous decision making and motion planning in complex dynamic traffic environments, such as left-turn without traffic signals and multi-lane merging from side-ways, are still challenging tasks for intelligent vehicles. It is difficult to generate optimized behavior decisions while considering the motion capabilities and dynamic properties of intelligent vehicles. Aiming at the above problems, this article proposes a hierarchical reinforcement learning approach for autonomous decision making and motion planning in complex dynamic traffic scenarios. The proposed approach consists of two layers. At the higher layer, a kernel-based least-squares policy iteration algorithm with uneven sampling and pooling strategy (USP-KLSPI) is presented for solving the decision-making problems. The motion capabilities of the ego vehicle and the surrounding vehicles are evaluated with a high-fidelity dynamic model in the decision-making layer. By doing so, the consistency between the decisions generated at the higher layer and the operations in the lower planning layer can be well guaranteed. The lower layer addresses the motion-planning problem in the lateral direction using a dual heuristic programming (DHP) algorithm learned in a batch-mode manner, while the velocity profile in the longitudinal direction is inherited from the higher layer. Extensive simulations are conducted in complex traffic conditions including left-turn without traffic signals and multi-lane merging from side-ways scenarios. The results demonstrate the effectiveness and efficiency of the proposed approach in realizing optimized decision making and motion planning in complex environments.
引用
收藏
页码:209776 / 209789
页数:14
相关论文
共 50 条
  • [41] Hierarchical Motion Planning and Tracking for Autonomous Vehicles Using Global Heuristic Based Potential Field and Reinforcement Learning Based Predictive Control
    Du, Guodong
    Zou, Yuan
    Zhang, Xudong
    Li, Zirui
    Liu, Qi
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8304 - 8323
  • [42] Robust Lane Change Decision Making for Autonomous Vehicles: An Observation Adversarial Reinforcement Learning Approach
    He, Xiangkun
    Yang, Haohan
    Hu, Zhongxu
    Lv, Chen
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 184 - 193
  • [43] A Deep Reinforcement Learning Decision-Making Approach for Adaptive Cruise Control in Autonomous Vehicles
    Ghraizi, Dany
    Talj, Reine
    Francis, Clovis
    [J]. 2023 21ST INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, ICAR, 2023, : 71 - 78
  • [44] Deep Reinforcement Learning Based Game-Theoretic Decision-Making for Autonomous Vehicles
    Yuan, Mingfeng
    Shan, Jinjun
    Mi, Kevin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 818 - 825
  • [45] Human-like Decision Making for Autonomous Vehicles at the Intersection Using Inverse Reinforcement Learning
    Wu, Zheng
    Qu, Fangbing
    Yang, Lin
    Gong, Jianwei
    [J]. SENSORS, 2022, 22 (12)
  • [46] Decision-Making Models for Autonomous Vehicles at Unsignalized Intersections Based on Deep Reinforcement Learning
    Xu, Shu-Yuan
    Chen, Xue-Mei
    Wang, Zi-Jia
    Hu, Yu-Hui
    Han, Xin-Tong
    [J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 672 - 677
  • [47] Advanced planning for autonomous vehicles using reinforcement learning and deep inverse reinforcement learning
    You, Changxi
    Lu, Jianbo
    Filev, Dimitar
    Tsiotras, Panagiotis
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 114 : 1 - 18
  • [48] Learning hierarchical behavior and motion planning for autonomous driving
    Wang, Jingke
    Wang, Yue
    Zhang, Dongkun
    Yang, Yezhou
    Xiong, Rong
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2235 - 2242
  • [49] Iterative Learning-Based Cooperative Motion Planning and Decision-Making for Connected and Autonomous Vehicles Coordination at On-Ramps
    Wang, Bowen
    Gong, Xinle
    Lyu, Peiyuan
    Liang, Sheng
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 8105 - 8120
  • [50] An Intelligent Path Planning Scheme of Autonomous Vehicles Platoon Using Deep Reinforcement Learning on Network Edge
    Chen, Chen
    Jiang, Jiange
    Lv, Ning
    Li, Siyu
    [J]. IEEE ACCESS, 2020, 8 : 99059 - 99069