Efficient Deep Reinforcement Learning for Optimal Path Planning

Cited by: 11
Authors
Ren, Jing [1]
Huang, Xishi [2]
Huang, Raymond N. [3]
Affiliations
[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada
[2] RS OPTO Tech Ltd, Suzhou 215100, Peoples R China
[3] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON M5S 3G8, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
deep reinforcement learning; global optimal path planning; dynamic programming; mobile robots; shortest path; continuous state space; collision avoidance; DYNAMIC-SYSTEM; EXTREME; MACHINE;
DOI
10.3390/electronics11213628
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we propose a novel deep reinforcement learning (DRL) method for optimal path planning for mobile robots using dynamic programming (DP)-based data collection. The proposed method overcomes the slow learning process and the low training data quality inherent in DRL algorithms. The main idea of our approach is as follows. First, we map the dynamic programming method to typical optimal path planning problems for mobile robots and create a new, efficient DP-based method that finds an exact, analytical, optimal solution to the path planning problem. Then, we use the high-quality training data gathered with the DP method for DRL, which greatly improves training data quality and learning efficiency. Next, we establish a two-stage reinforcement learning method in which, prior to the DRL, we employ extreme learning machines (ELM) to initialize the parameters of the actor and critic neural networks to a near-optimal solution in order to significantly improve learning performance. Finally, we illustrate our method on some typical path planning tasks. The experimental results show that our DRL method converges much more easily and faster than other methods. The resulting action neural network is able to successfully guide robots from any start position in the environment to the goal position while following the optimal path and avoiding collisions with obstacles.
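The abstract does not spell out the DP formulation or the ELM initialization, so the following is only a minimal sketch of the general idea under stated assumptions: a hypothetical 4-connected grid world with unit step cost stands in for the planning problem, Bellman backups give the exact cost-to-go, the greedy action in each cell becomes a (state, optimal action) training pair, and an ELM-style least-squares fit of the cost-to-go illustrates how a critic could be initialized. The grid, the names `cost_to_go`, `training_pairs`, and `elm_fit`, and all parameters are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

# Hypothetical 4-connected grid world (assumption): 0 = free cell, 1 = obstacle.
GRID = np.array([
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
])
GOAL = (0, 3)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def cost_to_go(grid, goal):
    """Repeat Bellman backups with unit step cost until no value changes."""
    rows, cols = grid.shape
    value = np.full(grid.shape, np.inf)
    value[goal] = 0.0
    changed = True
    while changed:
        changed = False
        for r in range(rows):
            for c in range(cols):
                if grid[r, c] == 1 or (r, c) == goal:
                    continue
                best = value[r, c]
                for dr, dc in ACTIONS:
                    nr, nc = r + dr, c + dc
                    if 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == 0:
                        best = min(best, 1.0 + value[nr, nc])
                if best < value[r, c]:
                    value[r, c] = best
                    changed = True
    return value


def training_pairs(grid, goal, value):
    """Label each reachable free cell with the index of a greedy action."""
    rows, cols = grid.shape
    pairs = []
    for r in range(rows):
        for c in range(cols):
            if grid[r, c] == 1 or (r, c) == goal or not np.isfinite(value[r, c]):
                continue
            costs = []
            for dr, dc in ACTIONS:
                nr, nc = r + dr, c + dc
                ok = 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == 0
                costs.append(1.0 + value[nr, nc] if ok else np.inf)
            pairs.append(((r, c), int(np.argmin(costs))))
    return pairs


def elm_fit(x, y, hidden=32, seed=0):
    """ELM-style fit: random fixed hidden layer, output weights by least squares."""
    rng = np.random.default_rng(seed)
    w = rng.normal(size=(x.shape[1], hidden))
    b = rng.normal(size=hidden)
    h = np.tanh(x @ w + b)
    beta, *_ = np.linalg.lstsq(h, y, rcond=None)
    return w, b, beta


value = cost_to_go(GRID, GOAL)
pairs = training_pairs(GRID, GOAL, value)

# Fit a critic-like regressor to the DP cost-to-go (illustrative only).
states = np.array([s for s, _ in pairs], dtype=float)
targets = np.array([value[s] for s, _ in pairs])
w, b, beta = elm_fit(states, targets)
```

In a two-stage scheme of the kind the abstract describes, a fit like this would only provide a starting point for the networks; the subsequent DRL stage would refine them. The continuous state space mentioned in the keywords is not handled by this discrete sketch.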
Pages: 21
Related papers
50 in total
  • [41] Path Planning for Mobile Robot Based on Deep Reinforcement Learning and Fuzzy Control
    Liu, Chunling
    Xu, Jun
    Guo, Kaiwen
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 533 - 537
  • [42] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
    Li, Bohao
    Wu, Yunjie
    [J]. IEEE ACCESS, 2020, 8 : 29064 - 29074
  • [43] Improved Path Planning for Indoor Patrol Robot Based on Deep Reinforcement Learning
    Zheng, Jianfeng
    Mao, Shuren
    Wu, Zhenyu
    Kong, Pengcheng
    Qiang, Hao
    [J]. SYMMETRY-BASEL, 2022, 14 (01):
  • [44] A novel path planning approach for unmanned ships based on deep reinforcement learning
    Chen, Chen
    Ma, Feng
    Liu, Jia-Lun
    Yan, Xin-Ping
    Chen, Xian-Qiao
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
  • [45] Target Tracking and Path Planning of Mobile Sensor Based on Deep Reinforcement Learning
    Zhang, Kun
    Hu, Yuanjiang
    Huang, Deqing
    Yin, Zijie
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 190 - 195
  • [46] Path Planning Method of Mobile Robot Using Improved Deep Reinforcement Learning
    Wang, Wei
    Wu, Zhenkui
    Luo, Huafu
    Zhang, Bin
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 2022
  • [47] Path Planning for Ferry Crossing Inland Waterways Based on Deep Reinforcement Learning
    Yuan, Xiaoli
    Yuan, Chengji
    Tian, Wuliu
    Liu, Gan
    Zhang, Jinfen
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (02)
  • [48] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Liu, Yanglong
    Chen, Zuguo
    Li, Yonggang
    Lu, Ming
    Chen, Chaoyang
    Zhang, Xuzhuo
    [J]. International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
  • [49] Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
    Zhang, Kai
    Wang, Guile
    Hu, Jinwen
    Xu, Zhao
    Guo, Chubing
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8392 - 8397
  • [50] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)