Efficient Deep Reinforcement Learning for Optimal Path Planning

被引:10
|
作者
Ren, Jing [1 ]
Huang, Xishi [2 ]
Huang, Raymond N. [3 ]
机构
[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada
[2] RS OPTO Tech Ltd, Suzhou 215100, Peoples R China
[3] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON M5S 3G8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
deep reinforcement learning; global optimal path planning; dynamic programming; mobile robots; shortest path; continuous state space; collision avoidance; DYNAMIC-SYSTEM; EXTREME; MACHINE;
D O I
10.3390/electronics11213628
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel deep reinforcement learning (DRL) method for optimal path planning for mobile robots using dynamic programming (DP)-based data collection. The proposed method can overcome the slow learning process and improve training data quality inherently in DRL algorithms. The main idea of our approach is as follows. First, we mapped the dynamic programming method to typical optimal path planning problems for mobile robots, and created a new efficient DP-based method to find an exact, analytical, optimal solution for the path planning problem. Then, we used high-quality training data gathered using the DP method for DRL, which greatly improves training data quality and learning efficiency. Next, we established a two-stage reinforcement learning method where, prior to the DRL, we employed extreme learning machines (ELM) to initialize the parameters of actor and critic neural networks to a near-optimal solution in order to significantly improve the learning performance. Finally, we illustrated our method using some typical path planning tasks. The experimental results show that our DRL method can converge much easier and faster than other methods. The resulting action neural network is able to successfully guide robots from any start position in the environment to the goal position while following the optimal path and avoiding collision with obstacles.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] EPPE: An Efficient Progressive Policy Enhancement framework of deep reinforcement learning in path planning
    Zhao, Wang
    Zhang, Ye
    Xie, Zikang
    [J]. NEUROCOMPUTING, 2024, 596
  • [2] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    [J]. 2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [3] Dynamic Path Planning for Mobile Robots with Deep Reinforcement Learning
    Yang, Laiyi
    Bi, Jing
    Yuan, Haitao
    [J]. IFAC PAPERSONLINE, 2022, 55 (11): : 19 - 24
  • [4] Explainable Deep Reinforcement Learning for UAV autonomous path planning
    He, Lei
    Aouf, Nabil
    Song, Bifeng
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [5] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaiyuan
    Gao, Jingpeng
    Shen, Liangxi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
  • [6] A decentralized path planning model based on deep reinforcement learning
    Guo, Dong
    Ji, Shouwen
    Yao, Yanke
    Chen, Cheng
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2024, 117
  • [7] Application of Deep Reinforcement Learning in Mobile Robot Path Planning
    Xin, Jing
    Zhao, Huan
    Liu, Ding
    Li, Minqi
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7112 - 7116
  • [8] Grid Path Planning with Deep Reinforcement Learning: Preliminary Results
    Panov, Aleksandr, I
    Yakovlev, Konstantin S.
    Suvorov, Roman
    [J]. 8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 347 - 353
  • [9] A Deep Reinforcement Learning Based Approach for AGVs Path Planning
    Guo, Xinde
    Ren, Zhigang
    Wu, Zongze
    Lai, Jialun
    Zeng, Deyu
    Xie, Shengli
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6833 - 6838
  • [10] Research on path planning of robot based on deep reinforcement learning
    Liu, Feng
    Chen, Chang
    Li, Zhihua
    Guan, Zhi-Hong
    Wang, Hua O.
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3730 - 3734