Efficient Deep Reinforcement Learning for Optimal Path Planning

Cited by: 10

Authors:

Ren, Jing [1]

Huang, Xishi [2]

Huang, Raymond N. [3]

Affiliations:

[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada

[2] RS OPTO Tech Ltd, Suzhou 215100, Peoples R China

[3] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON M5S 3G8, Canada

Source:

ELECTRONICS | 2022 / Vol. 11 / Issue 21

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

deep reinforcement learning; global optimal path planning; dynamic programming; mobile robots; shortest path; continuous state space; collision avoidance; DYNAMIC-SYSTEM; EXTREME; MACHINE;

DOI:

10.3390/electronics11213628

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this paper, we propose a novel deep reinforcement learning (DRL) method for optimal path planning for mobile robots using dynamic programming (DP)-based data collection. The proposed method can overcome the slow learning process inherent in DRL algorithms and improve training data quality. The main idea of our approach is as follows. First, we map the dynamic programming method to typical optimal path planning problems for mobile robots and create a new, efficient DP-based method to find an exact, analytical, optimal solution to the path planning problem. Then, we use high-quality training data gathered with the DP method for DRL, which greatly improves training data quality and learning efficiency. Next, we establish a two-stage reinforcement learning method in which, prior to the DRL, we employ extreme learning machines (ELM) to initialize the parameters of the actor and critic neural networks to a near-optimal solution in order to significantly improve the learning performance. Finally, we illustrate our method on some typical path planning tasks. The experimental results show that our DRL method converges much more easily and quickly than other methods. The resulting action neural network is able to successfully guide robots from any start position in the environment to the goal position while following the optimal path and avoiding collisions with obstacles.
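
As a rough illustration of the DP idea mentioned in the abstract, the sketch below runs Bellman backups on a small occupancy grid to obtain a cost-to-go table and then follows that table greedily to the goal. It is not the authors' method: the abstract describes an analytical solution and a continuous state space, whereas this example assumes a discrete grid, unit step costs, and four-connected moves. The names `cost_to_go`, `greedy_path`, `MOVES`, and the example grid are all hypothetical.

```python
import numpy as np

# Assumed action set: up, down, left, right (the paper's actions may differ).
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def cost_to_go(grid, goal):
    """Repeat Bellman backups V(s) = min_a [1 + V(s')] until nothing changes.

    grid: 2D array, 0 = free cell, 1 = obstacle. goal: (row, col) tuple.
    """
    rows, cols = grid.shape
    V = np.full((rows, cols), np.inf)
    V[goal] = 0.0
    changed = True
    while changed:
        changed = False
        for r in range(rows):
            for c in range(cols):
                if grid[r, c] == 1 or (r, c) == goal:
                    continue
                best = V[r, c]
                for dr, dc in MOVES:
                    nr, nc = r + dr, c + dc
                    if 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == 0:
                        best = min(best, 1.0 + V[nr, nc])
                if best < V[r, c]:
                    V[r, c] = best
                    changed = True
    return V


def greedy_path(V, grid, start):
    """Step to the free neighbour with the lowest cost-to-go until V == 0."""
    if not np.isfinite(V[start]):
        return []  # the goal cannot be reached from this cell
    rows, cols = grid.shape
    path = [start]
    r, c = start
    while V[r, c] > 0:
        candidates = []
        for dr, dc in MOVES:
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == 0:
                candidates.append((V[nr, nc], (nr, nc)))
        _, (r, c) = min(candidates)
        path.append((r, c))
    return path


# Hypothetical 3x4 map with a two-cell wall in the middle row.
grid = np.array([
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
])
goal = (2, 3)
V = cost_to_go(grid, goal)
path = greedy_path(V, grid, (0, 0))
```

In a pipeline like the one the abstract outlines, a table such as `V` and the greedy moves derived from it could plausibly serve as supervised targets for critic and actor networks, but how the authors actually construct their training data is not stated in this record.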


Pages: 21

