Efficient Deep Reinforcement Learning for Optimal Path Planning

被引：11

作者：

Ren, Jing ^{[1
]}

Huang, Xishi ^{[2
]}

Huang, Raymond N. ^{[3
]}

机构：

[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada

[2] RS OPTO Tech Ltd, Suzhou 215100, Peoples R China

[3] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON M5S 3G8, Canada

来源：

ELECTRONICS | 2022年 / 11卷 / 21期

基金：

加拿大自然科学与工程研究理事会;

关键词：

deep reinforcement learning; global optimal path planning; dynamic programming; mobile robots; shortest path; continuous state space; collision avoidance; DYNAMIC-SYSTEM; EXTREME; MACHINE;

D O I：

10.3390/electronics11213628

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a novel deep reinforcement learning (DRL) method for optimal path planning for mobile robots using dynamic programming (DP)-based data collection. The proposed method can overcome the slow learning process and improve training data quality inherently in DRL algorithms. The main idea of our approach is as follows. First, we mapped the dynamic programming method to typical optimal path planning problems for mobile robots, and created a new efficient DP-based method to find an exact, analytical, optimal solution for the path planning problem. Then, we used high-quality training data gathered using the DP method for DRL, which greatly improves training data quality and learning efficiency. Next, we established a two-stage reinforcement learning method where, prior to the DRL, we employed extreme learning machines (ELM) to initialize the parameters of actor and critic neural networks to a near-optimal solution in order to significantly improve the learning performance. Finally, we illustrated our method using some typical path planning tasks. The experimental results show that our DRL method can converge much easier and faster than other methods. The resulting action neural network is able to successfully guide robots from any start position in the environment to the goal position while following the optimal path and avoiding collision with obstacles.

引用

页数：21

共 50 条

[41] Path Planning for Mobile Robot Based on Deep Reinforcement Learning and Fuzzy Control
Liu, Chunling
Xu, Jun
Guo, Kaiwen
[J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 533 - 537
[42] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
Li, Bohao
Wu, Yunjie
[J]. IEEE ACCESS, 2020, 8 : 29064 - 29074
[43] Improved Path Planning for Indoor Patrol Robot Based on Deep Reinforcement Learning
Zheng, Jianfeng
Mao, Shuren
Wu, Zhenyu
Kong, Pengcheng
Qiang, Hao
[J]. SYMMETRY-BASEL, 2022, 14 (01):
[44] A novel path planning approach for unmanned ships based on deep reinforcement learning
Chen, Chen
Ma, Feng
Liu, Jia-Lun
Yan, Xin-Ping
Chen, Xian-Qiao
[J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
[45] Target Tracking and Path Planning of Mobile Sensor Based on Deep Reinforcement Learning
Zhang, Kun
Hu, Yuanjiang
Huang, Deqing
Yin, Zijie
[J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 190 - 195
[46] Path Planning Method of Mobile Robot Using Improved Deep Reinforcement Learning
Wang, Wei
Wu, Zhenkui
Luo, Huafu
Zhang, Bin
[J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 2022
[47] Path Planning for Ferry Crossing Inland Waterways Based on Deep Reinforcement Learning
Yuan, Xiaoli
Yuan, Chengji
Tian, Wuliu
Liu, Gan
Zhang, Jinfen
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (02)
[48] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
Yanglong Liu
Zuguo Chen
Yonggang Li
Ming Lu
Chaoyang Chen
Xuzhuo Zhang
[J]. International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
[49] Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
Zhang, Kai
Wang, Guile
Hu, Jinwen
Xu, Zhao
Guo, Chubing
[J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8392 - 8397
[50] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
Meng, Haitao
Zhang, Hengrui
[J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)

← 1 2 3 4 5 →