Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient

被引：19

作者：

Gong, Hui ^{[1
]}

Wang, Peng ^{[1
,2
]}

Ni, Cui ^{[1
]}

Cheng, Nuo ^{[1
]}

机构：

[1] Shandong Jiao Tong Univ, Informat Sci & Elect Engn, Jinan 250357, Peoples R China

[2] Shandong Acad Sci, Inst Automat, Jinan 250013, Peoples R China

来源：

SENSORS | 2022年 / 22卷 / 09期

基金：

中国博士后科学基金;

关键词：

path planning; DDPG; LSTM; reward function; mixed noise; MULTIVEHICLE TASK ASSIGNMENT; ALGORITHM;

D O I：

10.3390/s22093579

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

When a traditional Deep Deterministic Policy Gradient (DDPG) algorithm is used in mobile robot path planning, due to the limited observable environment of mobile robots, the training efficiency of the path planning model is low, and the convergence speed is slow. In this paper, Long Short-Term Memory (LSTM) is introduced into the DDPG network, the former and current states of the mobile robot are combined to determine the actions of the robot, and a Batch Norm layer is added after each layer of the Actor network. At the same time, the reward function is optimized to guide the mobile robot to move faster towards the target point. In order to improve the learning efficiency, different normalization methods are used to normalize the distance and angle between the mobile robot and the target point, which are used as the input of the DDPG network model. When the model outputs the next action of the mobile robot, mixed noise composed of Gaussian noise and Ornstein-Uhlenbeck (OU) noise is added. Finally, the simulation environment built by a ROS system and a Gazebo platform is used for experiments. The results show that the proposed algorithm can accelerate the convergence speed of DDPG, improve the generalization ability of the path planning model and improve the efficiency and success rate of mobile robot path planning.

引用

页数：20

共 50 条

[1] Local Path Planning with Turnabouts for Mobile Robot by Deep Deterministic Policy Gradient
Nakamura, Tomoaki
Kobayashi, Masato
Motoi, Naoki
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, ICM, 2023,
[2] Mapless Path Planning for Mobile Robot Based on Improved Deep Deterministic Policy Gradient Algorithm
Zhang, Shuzhen
Tang, Wei
Li, Panpan
Zha, Fusheng
[J]. SENSORS, 2024, 24 (17)
[3] Mobile robot path planning based on multi-experience pool deep deterministic policy gradient in unknown environment
Wei, Linxin
Xu, Quanxing
Hu, Ziyu
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
[4] Path planning based on improved Deep Deterministic Policy Gradient algorithm
Liu, Yandong
Zhang, Wenzhi
Chen, Fumin
Li, Jianliang
[J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 295 - 299
[5] Path Planning of Humanoid Arm Based on Deep Deterministic Policy Gradient
Wen, Shuhuan
Chen, Jianhua
Wang, Shen
Zhang, Hong
Hu, Xueheng
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1755 - 1760
[6] Mobile robot path planning using deep deterministic policy gradient with differential gaming (DDPG-DG) exploration
Deshpande, Shripad V.
R, Harikrishnan
Ibrahim, Babul Salam KSM Kader
Ponnuru, Mahesh Datta Sai
[J]. Cognitive Robotics, 2024, 4 : 156 - 173
[7] UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient
Wu, Runjia
Gu, Fangqing
Liu, Hai-lin
Shi, Hongjian
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
[8] A path planning algorithm of deterministic mobile robot based on immune
Fan, Jun-Yan
Chu, Yu
Yue, Di
Hong, Lu
[J]. Kongzhi yu Juece/Control and Decision, 2021, 36 (10): : 2418 - 2426
[9] UAV Coverage Path Planning With Quantum-Based Recurrent Deep Deterministic Policy Gradient
Silvirianti
Narottama, Bhaskara
Shin, Soo Young
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (05) : 7424 - 7429
[10] Improved Deep Deterministic Policy Gradient for Dynamic Obstacle Avoidance of Mobile Robot
Gao, Xiaoshan
Yan, Liang
Li, Zhijun
Wang, Gang
Chen, I-Ming
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (06): : 3675 - 3682

← 1 2 3 4 5 →