Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient

Cited by: 19
Authors
Gong, Hui [1]
Wang, Peng [1, 2]
Ni, Cui [1]
Cheng, Nuo [1]
Affiliations
[1] Shandong Jiaotong University, Information Science and Electrical Engineering, Jinan 250357, People's Republic of China
[2] Shandong Academy of Sciences, Institute of Automation, Jinan 250013, People's Republic of China
Funding
China Postdoctoral Science Foundation;
Keywords
path planning; DDPG; LSTM; reward function; mixed noise; MULTIVEHICLE TASK ASSIGNMENT; ALGORITHM;
DOI
10.3390/s22093579
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
When the traditional Deep Deterministic Policy Gradient (DDPG) algorithm is applied to mobile robot path planning, the robot can observe only a limited part of its environment, so the path planning model trains inefficiently and converges slowly. In this paper, a Long Short-Term Memory (LSTM) network is introduced into the DDPG network so that the previous and current states of the mobile robot are combined to determine its actions, and a batch normalization layer is added after each layer of the Actor network. At the same time, the reward function is optimized to guide the mobile robot toward the target point more quickly. To improve learning efficiency, the distance and the angle between the mobile robot and the target point are normalized with different methods and used as inputs to the DDPG network model. When the model outputs the next action of the mobile robot, mixed noise composed of Gaussian noise and Ornstein-Uhlenbeck (OU) noise is added. Finally, experiments are carried out in a simulation environment built with ROS and the Gazebo platform. The results show that the proposed algorithm accelerates the convergence of DDPG, improves the generalization ability of the path planning model, and raises both the efficiency and the success rate of mobile robot path planning.
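The mixed exploration noise mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the Gaussian and OU components are combined, so the convex weighting below (`weight`), the parameter values, the two-dimensional action (linear and angular velocity) and the names `OUNoise` and `mixed_noise_action` are all assumptions made for illustration, not the authors' implementation.

```python
import math
import random


class OUNoise:
    """Ornstein-Uhlenbeck process, discretized with the Euler-Maruyama scheme."""

    def __init__(self, dim, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2, rng=None):
        # theta, sigma and dt are illustrative defaults, not values from the paper.
        self.dim, self.mu, self.theta, self.sigma, self.dt = dim, mu, theta, sigma, dt
        self.rng = rng if rng is not None else random.Random()
        self.reset()

    def reset(self):
        # Restart the process at its long-run mean (typically once per episode).
        self.state = [self.mu] * self.dim

    def sample(self):
        # x <- x + theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
        self.state = [
            x + self.theta * (self.mu - x) * self.dt
            + self.sigma * math.sqrt(self.dt) * self.rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)


def mixed_noise_action(action, ou, low, high, gauss_sigma=0.1, weight=0.5, rng=None):
    """Add a weighted mix of Gaussian and OU noise, then clip to the action bounds.

    The convex combination controlled by `weight` is an assumed mixing rule.
    """
    rng = rng if rng is not None else random.Random()
    ou_part = ou.sample()
    noisy = []
    for a, o, lo, hi in zip(action, ou_part, low, high):
        g = rng.gauss(0.0, gauss_sigma)          # uncorrelated Gaussian component
        value = a + weight * g + (1.0 - weight) * o  # temporally correlated OU component
        noisy.append(min(max(value, lo), hi))    # keep the command physically valid
    return noisy
```

In a training loop, `ou.reset()` would typically be called at the start of each episode and `mixed_noise_action` applied to the Actor's output before the command is sent to the robot. The Gaussian term gives independent perturbations at each step, while the OU term gives perturbations that drift smoothly over consecutive steps.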
Pages: 20
Related papers
50 in total
  • [1] Local Path Planning with Turnabouts for Mobile Robot by Deep Deterministic Policy Gradient
    Nakamura, Tomoaki
    Kobayashi, Masato
    Motoi, Naoki
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, ICM, 2023
  • [2] Mapless Path Planning for Mobile Robot Based on Improved Deep Deterministic Policy Gradient Algorithm
    Zhang, Shuzhen
    Tang, Wei
    Li, Panpan
    Zha, Fusheng
    [J]. SENSORS, 2024, 24 (17)
  • [3] Mobile robot path planning based on multi-experience pool deep deterministic policy gradient in unknown environment
    Wei, Linxin
    Xu, Quanxing
    Hu, Ziyu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024
  • [4] Path planning based on improved Deep Deterministic Policy Gradient algorithm
    Liu, Yandong
    Zhang, Wenzhi
    Chen, Fumin
    Li, Jianliang
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019: 295-299
  • [5] Path Planning of Humanoid Arm Based on Deep Deterministic Policy Gradient
    Wen, Shuhuan
    Chen, Jianhua
    Wang, Shen
    Zhang, Hong
    Hu, Xueheng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018: 1755-1760
  • [6] Mobile robot path planning using deep deterministic policy gradient with differential gaming (DDPG-DG) exploration
    Deshpande, Shripad V.
    R, Harikrishnan
    Ibrahim, Babul Salam KSM Kader
    Ponnuru, Mahesh Datta Sai
    [J]. Cognitive Robotics, 2024, 4: 156-173
  • [7] UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient
    Wu, Runjia
    Gu, Fangqing
    Liu, Hai-lin
    Shi, Hongjian
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [8] A path planning algorithm of deterministic mobile robot based on immune
    Fan, Jun-Yan
    Chu, Yu
    Yue, Di
    Hong, Lu
    [J]. Kongzhi yu Juece/Control and Decision, 2021, 36 (10): 2418-2426
  • [9] UAV Coverage Path Planning With Quantum-Based Recurrent Deep Deterministic Policy Gradient
    Silvirianti
    Narottama, Bhaskara
    Shin, Soo Young
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (05): 7424-7429
  • [10] Improved Deep Deterministic Policy Gradient for Dynamic Obstacle Avoidance of Mobile Robot
    Gao, Xiaoshan
    Yan, Liang
    Li, Zhijun
    Wang, Gang
    Chen, I-Ming
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (06): 3675-3682