Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

被引:0
|
作者
Xie, Ronglei [1 ]
Meng, Zhijun [1 ]
Wang, Lifeng [1 ]
Li, Haochen [1 ]
Wang, Kaipeng [1 ]
Wu, Zhe [1 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
中国国家自然科学基金;
关键词
Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; path planning; recurrent neural network; COLLISION-AVOIDANCE; UAVS; OPTIMIZATION; NETWORKS;
D O I
10.1109/ACCESS.2021.3057485
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning by utilizing the local information and relative distance without global information. UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. Therefore, path planning can be formulated as a Partially Observable Markov Decision Process. The recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce the meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. The simulation experiment results show that our method has significant improvements over Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in the large-scale and complex environment, and has the perfect ability to avoid obstacles.in the unknown environment.
引用
收藏
页码:24884 / 24900
页数:17
相关论文
共 50 条
  • [1] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    [J]. IEEE Access, 2021, 9 : 24884 - 24900
  • [2] A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING
    Wang, Wenhui
    Li, Leqing
    Ye, Fumeng
    Peng, Yumin
    Ma, Yiming
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2024, 39 (03): : 204 - 210
  • [3] An Improved Artificial Potential Field based Path Planning Algorithm for Unmanned Aerial Vehicle in Dynamic Environments
    Chen, Shoufeng
    Yang, Zhihua
    Liu, Zhentao
    Jin, Haojie
    [J]. 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 591 - 596
  • [4] Variation Encoded Large-Scale Swarm Optimizers for Path Planning of Unmanned Aerial Vehicle
    Xiao, Tan-Lin
    Yang, Qiang
    Gao, Xu-Dong
    Lu, Zhen-Yu
    Ma, Yuan-Yuan
    Jeon, Sang-Woon
    Zhang, Jun
    [J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 102 - 110
  • [5] Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
    Zhang, Kai
    Wang, Guile
    Hu, Jinwen
    Xu, Zhao
    Guo, Chubing
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8392 - 8397
  • [6] Unmanned aerial vehicle path planning based on TLBO algorithm
    [J]. Yu, Guolin (guolin_yu@126.com), 1600, Massey University (07):
  • [7] UNMANNED AERIAL VEHICLE PATH PLANNING BASED ON TLBO ALGORITHM
    Yu, Guolin
    Song, Hui
    Gao, Jie
    [J]. INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (03) : 1310 - 1325
  • [8] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
    Xing, Bowen
    Wang, Xiao
    Yang, Liu
    Liu, Zhenchong
    Wu, Qingyun
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [9] Autonomous Path Planning by Unmanned Aerial Vehicle (UAV) for Precise Monitoring of Large-Scale PV plants
    Sizkouhi, Amir Mohammad Moradi
    Esmailifar, Sayyed Majid
    Aghaei, Mohammadreza
    Vidal de Oliveira, Aline Kirsten
    Ruther, Ricardo
    [J]. 2019 IEEE 46TH PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2019, : 1398 - 1402
  • [10] LARGE-SCALE PATH PLANNING IN COMPLEX ENVIRONMENTS BASED ON GENETIC ALGORITHM
    Hu, Chuanhui
    Jin, Yan
    [J]. PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3B, 2023,