Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

被引:0
|
作者
Xie, Ronglei [1 ]
Meng, Zhijun [1 ]
Wang, Lifeng [1 ]
Li, Haochen [1 ]
Wang, Kaipeng [1 ]
Wu, Zhe [1 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
中国国家自然科学基金;
关键词
Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; path planning; recurrent neural network; COLLISION-AVOIDANCE; UAVS; OPTIMIZATION; NETWORKS;
D O I
10.1109/ACCESS.2021.3057485
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning by utilizing the local information and relative distance without global information. UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. Therefore, path planning can be formulated as a Partially Observable Markov Decision Process. The recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce the meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. The simulation experiment results show that our method has significant improvements over Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in the large-scale and complex environment, and has the perfect ability to avoid obstacles.in the unknown environment.
引用
收藏
页码:24884 / 24900
页数:17
相关论文
共 50 条
  • [31] A novel reinforcement learning based grey wolf optimizer algorithm for unmanned aerial vehicles (UAVs) path planning
    Qu, Chengzhi
    Gai, Wendong
    Zhong, Maiying
    Zhang, Jing
    [J]. APPLIED SOFT COMPUTING, 2020, 89
  • [32] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaiyuan
    Gao, Jingpeng
    Shen, Liangxi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
  • [33] Unmanned Aerial Vehicle Path Planning Based on Improved Intelligent Water Drop Algorithm
    Sun, Xixia
    Pan, Su
    Cai, Chao
    Chen, Yanfang
    Chen, Jie
    [J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 867 - 872
  • [34] An Improved Path Planning Algorithm for Unmanned Aerial Vehicle Based on RRT-Connect
    Zhang, Denggui
    Xu, Yong
    Yao, Xingting
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 4854 - 4858
  • [35] A path planning method for unmanned aerial vehicle based on improved wolf pack algorithm
    Jiang, Hao
    Yu, Qizhou
    Han, Dan
    Chen, Yaqing
    Li, Zejun
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (14):
  • [36] Hybrid FWPS cooperation algorithm based unmanned aerial vehicle constrained path planning
    Zhang, Xiangyin
    Xia, Shuang
    Zhang, Tian
    Li, Xiuzhi
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [37] Path planning based on unmanned aerial vehicle performance with segmented cellular genetic algorithm
    Gezer, Ahmet
    Turan, Önder
    Baklacıoğlu, Tolga
    [J]. Journal of the Faculty of Engineering and Architecture of Gazi University, 2024, 40 (01): : 135 - 153
  • [38] Dynamic path planning based on improved boundary value problem for unmanned aerial vehicle
    Liang, Xiao
    Meng, Guanglei
    Luo, Haitao
    Chen, Xia
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2087 - 2096
  • [39] Trajectory Planning of Unmanned Aerial Vehicle Based On A* Algorithm
    Xu, Hao
    Xu, Xiangrong
    Li, Yan
    Zhu, Xiaosheng
    Jia, Liming
    Shi, Dongqing
    [J]. 2014 IEEE 4TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2014, : 463 - 468
  • [40] Neighborhood global learning based flower pollination algorithm and its application to unmanned aerial vehicle path planning
    Chen, Yang
    Pi, Dechang
    Xu, Yue
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 170