Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

被引：0

作者：

Xie, Ronglei ^{[1
]}

Meng, Zhijun ^{[1
]}

Wang, Lifeng ^{[1
]}

Li, Haochen ^{[1
]}

Wang, Kaipeng ^{[1
]}

Wu, Zhe ^{[1
]}

机构：

[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金;

关键词：

Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; path planning; recurrent neural network; COLLISION-AVOIDANCE; UAVS; OPTIMIZATION; NETWORKS;

D O I：

10.1109/ACCESS.2021.3057485

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning by utilizing the local information and relative distance without global information. UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. Therefore, path planning can be formulated as a Partially Observable Markov Decision Process. The recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce the meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. The simulation experiment results show that our method has significant improvements over Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in the large-scale and complex environment, and has the perfect ability to avoid obstacles.in the unknown environment.

引用

页码：24884 / 24900

页数：17

共 50 条

[31] A novel reinforcement learning based grey wolf optimizer algorithm for unmanned aerial vehicles (UAVs) path planning
Qu, Chengzhi
Gai, Wendong
Zhong, Maiying
Zhang, Jing
[J]. APPLIED SOFT COMPUTING, 2020, 89
[32] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
Zheng, Kaiyuan
Gao, Jingpeng
Shen, Liangxi
[J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
[33] Unmanned Aerial Vehicle Path Planning Based on Improved Intelligent Water Drop Algorithm
Sun, Xixia
Pan, Su
Cai, Chao
Chen, Yanfang
Chen, Jie
[J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 867 - 872
[34] An Improved Path Planning Algorithm for Unmanned Aerial Vehicle Based on RRT-Connect
Zhang, Denggui
Xu, Yong
Yao, Xingting
[J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 4854 - 4858
[35] A path planning method for unmanned aerial vehicle based on improved wolf pack algorithm
Jiang, Hao
Yu, Qizhou
Han, Dan
Chen, Yaqing
Li, Zejun
[J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (14):
[36] Hybrid FWPS cooperation algorithm based unmanned aerial vehicle constrained path planning
Zhang, Xiangyin
Xia, Shuang
Zhang, Tian
Li, Xiuzhi
[J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
[37] Path planning based on unmanned aerial vehicle performance with segmented cellular genetic algorithm
Gezer, Ahmet
Turan, Önder
Baklacıoğlu, Tolga
[J]. Journal of the Faculty of Engineering and Architecture of Gazi University, 2024, 40 (01): : 135 - 153
[38] Dynamic path planning based on improved boundary value problem for unmanned aerial vehicle
Liang, Xiao
Meng, Guanglei
Luo, Haitao
Chen, Xia
[J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2087 - 2096
[39] Trajectory Planning of Unmanned Aerial Vehicle Based On A* Algorithm
Xu, Hao
Xu, Xiangrong
Li, Yan
Zhu, Xiaosheng
Jia, Liming
Shi, Dongqing
[J]. 2014 IEEE 4TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2014, : 463 - 468
[40] Neighborhood global learning based flower pollination algorithm and its application to unmanned aerial vehicle path planning
Chen, Yang
Pi, Dechang
Xu, Yue
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 170

← 1 2 3 4 5 →