Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

Cited by: 0

Authors:

Xie, Ronglei [1]

Meng, Zhijun [1]

Wang, Lifeng [1]

Li, Haochen [1]

Wang, Kaipeng [1]

Wu, Zhe [1]
Affiliations:

[1] Beihang University, School of Aeronautic Science and Engineering, Beijing 100191, People's Republic of China

Source:

IEEE Access | 2021 / Vol. 9

Funding:

National Natural Science Foundation of China

Keywords:

Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; Collision avoidance; UAVs; Optimization; Networks

DOI:

10.1109/ACCESS.2021.3057485

Chinese Library Classification (CLC):

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Path planning is a key technology for the autonomous flight of unmanned aerial vehicles (UAVs). Traditional path planning algorithms have limitations in complex and dynamic environments. In this article, we propose a deep reinforcement learning approach to three-dimensional path planning that uses only local information and relative distance, without global information. In a real scenario, a UAV with limited sensor capabilities can obtain only limited information about its nearby environment, so path planning can be formulated as a Partially Observable Markov Decision Process. A recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. Simulation results show that our method significantly improves on Deep Q-Network and Deep Recurrent Q-Network in stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in a large-scale, complex environment and avoids obstacles effectively in an unknown environment.
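
The idea of two sample memory pools with failure-driven replay can be illustrated with a minimal Python sketch. The abstract does not say how the pools are filled or how the failure frequency sets the sampling mix, so everything here is an assumption made for illustration: the class name `DualPoolReplay`, the split into a success pool and a failure pool, the sliding window of episode outcomes, and the rule that the failure pool's share of each batch equals the recent failure rate. It is not the authors' implementation.

```python
import random
from collections import deque


class DualPoolReplay:
    """Hypothetical two-pool replay buffer (illustrative, not the paper's code).

    Assumption: transitions are stored in a failure pool or a success pool,
    and the fraction of each batch drawn from the failure pool equals the
    failure rate over a sliding window of recent episodes.
    """

    def __init__(self, capacity=10000, window=100, seed=None):
        self.success_pool = deque(maxlen=capacity)
        self.failure_pool = deque(maxlen=capacity)
        self.outcomes = deque(maxlen=window)  # 1 = episode failed, 0 = succeeded
        self.rng = random.Random(seed)

    def add(self, transition, failed):
        """Store one transition in the pool matching its episode outcome."""
        pool = self.failure_pool if failed else self.success_pool
        pool.append(transition)

    def end_episode(self, failed):
        """Record whether the finished episode failed (e.g. a collision)."""
        self.outcomes.append(1 if failed else 0)

    def failure_rate(self):
        """Failure frequency over the recent window (0.5 before any episode)."""
        if not self.outcomes:
            return 0.5
        return sum(self.outcomes) / len(self.outcomes)

    def sample(self, batch_size):
        """Draw a mixed batch; more recent failures means more failure samples."""
        n_fail = min(round(batch_size * self.failure_rate()), len(self.failure_pool))
        n_ok = min(batch_size - n_fail, len(self.success_pool))
        batch = self.rng.sample(list(self.failure_pool), n_fail)
        batch += self.rng.sample(list(self.success_pool), n_ok)
        self.rng.shuffle(batch)
        return batch
```

Under this assumed rule, an agent that has failed often recently replays more failure transitions, and the mix shifts back toward successful ones as the failure rate drops. Other mappings from failure frequency to sampling ratio would fit the abstract's description equally well.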


Pages: 24884-24900

Page count: 17
