Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

被引：0

作者：

Xie, Ronglei ^{[1
]}

Meng, Zhijun ^{[1
]}

Wang, Lifeng ^{[1
]}

Li, Haochen ^{[1
]}

Wang, Kaipeng ^{[1
]}

Wu, Zhe ^{[1
]}

机构：

[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金;

关键词：

Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; path planning; recurrent neural network; COLLISION-AVOIDANCE; UAVS; OPTIMIZATION; NETWORKS;

D O I：

10.1109/ACCESS.2021.3057485

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning by utilizing the local information and relative distance without global information. UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. Therefore, path planning can be formulated as a Partially Observable Markov Decision Process. The recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce the meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. The simulation experiment results show that our method has significant improvements over Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in the large-scale and complex environment, and has the perfect ability to avoid obstacles.in the unknown environment.

引用

页码：24884 / 24900

页数：17

共 50 条

[1] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
Xie, Ronglei
Meng, Zhijun
Wang, Lifeng
Li, Haochen
Wang, Kaipeng
Wu, Zhe
[J]. IEEE Access, 2021, 9 : 24884 - 24900
[2] A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING
Wang, Wenhui
Li, Leqing
Ye, Fumeng
Peng, Yumin
Ma, Yiming
[J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2024, 39 (03): : 204 - 210
[3] An Improved Artificial Potential Field based Path Planning Algorithm for Unmanned Aerial Vehicle in Dynamic Environments
Chen, Shoufeng
Yang, Zhihua
Liu, Zhentao
Jin, Haojie
[J]. 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 591 - 596
[4] Variation Encoded Large-Scale Swarm Optimizers for Path Planning of Unmanned Aerial Vehicle
Xiao, Tan-Lin
Yang, Qiang
Gao, Xu-Dong
Lu, Zhen-Yu
Ma, Yuan-Yuan
Jeon, Sang-Woon
Zhang, Jun
[J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 102 - 110
[5] Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
Zhang, Kai
Wang, Guile
Hu, Jinwen
Xu, Zhao
Guo, Chubing
[J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8392 - 8397
[6] Unmanned aerial vehicle path planning based on TLBO algorithm
[J]. Yu, Guolin (guolin_yu@126.com), 1600, Massey University (07):
[7] UNMANNED AERIAL VEHICLE PATH PLANNING BASED ON TLBO ALGORITHM
Yu, Guolin
Song, Hui
Gao, Jie
[J]. INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (03) : 1310 - 1325
[8] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
Xing, Bowen
Wang, Xiao
Yang, Liu
Liu, Zhenchong
Wu, Qingyun
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
[9] Autonomous Path Planning by Unmanned Aerial Vehicle (UAV) for Precise Monitoring of Large-Scale PV plants
Sizkouhi, Amir Mohammad Moradi
Esmailifar, Sayyed Majid
Aghaei, Mohammadreza
Vidal de Oliveira, Aline Kirsten
Ruther, Ricardo
[J]. 2019 IEEE 46TH PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2019, : 1398 - 1402
[10] LARGE-SCALE PATH PLANNING IN COMPLEX ENVIRONMENTS BASED ON GENETIC ALGORITHM
Hu, Chuanhui
Jin, Yan
[J]. PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3B, 2023,

← 1 2 3 4 5 →