Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

被引:0
|
作者
Xie, Ronglei [1 ]
Meng, Zhijun [1 ]
Wang, Lifeng [1 ]
Li, Haochen [1 ]
Wang, Kaipeng [1 ]
Wu, Zhe [1 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
中国国家自然科学基金;
关键词
Path planning; Reinforcement learning; Heuristic algorithms; Unmanned aerial vehicles; Vehicle dynamics; Safety; Recurrent neural networks; Deep reinforcement learning; path planning; recurrent neural network; COLLISION-AVOIDANCE; UAVS; OPTIMIZATION; NETWORKS;
D O I
10.1109/ACCESS.2021.3057485
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning by utilizing the local information and relative distance without global information. UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. Therefore, path planning can be formulated as a Partially Observable Markov Decision Process. The recurrent neural network with temporal memory is constructed to address the partial observability problem by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value and the state-action value to reduce the meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. The simulation experiment results show that our method has significant improvements over Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach successfully plans a reasonable three-dimensional path in the large-scale and complex environment, and has the perfect ability to avoid obstacles.in the unknown environment.
引用
收藏
页码:24884 / 24900
页数:17
相关论文
共 50 条
  • [21] Path planning of unmanned aerial vehicle based on improved gravitational search algorithm
    LI Pei 1 & DUAN HaiBin 1
    2 State Key Laboratory of Virtual Reality Technology and Systems
    [J]. Science China Technological Sciences, 2012, (10) : 2712 - 2719
  • [22] Unmanned Aerial Vehicle Coverage Path Planning Algorithm based on Cellular Automata
    Song, Zhihua
    Zhang, Han
    Zhang, Xiaojie
    Zhang, Fa
    [J]. 2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 123 - 126
  • [23] A* algorithm based on adaptive expansion convolution for unmanned aerial vehicle path planning
    Xu, Yu
    Li, Yang
    Tai, Yubo
    Lu, Xiaohan
    Jia, Yaodong
    Wang, Yifan
    [J]. INTELLIGENT SERVICE ROBOTICS, 2024, 17 (03) : 521 - 531
  • [24] Path planning of unmanned aerial vehicle based on improved gravitational search algorithm
    Pei Li
    HaiBin Duan
    [J]. Science China Technological Sciences, 2012, 55 : 2712 - 2719
  • [25] Stealth penetration path planning of unmanned aerial vehicle based on dynamic RCS
    Cai, Chao
    Ge, Chao
    Wu, Zhenbo
    Li, Zhen
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 72 - 78
  • [26] Unmanned aerial vehicle dynamic path planning in an uncertain environment
    Yao, Min
    Zhao, Min
    [J]. ROBOTICA, 2015, 33 (03) : 611 - 621
  • [27] Trajectory planning for unmanned aerial vehicle slungHpayload aerial transportation system based on reinforcement learning
    基于强化学习的无人机吊挂负载系统轨迹规划
    [J]. 1600, Editorial Board of Jilin University (51): : 2259 - 2267
  • [28] A new method for unmanned aerial vehicle path planning in complex environments
    He, Yong
    Hou, Ticheng
    Wang, Mingran
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [29] Heterogeneous mission planning for a single unmanned aerial vehicle (UAV) with attention-based deep reinforcement learning
    Jung, Minjae
    Oh, Hyondong
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [30] Heterogeneous mission planning for a single unmanned aerial vehicle (UAV) with attention-based deep reinforcement learning
    Jung, Minjae
    Oh, Hyondong
    [J]. PeerJ Computer Science, 2022, 8