A State-Decomposition DDPG Algorithm for UAV Autonomous Navigation in 3-D Complex Environments

被引:0
|
作者
Zhang, Lijuan [1 ]
Peng, Jiabin [1 ]
Yi, Weiguo [1 ]
Lin, Hang [1 ]
Lei, Lei [1 ]
Song, Xiaoqin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Coll Integrated Circuits, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous aerial vehicles; Navigation; Autonomous robots; Three-dimensional displays; Training; Heuristic algorithms; Internet of Things; Autonomous navigation; decision making; deep reinforcement learning (DRL); path planning; unmanned aerial vehicle (UAV) autonomy;
D O I
10.1109/JIOT.2023.3327753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the past decade, unmanned aerial vehicles (UAVs) have been widely applied in many areas, such as goods delivery, disaster monitoring, search and rescue etc. In most of these applications, autonomous navigation is one of the key techniques that enable UAV to perform various tasks. However, UAV autonomous navigation in complex environments presents significant challenges due to the difficulty in simultaneously observing, orientation, decision and action. In this work, an efficient state-decomposition deep deterministic policy gradient algorithm is proposed for UAV autonomous navigation (SDDPG-NAV) in 3-D complex environments. In SDDPG-NAV, a novel state-decomposition method that uses two subnetworks for the perception-related and target-related states separately is developed to establish more appropriate actor networks. We also designed some objective-oriented reward functions to solve the sparse reward problem, including approaching the target, and avoiding obstacles and step award functions. Moreover, some training strategies are introduced to maintain the balance between exploration and exploitation, and the network is well trained with numerous experiments. The proposed SDDPG-NAV algorithm is capable of adapting to surrounding environments with generalized training experiences and effectively improves UAV's navigation performance in 3-D complex environments. Comparing with the benchmark DDPG and TD3 algorithms, SDDPG-NAV exhibits better performance in terms of convergence rate, navigation performance, and generalization capability.
引用
收藏
页码:10778 / 10790
页数:13
相关论文
共 50 条
  • [21] Evolutionary Multi-Objective Deep Reinforcement Learning for Autonomous UAV Navigation in Large-Scale Complex Environments
    An, Guangyan
    Wu, Ziyu
    Shen, Zhilong
    Shang, Ke
    Ishibuchi, Hisao
    [J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 633 - 641
  • [22] A Hybrid Planning Method for 3D Autonomous Exploration in Unknown Environments With a UAV
    Chen, Xuning
    Zheng, Jianying
    Hu, Qinglei
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, : 1 - 12
  • [23] Efficient Autonomous Exploration Planning of Large-Scale 3-D Environments
    Selin, Magnus
    Tiger, Maths
    Duberg, Daniel
    Heintz, Fredrik
    Jensfelt, Patric
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) : 1699 - 1706
  • [24] Efficient Autonomous Exploration With Incrementally Built Topological Map in 3-D Environments
    Wang, Chaoqun
    Ma, Han
    Chen, Weinan
    Liu, Li
    Meng, Max Q. -H.
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (12) : 9853 - 9865
  • [25] Autonomous 3-D Reconstruction, Mapping, and Exploration of Indoor Environments With a Robotic Arm
    Wang, Yiming
    James, Stuart
    Stathopoulou, Elisavet Konstantina
    Beltran-Gonzalez, Carlos
    Konishi, Yoshinori
    Del Bue, Alessi
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 3340 - 3347
  • [26] Development of a micro autonomous underwater vehicle for complex 3-D sensing
    Hobson, B
    Schulz, B
    Janét, J
    Kemp, M
    Moody, R
    Pell, C
    Pinnix, H
    [J]. OCEANS 2001 MTS/IEEE: AN OCEAN ODYSSEY, VOLS 1-4, CONFERENCE PROCEEDINGS, 2001, : 2043 - 2045
  • [27] A Multiobjective Optimization Algorithm for Safety and Optimality of 3-D Route Planning in UAV
    Abdel-Basset, Mohamed
    Mohamed, Reda
    Sallam, Karam M.
    Hezam, Ibrahim M.
    Munasinghe, Kumudu
    Jamalipour, Abbas
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 3067 - 3080
  • [28] SkyOrbs: A Fast 3-D Directional Neighbor Discovery Algorithm for UAV Networks
    Zhu, Yuchen
    Liu, Min
    Chen, Yali
    Sun, Sheng
    Li, Zhongcheng
    [J]. IEEE Transactions on Mobile Computing, 2024, 23 (12) : 14768 - 14786
  • [29] Improved 3-D real-time trajectory planning algorithm for UAV
    He, Pingchuan
    Dai, Shuling
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2010, 36 (10): : 1248 - 1251
  • [30] MMPA: A modified marine predator algorithm for 3D UAV path planning in complex environments with multiple threats
    Lyu, Lixin
    Yang, Fan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257