Autonomous UAV Navigation in Dynamic Environments with Double Deep Q-Networks

被引:8
|
作者
Yang, Yupeng [1 ]
Zhang, Kai [1 ]
Liu, Dahai [2 ]
Song, Houbing [1 ]
机构
[1] Embry Riddle Aeronaut Univ, Dept Elect Engn & Comp Sci, Daytona Beach, FL 32114 USA
[2] Embry Riddle Aeronaut Univ, Coll Aviat, Daytona Beach, FL USA
关键词
UAV; autonomous navigation; deep reinforcement learning;
D O I
10.1109/dasc50938.2020.9256455
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
With the rapidly increasing number and complexity of unmanned aircraft systems (UAS), enabling high-density operations becomes the most important goal for UAS operations in congested airspace. However, it is difficult to capture the global environment information such as geolocation of other unmanned aerial vehicles (UAVs) and the steep terrain in real-time. As a result, avoiding dynamic obstacles rather than static ones is challenging. Previous work demonstrates the feasibility of using traditional Q-learning to solve the navigation problem in a static environment, but this method is problematic when facing a dynamic environment because it usually causes the overestimation of action values. To address this challenge, this paper presents a framework based on double deep Q-network with priority experience replay (DDQN-PER) which allows the UAVs to navigate and avoid obstacles in a dynamic environment. The model is built upon convolutional neural networks (CNNs) whose input is raw pixels of the local known environment and whose output is an action after estimating future rewards. We set up multiple experimental scenarios with static and moving obstacles for different tasks which are ranging from single-agent navigation to multi-agent navigation. Then this model is applied to other pre-defined environments, without adjustment of the architecture or learning algorithm, to validate its generalization. Experimental results demonstrate that our proposed models can allow the UAVs reach the goal successfully in new dynamic environments.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Traffic Shaping with Deep Q-Networks for Optimizing the Age of Information
    Lent, Ricardo
    [J]. 2023 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS, LATINCOM, 2023,
  • [42] Uncovering instabilities in variational-quantum deep Q-networks
    Franz, Maja
    Wolf, Lucas
    Periyasamy, Maniraman
    Ufrecht, Christian
    Scherer, Daniel D.
    Plinge, Axel
    Mutschler, Christopher
    Mauerer, Wolfgang
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17): : 13822 - 13844
  • [43] Improving traffic light systems using Deep Q-networks
    Moreno-Malo, Juan
    Posadas-Yague, Juan-Luis
    Cano, Juan Carlos
    Calafate, Carlos T.
    Conejero, J. Alberto
    Poza-Lujan, Jose-Luis
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [44] DinoDroid: Testing Android Apps Using Deep Q-Networks
    Zhao, Yu
    Harrison, Brent
    Yu, Tingting
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (05)
  • [45] Autonomous Navigation of Quadrotors in Dynamic Complex Environments
    Li, Ruocheng
    Xin, Bin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024,
  • [46] Joint Differential Game and Double Deep Q-Networks for Suppressing Malware Spread in Industrial Internet of Things
    Shen, Shigen
    Xie, Lanlan
    Zhang, Yanchun
    Wu, Guowen
    Zhang, Hong
    Yu, Shui
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5302 - 5315
  • [47] Reinforcement Learning with an Ensemble of Binary Action Deep Q-Networks
    Hafiz, A.M.
    Hassaballah, M.
    Alqahtani, Abdullah
    Alsubai, Shtwai
    Hameed, Mohamed Abdel
    [J]. Computer Systems Science and Engineering, 2023, 46 (03): : 2651 - 2666
  • [48] Spatio-Temporal Deep Q-Networks for Human Activity Localization
    Xu, Wanru
    Yu, Jian
    Miao, Zhenjiang
    Wan, Lili
    Ji, Qiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2984 - 2999
  • [49] Cache-Enabled Dynamic Spectrum Access via Deep Recurrent Q-Networks with Partial Observation
    Xu, Y.
    Yu, J.
    Buehrer, R. M.
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2019, : 147 - 148
  • [50] Multi-level deep Q-networks for Bitcoin trading strategies
    Otabek, Sattarov
    Choi, Jaeyoung
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)