Autonomous UAV Navigation in Dynamic Environments with Double Deep Q-Networks

被引：8

作者：

Yang, Yupeng ^{[1
]}

Zhang, Kai ^{[1
]}

Liu, Dahai ^{[2
]}

Song, Houbing ^{[1
]}

机构：

[1] Embry Riddle Aeronaut Univ, Dept Elect Engn & Comp Sci, Daytona Beach, FL 32114 USA

[2] Embry Riddle Aeronaut Univ, Coll Aviat, Daytona Beach, FL USA

来源：

2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS | 2020年

关键词：

UAV; autonomous navigation; deep reinforcement learning;

D O I：

10.1109/dasc50938.2020.9256455

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

With the rapidly increasing number and complexity of unmanned aircraft systems (UAS), enabling high-density operations becomes the most important goal for UAS operations in congested airspace. However, it is difficult to capture the global environment information such as geolocation of other unmanned aerial vehicles (UAVs) and the steep terrain in real-time. As a result, avoiding dynamic obstacles rather than static ones is challenging. Previous work demonstrates the feasibility of using traditional Q-learning to solve the navigation problem in a static environment, but this method is problematic when facing a dynamic environment because it usually causes the overestimation of action values. To address this challenge, this paper presents a framework based on double deep Q-network with priority experience replay (DDQN-PER) which allows the UAVs to navigate and avoid obstacles in a dynamic environment. The model is built upon convolutional neural networks (CNNs) whose input is raw pixels of the local known environment and whose output is an action after estimating future rewards. We set up multiple experimental scenarios with static and moving obstacles for different tasks which are ranging from single-agent navigation to multi-agent navigation. Then this model is applied to other pre-defined environments, without adjustment of the architecture or learning algorithm, to validate its generalization. Experimental results demonstrate that our proposed models can allow the UAVs reach the goal successfully in new dynamic environments.

引用

页数：7

共 50 条

[41] Traffic Shaping with Deep Q-Networks for Optimizing the Age of Information
Lent, Ricardo
[J]. 2023 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS, LATINCOM, 2023,
[42] Uncovering instabilities in variational-quantum deep Q-networks
Franz, Maja
Wolf, Lucas
Periyasamy, Maniraman
Ufrecht, Christian
Scherer, Daniel D.
Plinge, Axel
Mutschler, Christopher
Mauerer, Wolfgang
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17): : 13822 - 13844
[43] Improving traffic light systems using Deep Q-networks
Moreno-Malo, Juan
Posadas-Yague, Juan-Luis
Cano, Juan Carlos
Calafate, Carlos T.
Conejero, J. Alberto
Poza-Lujan, Jose-Luis
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
[44] DinoDroid: Testing Android Apps Using Deep Q-Networks
Zhao, Yu
Harrison, Brent
Yu, Tingting
[J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (05)
[45] Autonomous Navigation of Quadrotors in Dynamic Complex Environments
Li, Ruocheng
Xin, Bin
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024,
[46] Joint Differential Game and Double Deep Q-Networks for Suppressing Malware Spread in Industrial Internet of Things
Shen, Shigen
Xie, Lanlan
Zhang, Yanchun
Wu, Guowen
Zhang, Hong
Yu, Shui
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5302 - 5315
[47] Reinforcement Learning with an Ensemble of Binary Action Deep Q-Networks
Hafiz, A.M.
Hassaballah, M.
Alqahtani, Abdullah
Alsubai, Shtwai
Hameed, Mohamed Abdel
[J]. Computer Systems Science and Engineering, 2023, 46 (03): : 2651 - 2666
[48] Spatio-Temporal Deep Q-Networks for Human Activity Localization
Xu, Wanru
Yu, Jian
Miao, Zhenjiang
Wan, Lili
Ji, Qiang
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2984 - 2999
[49] Cache-Enabled Dynamic Spectrum Access via Deep Recurrent Q-Networks with Partial Observation
Xu, Y.
Yu, J.
Buehrer, R. M.
[J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2019, : 147 - 148
[50] Multi-level deep Q-networks for Bitcoin trading strategies
Otabek, Sattarov
Choi, Jaeyoung
[J]. SCIENTIFIC REPORTS, 2024, 14 (01)

← 1 2 3 4 5 →