UAV Navigation in 3D Urban Environments with Curriculum-based Deep Reinforcement Learning

被引:1
|
作者
de Carvalho, Kevin Braathen [1 ]
de Oliveira, Iure Rosa L. [1 ]
Brandao, Alexandre S. [1 ,2 ]
机构
[1] Univ Fed Vicosa, Grad Program Comp Sci, Nucl Specializat Robot, Dept Elect Engn, BR-36570900 Vicosa, MG, Brazil
[2] Univ Fed Vicosa, Dept Elect Engn, Vicosa, MG, Brazil
关键词
Deep Reinforcement Learning; Curriculum Learning; Urban Environments; 3D Navigation;
D O I
10.1109/ICUAS57906.2023.10156524
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Unmanned Aerial Vehicles (UAVs) are widely used in various applications, from inspection and surveillance to transportation and delivery. Navigating UAVs in complex 3D environments is a challenging task that requires robust and efficient decision-making algorithms. This paper presents a novel approach to UAV navigation in 3D environments using a Curriculum-based Deep Reinforcement Learning (DRL) approach. The proposed method utilizes a deep neural network to model the UAV's decision-making process and to learn a mapping from the state space to the action space. The learning process is guided by a reinforcement signal that reflects the performance of the UAV in terms of reaching its target while avoiding obstacles and with energy efficiency. Simulation results show that the proposed method has a positive trade off when compared to the baseline algorithm. The proposed method was able to perform well in environments with a state space size of 22 millions, allowing the usage in big environments or in maps with high resolution. The results demonstrate the potential of DRL for enabling UAVs to operate effectively in complex environments.
引用
下载
收藏
页码:1249 / 1255
页数:7
相关论文
共 50 条
  • [31] Deep Reinforcement Learning for Interference Management in UAV-Based 3D Networks: Potentials and Challenges
    Vaezi, Mojtaba
    Lin, Xingqin
    Zhang, Hongliang
    Saad, Walid
    Poor, H. Vincent
    IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (02) : 134 - 140
  • [32] Deep Reinforcement Learning for Procedural Content Generation of 3D Virtual Environments
    Lopez, Christian E.
    Cunningham, James
    Ashour, Omar
    Tucker, Conrad S.
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (05)
  • [33] Safe Navigation for UAV-Enabled Data Dissemination by Deep Reinforcement Learning in Unknown Environments
    Fei Huang
    Guangxia Li
    Shiwei Tian
    Jin Chen
    Guangteng Fan
    Jinghui Chang
    China Communications, 2022, 19 (01) : 202 - 217
  • [34] Connectivity-Aware 3D UAV Path Design With Deep Reinforcement Learning
    Xie, Hao
    Yang, Dingcheng
    Xiao, Lin
    Lyu, Jiangbin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 13022 - 13034
  • [35] Safe Navigation for UAV-Enabled Data Dissemination by Deep Reinforcement Learning in Unknown Environments
    Huang, Fei
    Li, Guangxia
    Tian, Shiwei
    Chen, Jin
    Fan, Guangteng
    Chang, Jinghui
    CHINA COMMUNICATIONS, 2022, 19 (01) : 202 - 217
  • [36] 3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning
    Nguyen, Khoi Khac
    Duong, Trung Q.
    Tan Do-Duy
    Claussen, Holger
    Hanzo, Lajos
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (04) : 2358 - 2371
  • [37] Deep Reinforcement Learning Based Mobile Robot Navigation in Crowd Environments
    Yang, Guang
    Guo, Yi
    2024 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR 2024, 2024, : 513 - 519
  • [38] Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach
    Bouhamed, Omar
    Ghazzai, Hakim
    Besbes, Hichem
    Massoud, Yehia
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [39] Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards
    Wang, Chao
    Wang, Jian
    Wang, Jingjing
    Zhang, Xudong
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6180 - 6190
  • [40] Automatic Drone Navigation in Realistic 3D Landscapes using Deep Reinforcement Learning
    Shin, Sang-Yun
    Kang, Yong -Won
    Kim, Yong-Guk
    2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019, : 1072 - 1077