UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

被引:41
|
作者
Bayerlein, Harald [1 ]
Theile, Mirco [2 ]
Caccamo, Marco [2 ]
Gesbert, David [1 ]
机构
[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France
[2] Tech Univ Munich, TUM Dept Mech Engn, Munich, Germany
基金
欧洲研究理事会;
关键词
D O I
10.1109/GLOBECOM42002.2020.9322234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RI) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance. While previous approaches, learning and non-learning based, must perform expensive recomputations or relearn a behavior when important scenario parameters such as the number of sensors, sensor positions, or maximum flying time, change, we train a double deep Q-network (DDQN) with combined experience replay to learn a UAV control policy that generalizes over changing scenario parameters. By exploiting a multi-layer map of the environment fed through convolutional network layers to the agent, we show that our proposed network architecture enables the agent to make movement decisions for a variety of scenario parameters that balance the data collection goal with flight time efficiency and safety constraints. Considerable advantages in learning efficiency from using a map centered on the UAV's position over a non-centered map are also illustrated.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
    Bayerlein, Harald
    Theile, Mirco
    Caccamo, Marco
    Gesbert, David
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
  • [2] Explainable Deep Reinforcement Learning for UAV autonomous path planning
    He, Lei
    Aouf, Nabil
    Song, Bifeng
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [3] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    [J]. 2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [4] UAV online path planning technology based on deep reinforcement learning
    Fan, Jiaxuan
    Wang, Zhenya
    Ren, Jinlei
    Lu, Ying
    Liu, Yiheng
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
  • [5] A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control
    Sabzekar, Sina
    Samadzad, Mahdi
    Mehditabrizi, Asal
    Tak, Ala Nekouvaght
    [J]. UNMANNED SYSTEMS, 2024, 12 (03) : 477 - 498
  • [6] Multi-UAV trajectory optimizer: A sustainable system for wireless data harvesting with deep reinforcement learning
    Seong, Mincheol
    Jo, Ohyun
    Shin, Kyungseop
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [7] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
    Zhang, Ning
    Liu, Juan
    Xie, Lingfu
    Tong, Peng
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
  • [8] Deep Reinforcement Learning-Driven UAV Data Collection Path Planning: A Study on Minimizing AoI
    Huang, Hesong
    Li, Yang
    Song, Ge
    Gai, Wendong
    [J]. ELECTRONICS, 2024, 13 (10)
  • [9] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
    Li, Bohao
    Wu, Yunjie
    [J]. IEEE ACCESS, 2020, 8 : 29064 - 29074
  • [10] Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
    Westheider, Jonas
    Rueckin, Julius
    Popovic, Marija
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 649 - 656