UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

被引：44

作者：

Bayerlein, Harald ^{[1
]}

Theile, Mirco ^{[2
]}

Caccamo, Marco ^{[2
]}

Gesbert, David ^{[1
]}

机构：

[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France

[2] Tech Univ Munich, TUM Dept Mech Engn, Munich, Germany

来源：

2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2020年

基金：

欧洲研究理事会;

关键词：

D O I：

10.1109/GLOBECOM42002.2020.9322234

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RI) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance. While previous approaches, learning and non-learning based, must perform expensive recomputations or relearn a behavior when important scenario parameters such as the number of sensors, sensor positions, or maximum flying time, change, we train a double deep Q-network (DDQN) with combined experience replay to learn a UAV control policy that generalizes over changing scenario parameters. By exploiting a multi-layer map of the environment fed through convolutional network layers to the agent, we show that our proposed network architecture enables the agent to make movement decisions for a variety of scenario parameters that balance the data collection goal with flight time efficiency and safety constraints. Considerable advantages in learning efficiency from using a map centered on the UAV's position over a non-centered map are also illustrated.

引用

页数：6

共 50 条

[1] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
Bayerlein, Harald
Theile, Mirco
Caccamo, Marco
Gesbert, David
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
[2] Explainable Deep Reinforcement Learning for UAV autonomous path planning
He, Lei
Aouf, Nabil
Song, Bifeng
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
[3] A UAV Path Planning Method Based on Deep Reinforcement Learning
Li, Yibing
Zhang, Sitong
Ye, Fang
Jiang, Tao
Li, Yingsong
2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
[4] Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space
Benalaya, Najoua
Amdouni, Ichrak
Adjih, Cedric
Laouiti, Anis
Saidane, Leila Azouz
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 1437 - 1442
[5] UAV online path planning technology based on deep reinforcement learning
Fan, Jiaxuan
Wang, Zhenya
Ren, Jinlei
Lu, Ying
Liu, Yiheng
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
[6] A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control
Sabzekar, Sina
Samadzad, Mahdi
Mehditabrizi, Asal
Tak, Ala Nekouvaght
UNMANNED SYSTEMS, 2024, 12 (03) : 477 - 498
[7] Multi-UAV trajectory optimizer: A sustainable system for wireless data harvesting with deep reinforcement learning
Seong, Mincheol
Jo, Ohyun
Shin, Kyungseop
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[8] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
Zhang, Ning
Liu, Juan
Xie, Lingfu
Tong, Peng
2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
[9] Deep Reinforcement Learning-Driven UAV Data Collection Path Planning: A Study on Minimizing AoI
Huang, Hesong
Li, Yang
Song, Ge
Gai, Wendong
ELECTRONICS, 2024, 13 (10)
[10] Research on Path Planning of Agricultural UAV Based on Improved Deep Reinforcement Learning
Fu, Haitao
Li, Zheng
Zhang, Weijian
Feng, Yuxuan
Zhu, Li
Fang, Xu
Li, Jian
AGRONOMY-BASEL, 2024, 14 (11):

← 1 2 3 4 5 →