Multi-UAV Dynamic Wireless Networking With Deep Reinforcement Learning

Cited by: 43
Authors
Wang, Qiang [1]
Zhang, Wenqi [1]
Liu, Yuanwei [2]
Liu, Ying [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Queen Mary Univ London, London E1 4NS, England
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Drones; Reinforcement learning; Real-time systems; Wireless networks; Downlink; Capacity; deep reinforcement learning; movement; unmanned aerial vehicles;
DOI
10.1109/LCOMM.2019.2940191
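Chinese Library Classification number
TN [Electronic technology, communication technology];
Discipline classification code
0809 ;
Abstract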
This letter investigates a novel unmanned aerial vehicle (UAV)-enabled wireless communication system in which multiple UAVs transmit information to multiple ground terminals (GTs). We study how the UAVs can optimally exploit their mobility to maximize the real-time downlink capacity while covering all GTs. The system capacity is characterized by optimizing the UAV locations subject to the coverage constraint. We formulate the UAV movement problem as a Constrained Markov Decision Process (CMDP) and employ Q-learning to solve it. Since the state of the UAV movement problem has large dimensions, we propose a Dueling Deep Q-network (DDQN) algorithm, which introduces neural networks and a dueling structure into Q-learning. Simulation results demonstrate that the proposed movement algorithm is able to track the movement of GTs and obtain the real-time optimal capacity, subject to the coverage constraint.
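The dueling structure named in the abstract can be illustrated with a minimal sketch. It assumes the mean-subtracted aggregation commonly used for dueling networks, Q(s, a) = V(s) + A(s, a) - mean(A), together with the standard one-step Q-learning target; the record does not confirm that the letter uses exactly this form. The function names, the five-action movement set, and all numbers are hypothetical.

```python
def dueling_q_values(state_value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a) using the mean advantage as baseline."""
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


def td_target(reward, next_q_values, gamma=0.9, done=False):
    """One-step Q-learning target: r + gamma * max over a' of Q(s', a')."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


# Hypothetical example: one state and five candidate moves for a single UAV
# (the action set is an assumption, not taken from the letter).
q = dueling_q_values(state_value=2.0, advantages=[0.5, -0.5, 1.0, 0.0, -1.0])
best_action = max(range(len(q)), key=q.__getitem__)  # greedy action index
target = td_target(reward=1.0, next_q_values=q)
```

In a full agent the value and advantage terms would come from two heads of a neural network rather than fixed numbers; the sketch only shows how the two streams are merged and how the bootstrap target is formed.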
Pages: 2243-2246
Page count: 4
Related papers
50 in total
  • [1] Dynamic deployment of multi-UAV base stations with deep reinforcement learning
    Wu, Guanhan
    Jia, Weimin
    Zhao, Jianwei
    [J]. ELECTRONICS LETTERS, 2021, 57 (15) : 600 - 602
  • [2] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
    Bayerlein, Harald
    Theile, Mirco
    Caccamo, Marco
    Gesbert, David
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
  • [3] Dynamic Attention Network for Multi-UAV Reinforcement Learning
    Xu, Dongsheng
    Wu, Shang
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
  • [4] Multi-UAV trajectory optimizer: A sustainable system for wireless data harvesting with deep reinforcement learning
    Seong, Mincheol
    Jo, Ohyun
    Shin, Kyungseop
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [5] Three-Dimension Trajectory Design for Multi-UAV Wireless Network With Deep Reinforcement Learning
    Zhang, Wenqi
    Wang, Qiang
    Liu, Xiao
    Liu, Yuanwei
    Chen, Yue
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (01) : 600 - 612
  • [6] Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking
    Moon, Jiseon
    Papaioannou, Savvas
    Laoudias, Christos
    Kolios, Panayiotis
    Kim, Sunwoo
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (20) : 15441 - 15455
  • [7] Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
    Westheider, Jonas
    Rueckin, Julius
    Popovic, Marija
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023 : 649 - 656
  • [8] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021 : 7783 - 7789
  • [9] Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints
    Zhou, Yating
    Shi, Dianxi
    Yang, Huanhuan
    Hu, Haomeng
    Yang, Shaowu
    Zhang, Yongjun
    [J]. COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 363 - 379
  • [10] A deep reinforcement learning based distributed multi-UAV dynamic area coverage algorithm for complex environment
    Xiao, Jian
    Yuan, Guohui
    Xue, Yuxi
    He, Jinhui
    Wang, Yaoting
    Zou, Yuanjiang
    Wang, Zhuoran
    [J]. NEUROCOMPUTING, 2024, 595