Reinforcement Learning in Multiple-UAV Networks: Deployment and Movement Design

被引:187
|
作者
Liu, Xiao [1 ]
Liu, Yuanwei [1 ]
Chen, Yue [1 ]
机构
[1] Queen Mary Univ London, London E1 4NS, England
关键词
Dynamic movement; Q-learning; quality of experience (QoE); three-dimensional deployment; unmanned aerial vehicle (UAV); UNMANNED AERIAL VEHICLES; RESOURCE-ALLOCATION; WIRELESS COMMUNICATIONS; COMMUNICATION; OPTIMIZATION; ALGORITHM; PLACEMENT; MOBILE; 5G;
D O I
10.1109/TVT.2019.2922849
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel framework is proposed for quality of experience driven deployment and dynamic movement of multiple unmanned aerial vehicles (UAVs). The problem of joint non-convex three-dimensional (3-D) deployment and dynamic movement of the UAVs is formulated for maximizing the sum mean opinion score of ground users, which is proved to be NP-hard. In the aim of solving this pertinent problem, a three-step approach is proposed for attaining 3-D deployment and dynamic movement of multiple UAVs. First, a genetic algorithm based K-means (GAK-means) algorithm is utilized for obtaining the cell partition of the users. Second, Q-learning based deployment algorithm is proposed, in which each UAV acts as an agent, making their own decision for attaining 3-D position by learning from trial and mistake. In contrast to the conventional genetic algorithm based learning algorithms, the proposed algorithm is capable of training the direction selection strategy offline. Third, Q-learning based movement algorithm is proposed in the scenario that the users are roaming. The proposed algorithm is capable of converging to an optimal state. Numerical results reveal that the proposed algorithms show a fast convergence rate after a small number of iterations. Additionally, the proposed Q-learning based deployment algorithm outperforms K-means algorithms and Iterative-GAKmean algorithms with low complexity.
引用
收藏
页码:8036 / 8049
页数:14
相关论文
共 50 条
  • [21] Green MEC Networks Design Under UAV Attack: A Deep Reinforcement Learning Approach
    Zhao, Rui
    Xia, Junjuan
    Zhao, Zichao
    Lai, Shiwei
    Fan, Lisheng
    Li, Dong
    [J]. IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2021, 5 (03): : 1248 - 1258
  • [22] Reinforcement Learning Based Topology Control for UAV Networks
    Yoo, Taehoon
    Lee, Sangmin
    Yoo, Kyeonghyun
    Kim, Hwangnam
    [J]. SENSORS, 2023, 23 (02)
  • [23] A Machine Learning UAV Deployment Approach for Emergency Cellular Networks
    Tsipi, Lefteris
    Tatsis, Vasileios I.
    Skoutas, Dimitrios N.
    Vouyioukas, Demosthenes
    Skianis, Charalabos
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5683 - 5688
  • [24] Dynamic deployment of multi-UAV base stations with deep reinforcement learning
    Wu, Guanhan
    Jia, Weimin
    Zhao, Jianwei
    [J]. ELECTRONICS LETTERS, 2021, 57 (15) : 600 - 602
  • [25] UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning
    Li, Zhiwei
    Lu, Yu
    Li, Xi
    Wang, Zengguang
    Qiao, Wenxin
    Liu, Yicen
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15): : 12289 - 12310
  • [26] Control of Multiple-UAV Conveying Slung Load With Obstacle Avoidance
    Aliyu, Abdulrahman
    El Ferik, Sami
    [J]. IEEE ACCESS, 2022, 10 : 62247 - 62257
  • [27] QoE-Driven Adaptive Deployment Strategy of Multi-UAV Networks Based on Hybrid Deep Reinforcement Learning
    Zhou, Yi
    Ma, Xiaoyong
    Hu, Shuting
    Zhou, Danyang
    Cheng, Nan
    Lu, Ning
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) : 5868 - 5881
  • [28] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2940 - 2951
  • [29] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [30] Reinforcement Learning for Decentralized Trajectory Design in Cellular UAV Networks With Sense-and-Send Protocol
    Hu, Jingzhi
    Zhang, Hongliang
    Song, Lingyang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (04): : 6177 - 6189