Resource Allocation in UAV-Assisted Networks: A Clustering-Aided Reinforcement Learning Approach

被引:17
|
作者
Zhou, Shiyang [1 ]
Cheng, Yufan [1 ]
Lei, Xia [1 ]
Peng, Qihang [2 ]
Wang, Jun [1 ]
Li, Shaoqian [1 ]
机构
[1] Univ Elect Sci & Technol China UESTC, Natl Key Lab Sci & Technol Commun, Chengdu 611731, Peoples R China
[2] UESTC, Sch Commun & Informat Engn, Sichuan 611731, Peoples R China
关键词
Unmanned aerial vehicle (UAV); resource allocation; optimization; clustering-aided reinforcement learning; JOINT TRAJECTORY DESIGN; THROUGHPUT MAXIMIZATION; WIRELESS NETWORKS; CELLULAR NETWORK; POWER ALLOCATION; BASE STATIONS; COMMUNICATION; MOBILE; PLACEMENT; ACCESS;
D O I
10.1109/TVT.2022.3189552
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an aerial base station, unmanned aerial vehicle (UAV) has been considered as a promising technology to assist future wireless communications due to its flexible, swift and low cost features, where resource allocation is the basis for ensuring energy-efficient UAV-assisted networks. This paper formulates a joint optimization problem of user association, UAV trajectory design and power control to maximize the channel capacity among all ground users at a limited power level in a downlink transmission. To tackle the mixed-integer non-linear programming problem, this paper proposes a clustering-aided reinforcement learning approach consisting of three consecutive stages. Firstly, modified expectation-maximization unsupervised learning algorithm is investigated to cluster the ground users, which reduces the dimensions and hence, the association complexity is reduced as well. Then, Kuhn-Munkres algorithm is incorporated for user association, which associates a UAV with the ground users via matching to the cluster, and assigns the UAVs to the centroid of the matching cluster for pre-placement, with the aim of speeding up the convergence of the following deep reinforcement learning algorithm. Finally, a multi-agent twin delayed deep deterministic (MATD3) policy gradient is proposed to solve the non-convex sub-problem, which determines the transmit power and designs the fine-tuned trajectory of UAVs. By incorporating low-bias value estimation technique, the reward of the proposed MATD3 algorithm is improved. Simulation results have demonstrated that our proposed approach achieves higher reward as well as converging faster than existing reinforcement algorithms. Besides, the clustering-aided reinforcement learning has lower computational complexity than the benchmark schemes.
引用
收藏
页码:12088 / 12103
页数:16
相关论文
共 50 条
  • [1] Resource Allocation in UAV-Assisted Wireless Networks Using Reinforcement Learning
    Luong, Phuong
    Gagnon, Francois
    Labeau, Fabrice
    [J]. 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [2] Deep Reinforcement Learning for Jointly Resource Allocation and Trajectory Planning in UAV-Assisted Networks
    Jwaifel, Arwa Mahmoud
    Van Do, Tien
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 71 - 83
  • [3] User Association and Power Allocation for UAV-Assisted Networks:A Distributed Reinforcement Learning Approach
    Xin Guan
    Yang Huang
    Chao Dong
    Qihui Wu
    [J]. China Communications, 2020, 17 (12) : 110 - 122
  • [4] User Association and Power Allocation for UAV-Assisted Networks: A Distributed Reinforcement Learning Approach
    Guan, Xin
    Huang, Yang
    Dong, Chao
    Wu, Qihui
    [J]. CHINA COMMUNICATIONS, 2020, 17 (12) : 110 - 122
  • [5] Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks
    Luong, Phuong
    Gagnon, Francois
    Tran, Le-Nam
    Labeau, Fabrice
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) : 7610 - 7625
  • [6] Multi-Agent Low-Bias Reinforcement Learning for Resource Allocation in UAV-Assisted Networks
    Zhou, Shiyang
    Cheng, Yufan
    Lei, Xia
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 1011 - 1016
  • [7] Deep Reinforcement Learning Based Resource Management in UAV-Assisted IoT Networks
    Munaye, Yirga Yayeh
    Juang, Rong-Terng
    Lin, Hsin-Piao
    Tarekegn, Getaneh Berie
    Lin, Ding-Bing
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 20
  • [8] Learning Based Dynamic Resource Allocation in UAV-assisted Mobile Crowdsensing Networks
    Liu, Wenshuai
    Zhou, Yuzhi
    Fu, Yaru
    [J]. 2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [9] Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications
    Zhang, Chiya
    Li, Zhukun
    He, Chunlong
    Wang, Kezhi
    Pan, Cunhua
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (09) : 2398 - 2402
  • [10] Resource allocation for UAV-assisted 5G mMTC slicing networks using deep reinforcement learning
    Rohit Kumar Gupta
    Saubhik Kumar
    Rajiv Misra
    [J]. Telecommunication Systems, 2023, 82 : 141 - 159