Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:21
|
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 50 条
  • [1] Joint Optimization of Trajectory and Node Access in UAV-Aided Data Collection System
    Han, Dongsheng
    Shi, Tianhao
    Han, Tianyu
    Zhou, Zhenyu
    IEEE SYSTEMS JOURNAL, 2023, 17 (02): : 2574 - 2585
  • [2] Joint User Scheduling and UAV Trajectory Design on Completion Time Minimization for UAV-Aided Data Collection
    Yuan, Xiaopeng
    Hu, Yulin
    Zhang, Jian
    Schmeink, Anke
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (06) : 3884 - 3898
  • [3] Joint User Association and UAV Location Optimization for UAV-Aided Communications
    Xi, Xing
    Cao, Xianbin
    Yang, Peng
    Chen, Jingxuan
    Quek, Tony
    Wu, Dapeng
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (06) : 1688 - 1691
  • [4] Deep Reinforcement Learning Based Trajectory Design for Customized UAV-Aided NOMA Data Collection
    Zhang, Lei
    Zhang, Yuandi
    Lu, Jiawangnan
    Xiao, Yunfa
    Zhang, Guanglin
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (12) : 3365 - 3369
  • [5] Joint Resource Allocation and Trajectory Optimization for UAV-Aided Relay Networks
    Hu, Qiyu
    Cai, Yunlong
    Liu, An
    Yu, Guanding
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [6] Joint User Association and Deployment Optimization for Delay-Minimized UAV-Aided MEC Networks
    Han, Zihao
    Zhou, Ting
    Xu, Tianheng
    Hu, Honglin
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (10) : 1791 - 1795
  • [7] UAV-Aided Data Collection for Information Freshness in Wireless Sensor Networks
    Liu, Juan
    Tong, Peng
    Wang, Xijun
    Bai, Bo
    Dai, Huaiyu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (04) : 2368 - 2382
  • [8] A Reinforcement Learning Algorithm for Data Collection in UAV-aided IoT Networks with Uncertain Time Windows
    Cicek, Cihan Tugrul
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [9] Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless Networks
    Marini, Riccardo
    Park, Sangwoo
    Simeone, Osvaldo
    Buratti, Chiara
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5664 - 5669
  • [10] UAV-aided Backscatter Networks: Joint UAV Trajectory and Protocol Design
    Hua, Meng
    Swindlehurst, A. Lee
    Li, Chunguo
    Yang, Luxi
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,