Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:21
|
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 50 条
  • [21] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
    Zhang, Ning
    Liu, Juan
    Xie, Lingfu
    Tong, Peng
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
  • [22] Efficient Data Collection in Large-Scale UAV-aided Wireless Sensor Networks
    Chen, Jiahui
    Yan, Feng
    Mao, Shenshen
    Shen, Fei
    Xia, Weiwei
    Wu, Yi
    Shen, Lianfeng
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [23] Resource Allocation and Trajectory Design in UAV-Aided Cellular Networks Based on Multiagent Reinforcement Learning
    Yin, Sixing
    Yu, F. Richard
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (04) : 2933 - 2943
  • [24] Jointly optimal fair data collection and trajectory design algorithms in UAV-aided cellular networks
    Song, Dan
    Zhai, Xiangping Bryce
    Liu, Xin
    Tan, Chee Wei
    IEEE Wireless Communications and Networking Conference, WCNC, 2021, 2021-March
  • [25] Jointly Optimal Fair Data Collection and Trajectory Design Algorithms in UAV-Aided Cellular Networks
    Song, Dan
    Zhai, Xiangping Bryce
    Liu, Xin
    Tan, Chee Wei
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [26] Joint Flight Cruise Control and Data Collection in UAV-Aided Internet of Things: An Onboard Deep Reinforcement Learning Approach
    Li, Kai
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9787 - 9799
  • [27] Trajectory optimization for the UAV assisted data collection in wireless sensor networks
    Saxena, Kartik
    Gupta, Nitin
    Gupta, Jahnvi
    Sharma, Deepak Kumar
    Dev, Kapal
    WIRELESS NETWORKS, 2022, 28 (04) : 1785 - 1796
  • [28] Trajectory optimization for the UAV assisted data collection in wireless sensor networks
    Kartik Saxena
    Nitin Gupta
    Jahnvi Gupta
    Deepak Kumar Sharma
    Kapal Dev
    Wireless Networks, 2022, 28 : 1785 - 1796
  • [29] Joint Scheduling and Trajectory Design for UAV-Aided Wireless Power Transfer System
    Wang, Yi
    Hua, Meng
    Liu, Zhi
    Zhang, Di
    Dai, Haibo
    Hu, Ying
    5G FOR FUTURE WIRELESS NETWORKS, 2019, 278 : 3 - 17
  • [30] UAV-Aided Wireless Power Transfer and Data Collection in Rician Fading
    Liu, Yuan
    Xiong, Ke
    Lu, Yang
    Ni, Qiang
    Fan, Pingyi
    Ben Letaief, Khaled
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (10) : 3097 - 3113