Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:21
|
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 50 条
  • [31] Reinforcement Learning-Based Trajectory Planning For UAV-aided Vehicular Communications
    Marini, Riccardo
    Spampinato, Leonardo
    Mignardi, Silvia
    Verdone, Roberto
    Buratti, Chiara
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 967 - 971
  • [32] Trajectory Design in UAV-Aided Mobile Crowdsensing: A Deep Reinforcement Learning Approach
    Tao, Xi
    Hafid, Abdelhakim Senhaji
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [33] UAV-aided Secure NOMA Transmission via Trajectory and Resource Optimization
    Li, Yanxin
    Wang, Wei
    Liu, Mingqian
    Zhao, Nan
    Jiang, Xu
    Chen, Yunfei
    Wang, Xianbin
    13th International Conference on Wireless Communications and Signal Processing, WCSP 2021, 2021,
  • [34] Joint Optimization of UAV Trajectory and Sensor Uploading Powers for UAV-Assisted Data Collection in Wireless Sensor Networks
    Wang, Yinlu
    Chen, Ming
    Pan, Cunhua
    Wang, Kezhi
    Pan, Yijin
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13) : 11214 - 11226
  • [35] Energy-Efficient 3D Trajectory Optimization for UAV-Aided Wireless Sensor Networks
    Ma, Yue
    Tang, Yanqun
    Mao, Zhongjun
    Zhang, Di
    Yang, Chao
    Li, Wei
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6591 - 6596
  • [36] UAV-Aided Wireless Powered Communication Networks: Trajectory Optimization and Resource Allocation for Minimum Throughput Maximization
    Park, Junhee
    Lee, Hoon
    Eom, Subin
    Lee, Inkyu
    IEEE ACCESS, 2019, 7 : 134978 - 134991
  • [37] Joint Beamforming and Trajectory Optimization for UAV-Aided ISAC with Dipole Antenna Array
    Yilmaz, Mustafa Burak
    Xiang, Lin
    Klein, Anja
    27TH INTERNATIONAL WORKSHOP ON SMART ANTENNAS, WSA 2024, 2024, : 80 - 86
  • [38] Joint Optimization of Trajectory and Jamming Power for Multiple UAV-Aided Proactive Eavesdropping
    Guo, Delin
    Tang, Lan
    Zhang, Xinggan
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5770 - 5785
  • [39] Joint Trajectory Design and Radio Resource Management for UAV-Aided Vehicular Networks
    Spampinato, Leonardo
    Ferretti, Danila
    Buratti, Chiara
    Marini, Riccardo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 847 - 860
  • [40] Joint Optimization of UAV Trajectory and Relay Ratio in UAV-Aided Mobile Edge Computation Network
    Zhang, Xinhe
    Zhang, Heli
    Ji, Hong
    Li, Xi
    2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,