Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:21
|
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 50 条
  • [41] Joint User Scheduling, Power Configuration and Trajectory Planning Strategy for UAV-Aided WSNs
    Wang, Xindi
    Liu, Xinyu
    Wu, Jianjian
    Ju, Wei
    Chen, Xiaojing
    Shen, Ling
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (01)
  • [42] Deep Reinforcement Learning for AoI Minimization in UAV-Aided Data Collection for WSN and IoT Applications: A Survey
    Amodu, Oluwatosin Ahmed
    Jarray, Chedia
    Mahmood, Raja Azlina Raja
    Althumali, Huda
    Bukar, Umar Ali
    Nordin, Rosdiadee
    Abdullah, Nor Fadzilah
    Luong, Nguyen Cong
    IEEE ACCESS, 2024, 12 : 108000 - 108040
  • [43] Optimization of Placement and Resource Allocation in UAV-Aided Multihop Wireless Networks
    Nikooroo, Mohammadsaleh
    Esrafilian, Omid
    Becvar, Zdenek
    Gesbert, David
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 20051 - 20071
  • [44] Joint Precoding Optimization for Secure SWIPT in UAV-Aided NOMA Networks
    Wang, Wei
    Tang, Jie
    Zhao, Nan
    Liu, Xin
    Zhang, Xiu Yin
    Chen, Yunfei
    Qian, Yi
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (08) : 5028 - 5040
  • [45] UAV-Aided Cooperative Data Collection Scheme for Ocean Monitoring Networks
    Ma, Ruofei
    Wang, Ruisong
    Liu, Gongliang
    Meng, Weixiao
    Liu, Xiqing
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17) : 13222 - 13236
  • [46] Trajectory Planning in UAV-Assisted Wireless Networks via Reinforcement Learning
    He, Simeng
    Zhang, Shangwei
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 232 - 237
  • [47] Trajectory Optimization of UAV for Efficient Data Collection from Wireless Sensor Networks
    Luo, Chuanwen
    Wu, Lidong
    Chen, Wenping
    Wang, Yongcai
    Li, Deying
    Wu, Weili
    ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, AAIM 2019, 2019, 11640 : 223 - 235
  • [48] Joint Resource Allocation and Trajectory Design for UAV-Aided Wireless Physical Layer Security
    Sun, Xiaofang
    Shen, Chao
    Chang, Tsung-Hui
    Zhong, Zhangdui
    2018 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2018,
  • [49] Energy-Efficient and Fast Data Collection in UAV-Aided Wireless Sensor Networks for Hilly Terrains
    Nazib, Rezoan Ahmed
    Moh, Sangman
    IEEE ACCESS, 2021, 9 : 23168 - 23190
  • [50] Data-driven Deep Reinforcement Learning for Online Flight Resource Allocation in UAV-aided Wireless Powered Sensor Networks
    Li, Kai
    Ni, Wei
    Kurunathan, Harrison
    Dressler, Falko
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,