Multi-Agent Deep Reinforcement Learning Based UAV Trajectory Optimization for Differentiated Services

被引:20
|
作者
Ning, Zhaolong [1 ]
Yang, Yuxuan [2 ]
Wang, Xiaojie [1 ]
Song, Qingyang [1 ]
Guo, Lei [1 ]
Jamalipour, Abbas [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2050, Australia
关键词
Autonomous aerial vehicles; Servers; Computational efficiency; Task analysis; Trajectory optimization; Resource management; Costs; Multi-access edge computing; UAV-assisted communications; game theory; multi-agent DRL; RESOURCE-ALLOCATION; TASK;
D O I
10.1109/TMC.2023.3312276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Driven by the increasing computational demand of real-time mobile applications, Unmanned Aerial Vehicle (UAV) assisted Multi-access Edge Computing (MEC) has been envisioned as a promising paradigm for pushing computational resources to network edges and constructing high-throughput line-of-sight links for ground users. Most exsiting studies consider simplified scenarios, such as a single UAV, Service Provider (SP) or service type, and centralized UAV trajectory control. In order to be more in line with real-world cases, we intend to achieve distributed trajectory control of multiple UAVs in UAV-assisted MEC networks with multiple SPs providing differentiated services. Our objective is to minimize the short-term computational costs of ground users and the long-term computational cost of UAVs, simultaneously based on incomplete information. We first solve the formulated problem by reaching the Nash Equilibrium (NE) of the game among SPs based on complete information. We further formulate a Markov game model and propose a Deep Reinforcement Learning (DRL)-based UAV trajectory optimization algorithm, where only local observations of each UAV are required for each SP's flying action execution. Theoretical analysis and performance evaluation demonstrate the convergence, efficiency, scalability, and robustness of our algorithm compared with other representative algorithms.
引用
收藏
页码:5818 / 5834
页数:17
相关论文
共 50 条
  • [41] UAV-Enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning
    Liu, Saichao
    Sun, Geng
    Li, Jiahui
    Liang, Shuang
    Wu, Qingqing
    Wang, Pengfei
    Niyato, Dusit
    [J]. IEEE Transactions on Mobile Computing, 2024, 23 (12) : 13015 - 13032
  • [42] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
    He, Hang
    Ren, Tao
    Qiu, Yuan
    Hu, Zheyuan
    Li, Yanqi
    [J]. SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
  • [43] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
    Zhao, Xiaoru
    Yang, Rennong
    Zhong, Liangsheng
    Hou, Zhiwei
    [J]. DRONES, 2024, 8 (01)
  • [44] Multi-Agent Reinforcement Learning for Convex Optimization
    Morcos, Amir
    West, Aaron
    Maguire, Brian
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [45] Distributed Drive Autonomous Vehicle Trajectory Tracking Control Based on Multi-Agent Deep Reinforcement Learning
    Liu, Yalei
    Ding, Weiping
    Yang, Mingliang
    Zhu, Honglin
    Liu, Liyuan
    Jin, Tianshi
    [J]. MATHEMATICS, 2024, 12 (11)
  • [46] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    [J]. 2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [47] Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning
    Hu, Jingzhi
    Zhang, Hongliang
    Song, Lingyang
    Schober, Robert
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (11) : 6807 - 6821
  • [48] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [49] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
  • [50] Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning
    Guo, Delin
    Tang, Lan
    Zhang, Xinggan
    Liang, Ying-Chang
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13124 - 13138