Multi-Agent Deep Reinforcement Learning Based UAV Trajectory Optimization for Differentiated Services

被引:20
|
作者
Ning, Zhaolong [1 ]
Yang, Yuxuan [2 ]
Wang, Xiaojie [1 ]
Song, Qingyang [1 ]
Guo, Lei [1 ]
Jamalipour, Abbas [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2050, Australia
关键词
Autonomous aerial vehicles; Servers; Computational efficiency; Task analysis; Trajectory optimization; Resource management; Costs; Multi-access edge computing; UAV-assisted communications; game theory; multi-agent DRL; RESOURCE-ALLOCATION; TASK;
D O I
10.1109/TMC.2023.3312276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Driven by the increasing computational demand of real-time mobile applications, Unmanned Aerial Vehicle (UAV) assisted Multi-access Edge Computing (MEC) has been envisioned as a promising paradigm for pushing computational resources to network edges and constructing high-throughput line-of-sight links for ground users. Most exsiting studies consider simplified scenarios, such as a single UAV, Service Provider (SP) or service type, and centralized UAV trajectory control. In order to be more in line with real-world cases, we intend to achieve distributed trajectory control of multiple UAVs in UAV-assisted MEC networks with multiple SPs providing differentiated services. Our objective is to minimize the short-term computational costs of ground users and the long-term computational cost of UAVs, simultaneously based on incomplete information. We first solve the formulated problem by reaching the Nash Equilibrium (NE) of the game among SPs based on complete information. We further formulate a Markov game model and propose a Deep Reinforcement Learning (DRL)-based UAV trajectory optimization algorithm, where only local observations of each UAV are required for each SP's flying action execution. Theoretical analysis and performance evaluation demonstrate the convergence, efficiency, scalability, and robustness of our algorithm compared with other representative algorithms.
引用
收藏
页码:5818 / 5834
页数:17
相关论文
共 50 条
  • [1] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [2] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
    Wang, Zhi
    Liu, Fan
    Guo, Jing
    Hong, Chen
    Chen, Ming
    Wang, Ershen
    Zhao, Yunbo
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
  • [3] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    [J]. IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [4] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
    Zhang, Yu
    Zhuang, Zirui
    Gao, Feifei
    Wang, Jingyu
    Han, Zhu
    [J]. 2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [5] Dynamic UAV Deployment for Differentiated Services: A Multi-Agent Imitation Learning Based Approach
    Wang, Xiaojie
    Ning, Zhaolong
    Guo, Song
    Wen, Miaowen
    Guo, Lei
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (04) : 2131 - 2146
  • [6] Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 73 - 84
  • [7] Multi-UAV Redeployment Optimization Based on Multi-Agent Deep Reinforcement Learning Oriented to Swarm Performance Restoration
    Wu, Qilong
    Geng, Zitao
    Ren, Yi
    Feng, Qiang
    Zhong, Jilong
    [J]. SENSORS, 2023, 23 (23)
  • [8] Computing Over the Sky: Joint UAV Trajectory and Task Offloading Scheme Based on Optimization-Embedding Multi-Agent Deep Reinforcement Learning
    Li, Xuanheng
    Du, Xinyang
    Zhao, Nan
    Wang, Xianbin
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1355 - 1369
  • [9] Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System
    Lu, Gengyuan
    Chang, Zheng
    [J]. 2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [10] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Zhou, Xuanhan
    Xiong, Jun
    Zhao, Haitao
    Liu, Xiaoran
    Ren, Baoquan
    Zhang, Xiaochen
    Wei, Jibo
    Yin, Hao
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)