Bandwidth Allocation and Trajectory Control in UAV-Assisted IoV Edge Computing Using Multiagent Reinforcement Learning

被引:9
|
作者
Wang, Juzhen [1 ]
Zhang, Xiaoli [2 ]
He, Xingshi [3 ]
Sun, Yongqiang [4 ]
机构
[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] ZTE, Shenzhen 518000, Peoples R China
[4] Nexperia, Shenzhen 518057, Peoples R China
关键词
Attention mechanism; bandwidth assignment; location deployment; multiagent deep reinforcement learning (DRL); value decomposition network (VDN); EFFICIENT DEPLOYMENT; COMMUNICATION; MAXIMIZATION;
D O I
10.1109/TR.2022.3192020
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of an unmanned aerial vehicle (UAV) has brought new opportunities for wireless communication and edge computing. In this article, we investigate the scenario where multiple UAVs serve as edge computing devices for the Internet of Vehicles (IoV). Regardless of the allocation of computing resources, we focus on bandwidth allocation and trajectory control to maximize the communication capacity of the system so that the UAV edge computing network can process more data. With this intent, a UAV-assisted IoV edge computing system model is constructed as a nonconvex optimization problem, aiming to maximize the achievable channel capacity of the network. To solve this problem, two "quasi-distributed" multiagent algorithms, i.e., actor-critic mixing network (AC-Mix) and multi-attentive agent deep deterministic policy gradient (MA2DDPG), are proposed based on deep deterministic policy gradient. Specifically, AC-Mix utilizes a mixing network to obtain a global Q-value for better evaluation of joint action, while MA2DDPG employs a multihead attention mechanism to achieve multiagent collaboration. Using multi-agents deep deterministic policy gradient (MADDPG) as benchmark, several experiments are carried out to verify the performance of the proposed algorithms. Simulation results show that the convergence velocity of AC-Mix and MA2DDPG is improved by 30.0% and 63.3%, respectively, compared with MADDPG.
引用
收藏
页码:599 / 608
页数:10
相关论文
共 50 条
  • [1] Computation Offloading and Trajectory Control for UAV-Assisted Edge Computing Using Deep Reinforcement Learning
    Qi, Huamei
    Zhou, Zheng
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [2] Task and Bandwidth Allocation for UAV-Assisted Mobile Edge Computing with Trajectory Design
    Hu, Xiaoyan
    Wong, Kai-Kit
    Yang, Kun
    Zheng, Zhongbin
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [3] Task Offloading and Trajectory Control for UAV-Assisted Mobile Edge Computing Using Deep Reinforcement Learning
    Zhang, Lu
    Zhang, Zi-Yan
    Min, Luo
    Tang, Chao
    Zhang, Hong-Ying
    Wang, Ya-Hong
    Cai, Peng
    [J]. IEEE ACCESS, 2021, 9 : 53708 - 53719
  • [4] Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (10) : 3536 - 3550
  • [5] Deep Reinforcement Learning Driven UAV-Assisted Edge Computing
    Zhang, Liang
    Jabbari, Bijan
    Ansari, Nirwan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25449 - 25459
  • [6] Fair Data Allocation and Trajectory Optimization for UAV-Assisted Mobile Edge Computing
    Diao, Xianbang
    Zheng, Jianchao
    Cai, Yueming
    Wu, Yuan
    Anpalagan, Alagan
    [J]. IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) : 2357 - 2361
  • [7] UAV-Assisted Mobile Edge Computing: Dynamic Trajectory Design and Resource Allocation
    Wang, Zhuwei
    Zhao, Wenjing
    Hu, Pengyu
    Zhang, Xige
    Liu, Lihan
    Fang, Chao
    Sun, Yanhua
    [J]. SENSORS, 2024, 24 (12)
  • [8] Caching on the Sky: A Multiagent Federated Reinforcement Learning Approach for UAV-Assisted Edge Caching
    Li, Xuanheng
    Liu, Jiahong
    Chen, Xianhao
    Wang, Jie
    Pan, Miao
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (17): : 28213 - 28226
  • [9] Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing
    Song, Fuhong
    Xing, Huanlai
    Wang, Xinhan
    Luo, Shouxi
    Dai, Penglin
    Xiao, Zhiwen
    Zhao, Bowen
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (12) : 7387 - 7405
  • [10] Joint Resource Allocation and Trajectory Design for UAV-assisted Mobile Edge Computing Systems
    Ji, Jiequ
    Zhu, Kun
    Yi, Changyan
    Wang, Ran
    Niyato, Dusit
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,