Bandwidth Allocation and Trajectory Control in UAV-Assisted IoV Edge Computing Using Multiagent Reinforcement Learning

Cited by: 9

Authors:

Wang, Juzhen ^{[1]}

Zhang, Xiaoli ^{[2]}

He, Xingshi ^{[3]}

Sun, Yongqiang ^{[4]}

Affiliations:

[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[3] ZTE, Shenzhen 518000, Peoples R China

[4] Nexperia, Shenzhen 518057, Peoples R China

Source:

IEEE TRANSACTIONS ON RELIABILITY | 2023 / Vol. 72 / No. 2

Keywords:

Attention mechanism; bandwidth assignment; location deployment; multiagent deep reinforcement learning (DRL); value decomposition network (VDN); EFFICIENT DEPLOYMENT; COMMUNICATION; MAXIMIZATION;

DOI:

10.1109/TR.2022.3192020

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The rapid development of unmanned aerial vehicles (UAVs) has brought new opportunities for wireless communication and edge computing. In this article, we investigate the scenario where multiple UAVs serve as edge computing devices for the Internet of Vehicles (IoV). Setting aside the allocation of computing resources, we focus on bandwidth allocation and trajectory control to maximize the communication capacity of the system so that the UAV edge computing network can process more data. With this intent, a UAV-assisted IoV edge computing system model is constructed as a nonconvex optimization problem, aiming to maximize the achievable channel capacity of the network. To solve this problem, two "quasi-distributed" multiagent algorithms, i.e., actor-critic mixing network (AC-Mix) and multi-attentive agent deep deterministic policy gradient (MA2DDPG), are proposed based on deep deterministic policy gradient. Specifically, AC-Mix utilizes a mixing network to obtain a global Q-value for better evaluation of the joint action, while MA2DDPG employs a multihead attention mechanism to achieve multiagent collaboration. Using multi-agent deep deterministic policy gradient (MADDPG) as the benchmark, several experiments are carried out to verify the performance of the proposed algorithms. Simulation results show that the convergence speed of AC-Mix and MA2DDPG is improved by 30.0% and 63.3%, respectively, compared with MADDPG.
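
The objective named in the abstract, maximizing achievable channel capacity over bandwidth allocation and UAV positions, can be illustrated with a minimal sketch. This is not the paper's system model: the free-space line-of-sight channel, the parameter values (`tx_power_w`, `ref_gain`, `noise_psd`), and the function names `link_capacity` and `sum_capacity` are all assumptions made for illustration only.

```python
import math


def link_capacity(uav_pos, veh_pos, bandwidth_hz,
                  tx_power_w=0.1, ref_gain=1e-4, noise_psd=4e-21):
    """Shannon capacity (bit/s) of one vehicle-to-UAV link.

    Assumed model (hypothetical, not taken from the paper):
    channel gain = ref_gain / distance^2, noise power = noise_psd * bandwidth.
    uav_pos is (x, y, altitude) in metres; veh_pos is (x, y) on the ground.
    """
    if bandwidth_hz <= 0:
        return 0.0  # a link with no bandwidth carries no data
    dx = uav_pos[0] - veh_pos[0]
    dy = uav_pos[1] - veh_pos[1]
    dist_sq = dx * dx + dy * dy + uav_pos[2] ** 2
    snr = tx_power_w * ref_gain / dist_sq / (noise_psd * bandwidth_hz)
    return bandwidth_hz * math.log2(1.0 + snr)


def sum_capacity(uav_positions, veh_positions, assoc, bandwidths_hz):
    """Total capacity of the network.

    assoc[v] is the index of the UAV serving vehicle v;
    bandwidths_hz[v] is the bandwidth assigned to vehicle v.
    """
    return sum(
        link_capacity(uav_positions[assoc[v]], veh_positions[v], bandwidths_hz[v])
        for v in range(len(veh_positions))
    )


if __name__ == "__main__":
    uavs = [(0.0, 0.0, 100.0), (400.0, 0.0, 100.0)]
    vehicles = [(50.0, 0.0), (350.0, 20.0), (200.0, 0.0)]
    assoc = [0, 1, 0]
    # Two candidate splits of a 10 MHz budget among the three vehicles.
    equal_split = [10e6 / 3] * 3
    skewed_split = [2e6, 2e6, 6e6]
    print(sum_capacity(uavs, vehicles, assoc, equal_split))
    print(sum_capacity(uavs, vehicles, assoc, skewed_split))
```

In a multiagent reinforcement learning setting, a quantity of this kind would typically serve as the shared reward, with each UAV agent choosing its own movement and bandwidth split; how the paper defines the reward is not stated in the abstract.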


Pages: 599-608

Number of pages: 10

50 items in total

[1] Computation Offloading and Trajectory Control for UAV-Assisted Edge Computing Using Deep Reinforcement Learning
Qi, Huamei
Zhou, Zheng
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (24)
[2] Task and Bandwidth Allocation for UAV-Assisted Mobile Edge Computing with Trajectory Design
Hu, Xiaoyan
Wong, Kai-Kit
Yang, Kun
Zheng, Zhongbin
[J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019
[3] Task Offloading and Trajectory Control for UAV-Assisted Mobile Edge Computing Using Deep Reinforcement Learning
Zhang, Lu
Zhang, Zi-Yan
Min, Luo
Tang, Chao
Zhang, Hong-Ying
Wang, Ya-Hong
Cai, Peng
[J]. IEEE ACCESS, 2021, 9 : 53708 - 53719
[4] Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-Assisted Mobile Edge Computing
Wang, Liang
Wang, Kezhi
Pan, Cunhua
Xu, Wei
Aslam, Nauman
Nallanathan, Arumugam
[J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (10) : 3536 - 3550
[5] Deep Reinforcement Learning Driven UAV-Assisted Edge Computing
Zhang, Liang
Jabbari, Bijan
Ansari, Nirwan
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25449 - 25459
[6] Fair Data Allocation and Trajectory Optimization for UAV-Assisted Mobile Edge Computing
Diao, Xianbang
Zheng, Jianchao
Cai, Yueming
Wu, Yuan
Anpalagan, Alagan
[J]. IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) : 2357 - 2361
[7] UAV-Assisted Mobile Edge Computing: Dynamic Trajectory Design and Resource Allocation
Wang, Zhuwei
Zhao, Wenjing
Hu, Pengyu
Zhang, Xige
Liu, Lihan
Fang, Chao
Sun, Yanhua
[J]. SENSORS, 2024, 24 (12)
[8] Caching on the Sky: A Multiagent Federated Reinforcement Learning Approach for UAV-Assisted Edge Caching
Li, Xuanheng
Liu, Jiahong
Chen, Xianhao
Wang, Jie
Pan, Miao
[J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (17) : 28213 - 28226
[9] Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing
Song, Fuhong
Xing, Huanlai
Wang, Xinhan
Luo, Shouxi
Dai, Penglin
Xiao, Zhiwen
Zhao, Bowen
[J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (12) : 7387 - 7405
[10] Joint Resource Allocation and Trajectory Design for UAV-assisted Mobile Edge Computing Systems
Ji, Jiequ
Zhu, Kun
Yi, Changyan
Wang, Ran
Niyato, Dusit
[J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020
