Multi-Agent Federated Reinforcement Learning Strategy for Mobile Virtual Reality Delivery Networks

Cited by: 5

Authors:

Liu, Zhikai [1]

Garg, Navneet [2]

Ratnarajah, Tharmalingam [1]

Affiliations:

[1] Univ Edinburgh, Edinburgh EH8 9YL, Scotland

[2] Univ Edinburgh, Inst Digital Commun, Edinburgh EH8 9YL, Scotland

Source:

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2024 / Volume 11 / Issue 01

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Servers; Delays; Three-dimensional displays; Energy consumption; Edge computing; Task analysis; Base stations; 3C strategy; multi-agent reinforcement learning; federated learning; VR delivery; massive MIMO; RESOURCE-ALLOCATION; JOINT OPTIMIZATION; EDGE; COMPUTATION; COMMUNICATION; MANAGEMENT; PLACEMENT; SYSTEMS; CACHE;

DOI:

10.1109/TNSE.2023.3292570

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Subject classification code:

08;

Abstract:

Virtual reality (VR) services have become increasingly popular but present challenges for wireless communications because of their large data requirements. In this work, we consider a dynamically changing VR scenario and propose a joint caching, computing, and communication (3C) strategy, subject to bounded latency, power, caching, and computing constraints, to minimize long-term discounted delay and energy consumption for VR projection. Our approach involves a three-layer communication system consisting of a cloud server, UAV (Unmanned Aerial Vehicle) base stations with mMIMO (massive Multiple-Input Multiple-Output) acting as edge servers, and mobile user devices. To satisfy different users' requirements, we design eight service routes for 3C decisions. We then employ federated multi-agent deep reinforcement learning (RL) to help users obtain optimal service routes influenced by their location, orientation, and content preference, with edge servers acting as learning agents. For the RL part, we design multi-input, multi-output actor and critic networks deployed on edge servers. For the Federated Learning (FL) part, we present the federated averaging process and mathematically prove its convergence. Simulation results demonstrate that our proposed algorithm can effectively reduce training loss, converge smoothly, and reduce delay and energy consumption by approximately 17.2% and 23.5%, respectively.
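The abstract mentions a federated averaging step across edge-server agents but does not give the aggregation rule. As a rough illustration only, the sketch below shows generic weighted federated averaging over per-agent parameter vectors. The function name `federated_average`, the dictionary layout, and the use of per-agent weights (for example, local sample counts) are all assumptions made for this example and are not taken from the paper.

```python
from typing import Dict, List

# Hypothetical layout: each agent holds named, flattened parameter vectors.
Params = Dict[str, List[float]]


def federated_average(agent_params: List[Params], weights: List[float]) -> Params:
    """Return the weighted element-wise average of the agents' parameters.

    Generic FedAvg-style aggregation; the paper's exact rule may differ.
    """
    if not agent_params or len(agent_params) != len(weights):
        raise ValueError("need one weight per agent and at least one agent")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    averaged: Params = {}
    for name in agent_params[0]:
        length = len(agent_params[0][name])
        averaged[name] = [
            sum(w * p[name][i] for p, w in zip(agent_params, weights)) / total
            for i in range(length)
        ]
    return averaged


# Two illustrative edge-server agents with made-up actor parameters.
agents = [{"actor": [1.0, 2.0]}, {"actor": [3.0, 4.0]}]
global_params = federated_average(agents, weights=[1.0, 3.0])
print(global_params)  # {'actor': [2.5, 3.5]}
```

In a federated multi-agent setup of the kind the abstract describes, each edge server would presumably train its actor and critic networks locally, send the parameters for aggregation, and receive the averaged result back before the next round. That loop is omitted here.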


Pages: 100 - 114

Page count: 15
