Multi-Agent Federated Reinforcement Learning Strategy for Mobile Virtual Reality Delivery Networks

被引:5
|
作者
Liu, Zhikai [1 ]
Garg, Navneet [2 ]
Ratnarajah, Tharmalingam [1 ]
机构
[1] Univ Edinburgh, Edinburgh EH8 9YL, Scotland
[2] Univ Edinburgh, Inst Digital Commun, Edinburgh EH8 9YL, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Servers; Delays; Three-dimensional displays; Energy consumption; Edge computing; Task analysis; Base stations; 3C strategy; multi-agent reinforcement learning; federated learning; VR delivery; massive MIMO; RESOURCE-ALLOCATION; JOINT OPTIMIZATION; EDGE; COMPUTATION; COMMUNICATION; MANAGEMENT; PLACEMENT; SYSTEMS; CACHE;
D O I
10.1109/TNSE.2023.3292570
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Virtual reality (VR) services have become increasingly popular but presented challenges for wireless communications due to the large amounts of data requirements. In this work, we consider a dynamic changing VR scenario and propose a joint caching, computing, and communication (3 C) strategy, subject to bounded latency, power, caching, and computing constraints, to minimize long-term discounted delay and energy consumption for VR projection. Our approach involves a three-layer communication system consisting of a cloud server, UAV (Unmanned Aerial Vehicle) base stations with mMIMO (massive Multiple-Input Multiple-Output) acting as edge servers, and mobile user devices. To satisfy different users' requirements, we design eight service routes for 3 C decisions. We then employ federated multi-agent deep reinforcement learning (RL) to help users obtain optimal service routes influenced by their location, orientation, and content preference, with edge servers acting as learning agents. For the RL part, we design multi-input and output actor and critic networks deployed on edge servers. For the Federated Learning (FL) part, we present the federated average process and mathematically prove its convergence. Simulation results demonstrate our proposed algorithm can effectively reduce training loss, converge smoothly, and significantly reduce both delay and energy consumption by approximately 17.2% and 23.5%, respectively.
引用
收藏
页码:100 / 114
页数:15
相关论文
共 50 条
  • [21] FedQMIX: Communication-efficient federated learning via multi-agent reinforcement learning
    Cao, Shaohua
    Zhang, Hanqing
    Wen, Tian
    Zhao, Hongwei
    Zheng, Quancheng
    Zhang, Weishan
    Zheng, Danyang
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (02):
  • [22] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [23] Multi-agent deep reinforcement learning strategy for distributed energy
    Xi, Lei
    Sun, Mengmeng
    Zhou, Huan
    Xu, Yanchun
    Wu, Junnan
    Li, Yanying
    MEASUREMENT, 2021, 185
  • [24] Federated Dynamic Spectrum Access through Multi-Agent Deep Reinforcement Learning
    Song, Yifei
    Chang, Hao-Hsuan
    Liu, Lingjia
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3466 - 3471
  • [25] The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning With Efficient Communication
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (01) : 507 - 528
  • [26] Dynamical Accessing Handoff by Using Multi-Agent Reinforcement Learning in Slice Based Mobile Networks
    Qin S.
    Zhao G.-Q.
    Feng G.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (02): : 162 - 168
  • [27] Multi-agent deep reinforcement learning for collaborative task offloading in mobile edge computing networks
    Chen, Minxuan
    Guo, Aihuang
    Song, Chunlin
    DIGITAL SIGNAL PROCESSING, 2023, 140
  • [28] Competitive Pricing for Resource Trading in Sliced Mobile Networks: A Multi-Agent Reinforcement Learning Approach
    Sun, Guolin
    Boateng, Gordon Owusu
    Luo, Liyuan
    Chen, Huan
    Mensah, Daniel Ayepah
    Liu, Guisong
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 3830 - 3845
  • [29] Cooperative caching algorithm for mobile edge networks based on multi-agent meta reinforcement learning
    Wei, Zhenchun
    Zhao, Yang
    Lyu, Zengwei
    Yuan, Xiaohui
    Zhang, Yu
    Feng, Lin
    COMPUTER NETWORKS, 2024, 242
  • [30] TraCo: Learning Virtual Traffic Coordinator for Cooperation with Multi-Agent Reinforcement Learning
    Liu, Weiwei
    Jing, Wei
    Gao, Lingping
    Guo, Ke
    Xu, Gang
    Liu, Yong
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229