Trusted Collaboration for MEC-Enabled VR Video Streaming: A Multi-Agent Reinforcement Learning Approach

被引:2
|
作者
Xu, Yueqiang [1 ]
Zhang, Heli [1 ]
Li, Xi [1 ]
Yu, F. Richard [2 ]
Leung, Victor C. M. [3 ,4 ]
Ji, Hong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China
[2] Shenzhen Univ, Shenzhen Key Lab Digital & Intelligent Technol &, Shenzhen 518060, Peoples R China
[3] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[4] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
基金
中国国家自然科学基金;
关键词
Trust evaluation; wireless virtual reality; edge collaboration; multi-agent DDPG; WIRELESS NETWORKS; RESOURCE-ALLOCATION; EDGE; AWARE; OPTIMIZATION; PREDICTION;
D O I
10.1109/TVT.2023.3267181
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Collaboration among mobile edge computing (MEC) has been envisioned as a promising paradigm to meet the requirements of wireless virtual reality (VR) applications. However, trust risks create tremendous challenges in MEC collaboration due to the distributed, complex, and unreliable nature of resource providers. In this paper, we present a trusted collaboration framework for VR video streaming to manage the video buffer in VR devices (VDs) under a more realistic distributed environment. In the framework, the rendering tasks can be processed collaboratively among edge servers (ESs) by exploring their behaviors (e.g., selfish behavior, malicious behavior, and cooperative behavior). Considering the collaborator may not be fully trustworthy, we present a novel trust evaluation method by combining direct and indirect values, aiming to ensure reliable collaborator selection. Then, we formulate an optimization problem to maintain an effective buffer state in VR devices (VDs) through jointly optimizing collaborator selection, spectrum allocation, and rendering resource allocation. Due to the fluctuating wireless fading channel and the dynamic video rate, the optimization problem is intractable by adopting traditional methods. Then, we adopt the multi-agent deep deterministic policy gradient (MADDPG) to tackle this dynamic and distributed problem. Simulation results indicate that the proposed approach can achieve a good performance.
引用
收藏
页码:12167 / 12180
页数:14
相关论文
共 50 条
  • [41] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
    He, Hang
    Ren, Tao
    Qiu, Yuan
    Hu, Zheyuan
    Li, Yanqi
    SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
  • [42] Computation Offloading and Resource Allocation in MEC-Enabled Integrated Aerial-Terrestrial Vehicular Networks: A Reinforcement Learning Approach
    Waqar, Noor
    Hassan, Syed Ali
    Mahmood, Aamir
    Dev, Kapal
    Dinh-Thuan Do
    Gidlund, Mikael
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21478 - 21491
  • [43] Learning without Gradients: Multi-Agent Reinforcement Learning approach to optimization
    Morcos, Amir
    Man, Hong
    West, Aaron
    Maguire, Brian
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS IV, 2022, 12276
  • [44] Learning-Based Online QoE Optimization in Multi-Agent Video Streaming
    Wang, Yimeng
    Agarwal, Mridul
    Lan, Tian
    Aggarwal, Vaneet
    ALGORITHMS, 2022, 15 (07)
  • [45] UAV-Enabled Secure Communications by Multi-Agent Deep Reinforcement Learning
    Zhang, Yu
    Mou, Zhiyu
    Gao, Feifei
    Jiang, Jing
    Ding, Ruijin
    Han, Zhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 11599 - 11611
  • [46] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [47] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
    Koratamaddi, Prahlad
    Wadhwani, Karan
    Gupta, Mridul
    Sanjeevi, Sriram G.
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410
  • [48] Multi-agent reinforcement learning approach for hedging portfolio problem
    Pham, Uyen
    Luu, Quoc
    Tran, Hien
    SOFT COMPUTING, 2021, 25 (12) : 7877 - 7885
  • [49] A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
    Corcoran, Diarmuid
    Kreuger, Per
    Boman, Magnus
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 338 - 344
  • [50] A multi-agent reinforcement learning approach to dynamic service composition
    Wang, Hongbing
    Wang, Xiaojun
    Hu, Xingguo
    Zhang, Xingzhi
    Gu, Mingzhu
    INFORMATION SCIENCES, 2016, 363 : 96 - 119