Distributed Deep Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet-of-Vehicles

被引:29
|
作者
Zhou, Huan [1 ,2 ]
Jiang, Kai [3 ]
He, Shibo [4 ]
Min, Geyong [5 ]
Wu, Jie [6 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China
[3] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430000, Peoples R China
[4] Zhejiang Univ, Coll Control Sci & Technol, Hangzhou 310027, Peoples R China
[5] Univ Exeter, Coll Engn Math & Phys Sci, Dept Comp Sci, Exeter EX4 4QF, England
[6] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
基金
中国国家自然科学基金;
关键词
Computer architecture; Delays; Costs; Backhaul networks; Reinforcement learning; Quality of service; Optimization; Edge caching; Internet-of-Vehicles; content delivery; cache replacement; multi-agent reinforcement learning;
D O I
10.1109/TWC.2023.3272348
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge caching is a promising approach to reduce duplicate content transmission in Internet-of-Vehicles (IoVs). Several Reinforcement Learning (RL) based edge caching methods have been proposed to improve the resource utilization and reduce the backhaul traffic load. However, they only obtain the local sub-optimal solution, as they neglect the influence from environments by other agents. This paper investigates the edge caching strategies with consideration of the content delivery and cache replacement by exploiting the distributed Multi-Agent Reinforcement Learning (MARL). A hierarchical edge caching architecture for IoVs is proposed and the corresponding problem is formulated with the goal to minimize the long-term content access cost in the system. Then, we extend the Markov Decision Process (MDP) in the single agent RL to the context of a multi-agent system, and tackle the corresponding combinatorial multi-armed bandit problem based on the framework of a stochastic game. Specifically, we firstly propose a Distributed MARL-based Edge caching method (DMRE), where each agent can adaptively learn its best behaviour in conjunction with other agents for intelligent caching. Meanwhile, we attempt to reduce the computation complexity of DMRE by parameter approximation, which legitimately simplifies the training targets. However, DMRE is enabled to represent and update the parameter by creating a lookup table, essentially a tabular-based method, which generally performs inefficiently in large-scale scenarios. To circumvent the issue and make more expressive parametric models, we incorporate the technical advantage of the Deep- $Q$ Network into DMRE, and further develop a computationally efficient method (DeepDMRE) with neural network-based Nash equilibria approximation. Extensive simulations are conducted to verify the effectiveness of the proposed methods. Especially, DeepDMRE outperforms DMRE, $Q$ -learning, LFU, and LRU, and the edge hit rate is improved by roughly 5%, 19%, 40%, and 35%, respectively, when the cache capacity reaches 1, 000 MB.
引用
收藏
页码:9595 / 9609
页数:15
相关论文
共 50 条
  • [41] Survey of Fully Cooperative Multi-Agent Deep Reinforcement Learning
    Zhao, Liyang
    Chang, Tianqing
    Chu, Kaixuan
    Guo, Libin
    Zhang, Lei
    [J]. Computer Engineering and Applications, 2023, 59 (12) : 14 - 27
  • [42] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [43] Multi-Agent Reinforcement Learning for Cooperative Coded Caching via Homotopy Optimization
    Wu, Xiongwei
    Li, Jun
    Xiao, Ming
    Ching, P. C.
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (08) : 5258 - 5272
  • [44] Multi-Agent Deep Reinforcement Learning-Based Cooperative Edge Caching for Ultra-Dense Next-Generation Networks
    Chen, Shuangwu
    Yao, Zhen
    Jiang, Xiaofeng
    Yang, Jian
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2441 - 2456
  • [45] UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning
    Araf, Sadman
    Saha, Adittya Soukarjya
    Kazi, Sadia Hamid
    Tran, Nguyen H. H.
    Alam, Md. Golam Rabiul
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2322 - 2337
  • [46] Spatial-Temporal Graph Attention-based Multi-Agent Reinforcement Learning in Cooperative Edge Caching
    Hou, Jiacheng
    Nayak, Amiya
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3078 - 3083
  • [47] Cooperative Task Offloading for Mobile Edge Computing Based on Multi-Agent Deep Reinforcement Learning
    Yang, Jian
    Yuan, Qifeng
    Chen, Shuangwu
    He, Huasen
    Jiang, Xiaofeng
    Tan, Xiaobin
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (03): : 3205 - 3219
  • [48] Allocation Algorithm of Distributed Cooperative Jamming Power Based on Multi-Agent Deep Reinforcement Learning
    Rao, Ning
    Xu, Hua
    Jiang, Lei
    Song, Bai-Lin
    Shi, Yun-Hao
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (06): : 1319 - 1330
  • [49] Multi-Agent Deep Reinforcement Learning for Distributed Load Restoration
    Linh Vu
    Tuyen Vu
    Thanh Long Vu
    Srivastava, Anurag
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1749 - 1760
  • [50] Multi-agent deep reinforcement learning strategy for distributed energy
    Xi, Lei
    Sun, Mengmeng
    Zhou, Huan
    Xu, Yanchun
    Wu, Junnan
    Li, Yanying
    [J]. MEASUREMENT, 2021, 185