Distributed Deep Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet-of-Vehicles

被引：29

作者：

Zhou, Huan ^{[1
,2
]}

Jiang, Kai ^{[3
]}

He, Shibo ^{[4
]}

Min, Geyong ^{[5
]}

Wu, Jie ^{[6
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China

[2] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China

[3] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430000, Peoples R China

[4] Zhejiang Univ, Coll Control Sci & Technol, Hangzhou 310027, Peoples R China

[5] Univ Exeter, Coll Engn Math & Phys Sci, Dept Comp Sci, Exeter EX4 4QF, England

[6] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Computer architecture; Delays; Costs; Backhaul networks; Reinforcement learning; Quality of service; Optimization; Edge caching; Internet-of-Vehicles; content delivery; cache replacement; multi-agent reinforcement learning;

D O I：

10.1109/TWC.2023.3272348

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Edge caching is a promising approach to reduce duplicate content transmission in Internet-of-Vehicles (IoVs). Several Reinforcement Learning (RL) based edge caching methods have been proposed to improve the resource utilization and reduce the backhaul traffic load. However, they only obtain the local sub-optimal solution, as they neglect the influence from environments by other agents. This paper investigates the edge caching strategies with consideration of the content delivery and cache replacement by exploiting the distributed Multi-Agent Reinforcement Learning (MARL). A hierarchical edge caching architecture for IoVs is proposed and the corresponding problem is formulated with the goal to minimize the long-term content access cost in the system. Then, we extend the Markov Decision Process (MDP) in the single agent RL to the context of a multi-agent system, and tackle the corresponding combinatorial multi-armed bandit problem based on the framework of a stochastic game. Specifically, we firstly propose a Distributed MARL-based Edge caching method (DMRE), where each agent can adaptively learn its best behaviour in conjunction with other agents for intelligent caching. Meanwhile, we attempt to reduce the computation complexity of DMRE by parameter approximation, which legitimately simplifies the training targets. However, DMRE is enabled to represent and update the parameter by creating a lookup table, essentially a tabular-based method, which generally performs inefficiently in large-scale scenarios. To circumvent the issue and make more expressive parametric models, we incorporate the technical advantage of the Deep- $Q$ Network into DMRE, and further develop a computationally efficient method (DeepDMRE) with neural network-based Nash equilibria approximation. Extensive simulations are conducted to verify the effectiveness of the proposed methods. Especially, DeepDMRE outperforms DMRE, $Q$ -learning, LFU, and LRU, and the edge hit rate is improved by roughly 5%, 19%, 40%, and 35%, respectively, when the cache capacity reaches 1, 000 MB.

引用

页码：9595 / 9609

页数：15

共 50 条

[1] Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet of Vehicles
Jiang, Kai
Zhou, Huan
Zeng, Deze
Wu, Jie
[J]. 2020 IEEE 17TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2020), 2020, : 455 - 463
[2] Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Internet-of-Vehicles
Lei, Yuchen
Jiang, Kai
Wang, Zhenning
Cao, Yue
Lin, Hai
Chen, Liang
[J]. 2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[3] Deep Reinforcement Learning-based Edge Caching and Multi-link Cooperative Communication in Internet-of-Vehicles
Ma, Teng
Chen, Xin
Jiao, Libo
Chen, Ying
[J]. 2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 567 - 574
[4] Novel Edge Caching Approach Based on Multi-Agent Deep Reinforcement Learning for Internet of Vehicles
Zhang, Degan
Wang, Wenjing
Zhang, Jie
Zhang, Ting
Du, Jinyu
Yang, Chun
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8324 - 8338
[5] Multi-Agent Deep Reinforcement Learning for content caching within the Internet of Vehicles
Knari, Anas
Derfouf, Mostapha
Koulali, Mohammed-Amine
Khoumsi, Ahmed
[J]. Ad Hoc Networks, 2024, 152
[6] Multi-Agent Deep Reinforcement Learning for content caching within the Internet of Vehicles
Knari, Anas
Derfouf, Mostapha
Koulali, Mohammed-Amine
Khoumsi, Ahmed
[J]. AD HOC NETWORKS, 2024, 152
[7] Multi-Agent Deep Reinforcement Learning for Cooperative Edge Caching via Hybrid Communication
Wang, Fei
Emara, Salma
Kaplan, Isidor
Li, Baochun
Zeyl, Timothy
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1206 - 1211
[8] Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching in Wireless Networks
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
[J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[9] COOPERATIVE SCENARIOS FOR MULTI-AGENT REINFORCEMENT LEARNING IN WIRELESS EDGE CACHING
Garg, Navneet
Ratnarajah, Tharmalingam
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3435 - 3439
[10] Innovative edge caching: A multi-agent deep reinforcement learning approach for cooperative replacement strategies
Lyu, Zengwei
Zhang, Yu
Yuan, Xiaohui
Wei, Zhenchun
Fu, Yu
Feng, Lin
Zhou, Haodong
[J]. COMPUTER NETWORKS, 2024, 253

← 1 2 3 4 5 →