Attention Mechanism-Aided Deep Reinforcement Learning for Dynamic Edge Caching

Cited by: 3

Authors:

Teng, Ziyi [1]

Fang, Juan [1]

Yang, Huijing [1]

Yu, Lu [2]

Chen, Huijie [1]

Xiang, Wei [3]

Affiliations:

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] James Cook Univ, Dept Elect & Comp Engn, Cairns, Qld 4878, Australia

[3] La Trobe Univ, Sch Comp Engn & Math Sci, Melbourne, Vic 3086, Australia

Source:

IEEE INTERNET OF THINGS JOURNAL | 2024 / Vol. 11 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Servers; Wireless communication; Optimization; Load modeling; Resource management; Internet of Things; Telecommunication traffic; Attention-weighted channel assignment; deep reinforcement learning; edge caching; wireless network; USER ASSOCIATION; PLACEMENT;

DOI:

10.1109/JIOT.2023.3327656

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Joint proactive caching and cache replacement, a dynamic mechanism that places content items close to cache-enabled edge devices before they are requested, is a promising technique for enhancing traffic offloading and relieving heavy network loads. However, because edge cache capacity and wireless transmission resources are limited, accurately predicting users' future requests and caching dynamically is crucial to using these resources effectively. This article investigates joint proactive caching and cache replacement strategies in a general mobile-edge computing (MEC) network with multiple users under a cloud-edge-device collaboration architecture. The joint optimization problem is formulated as an infinite-horizon Markov decision process (MDP) with an average network load cost, aiming to reduce network traffic load while efficiently using the limited transmission resources. To solve this problem, we design an attention-weighted deep deterministic policy gradient (AWD2PG) model, which uses attention weights to allocate the number of channels from server to user and applies deep deterministic policies on both the user and server sides for cache decision-making, thereby reducing network traffic load and improving network and cache resource utilization. We verify the convergence of the corresponding algorithms and demonstrate the effectiveness of the proposed AWD2PG strategy relative to benchmarks in reducing network load and improving hit rate.
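
The abstract states that attention weights decide how many channels the server assigns to each user, but it gives no formula. The following is only a minimal sketch of one plausible reading, assuming the weights come from a softmax over per-user scores and that integer channel counts are obtained by largest-remainder rounding. The function names, the score inputs, and the rounding rule are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def allocate_channels(scores, num_channels):
    """Split num_channels among users in proportion to attention weights.

    Hypothetical helper (an assumption, not the AWD2PG rule): the weights
    are a softmax of the per-user scores, and largest-remainder rounding
    makes the integer counts sum exactly to num_channels.
    """
    weights = softmax(scores)
    shares = [w * num_channels for w in weights]
    counts = [math.floor(s) for s in shares]
    leftover = max(0, num_channels - sum(counts))
    # Give the remaining channels to the users with the largest fractional parts.
    order = sorted(
        range(len(shares)),
        key=lambda i: shares[i] - counts[i],
        reverse=True,
    )
    for i in order[:leftover]:
        counts[i] += 1
    return counts


if __name__ == "__main__":
    # Three users with illustrative scores sharing 10 channels.
    print(allocate_channels([2.0, 1.0, 0.1], 10))
```

In a learned model the scores would presumably be produced by the network from the observed state rather than fixed by hand; the rounding step is only there so that a whole number of channels is assigned to each user.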


Pages: 10197 - 10213

Number of pages: 17


[1] Dynamic Content Update for Wireless Edge Caching via Deep Reinforcement Learning
Wu, Pingyang
Li, Jun
Shi, Long
Ding, Ming
Cai, Kui
Yang, Fuli
IEEE COMMUNICATIONS LETTERS, 2019, 23 (10) : 1773 - 1777
[2] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
Li, XinRan
Wei, ZhenChun
Lyu, ZengWei
Yuan, XiaoHui
Xu, Juan
Zhang, ZeYu
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
[3] Caching in Dynamic IoT Networks by Deep Reinforcement Learning
Yao, Jingjing
Ansari, Nirwan
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3268 - 3275
[4] Deep Reinforcement Learning for Cooperative Edge Caching in Vehicular Networks
Xing, Yuping
Sun, Yanhua
Qiao, Lan
Wang, Zhuwei
Si, Pengbo
Zhang, Yanhua
2021 13TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2021), 2021, : 144 - 149
[5] A Survey on Reinforcement Learning-Aided Caching in Heterogeneous Mobile Edge Networks
Nomikos, Nikolaos
Zoupanos, Spyros
Charalambous, Themistoklis
Krikidis, Ioannis
IEEE ACCESS, 2022, 10 : 4380 - 4413
[6] Collaborative Caching in Edge Computing via Federated Learning and Deep Reinforcement Learning
Wang, Yali
Chen, Jiachao
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
[7] Partially Collaborative Edge Caching Based on Federated Deep Reinforcement Learning
Lei, Meng
Li, Qiang
Ge, Xiaohu
Pandharipande, Ashish
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 1389 - 1394
[8] Intelligent edge content caching: A deep recurrent reinforcement learning method
Xu, Haitao
Sun, Yuejun
Gao, Jingnan
Guo, Jianbo
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 : 2619 - 2632
[9] Edge Caching for IoT Transient Data Using Deep Reinforcement Learning
Sheng, Shuran
Chen, Peng
Chen, Zhimin
Wu, Lenan
Jiang, Hao
IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 4477 - 4482
[10] Collaborative Edge Computing and Caching With Deep Reinforcement Learning Decision Agents
Ren, Jianji
Wang, Haichao
Hou, Tingting
Zheng, Shuai
Tang, Chaosheng
IEEE ACCESS, 2020, 8 : 120604 - 120612
