Attention Mechanism-Aided Deep Reinforcement Learning for Dynamic Edge Caching

被引:3
|
作者
Teng, Ziyi [1 ]
Fang, Juan [1 ]
Yang, Huijing [1 ]
Yu, Lu [2 ]
Chen, Huijie [1 ]
Xiang, Wei [3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] James Cook Univ, Dept Elect & Comp Engn, Cairns, Qld 4878, Australia
[3] La Trobe Univ, Sch Comp Engn & Math Sci, Melbourne, Vic 3086, Australia
基金
中国国家自然科学基金;
关键词
Servers; Wireless communication; Optimization; Load modeling; Resource management; Internet of Things; Telecommunication traffic; Attention-weighted channel assignment; deep reinforcement learning; edge caching; wireless network; USER ASSOCIATION; PLACEMENT;
D O I
10.1109/JIOT.2023.3327656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The dynamic mechanism of joint proactive caching and cache replacement, which involves placing content items close to cache-enabled edge devices ahead of time until they are requested, is a promising technique for enhancing traffic offloading and relieving heavy network loads. However, due to limited edge cache capacity and wireless transmission resources, accurately predicting users' future requests and performing dynamic caching is crucial to effectively utilizing these limited resources. This article investigates joint proactive caching and cache replacement strategies in a general mobile-edge computing (MEC) network with multiple users under a cloud-edge-device collaboration architecture. The joint optimization problem is formulated as a Markov decision process (MDP) problem with an infinite range of average network load costs, aiming to reduce network load traffic while efficiently utilizing the limited available transport resources. To address this issue, we design an attention-weighted deep deterministic policy gradient (AWD2PG) model, which uses attention weights to allocate the number of channels from server to user, and applies deep deterministic policies on both user and server sides for Cache decision-making, so as to achieve the purpose of reducing network traffic load and improving network and cache resource utilization. We verify the convergence of the corresponding algorithms and demonstrate the effectiveness of the proposed AWD2PG strategy and benchmark in reducing network load and improving hit rate.
引用
收藏
页码:10197 / 10213
页数:17
相关论文
共 50 条
  • [1] Dynamic Content Update for Wireless Edge Caching via Deep Reinforcement Learning
    Wu, Pingyang
    Li, Jun
    Shi, Long
    Ding, Ming
    Cai, Kui
    Yang, Fuli
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (10) : 1773 - 1777
  • [2] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
    Li, XinRan
    Wei, ZhenChun
    Lyu, ZengWei
    Yuan, XiaoHui
    Xu, Juan
    Zhang, ZeYu
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
  • [3] Caching in Dynamic IoT Networks by Deep Reinforcement Learning
    Yao, Jingjing
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3268 - 3275
  • [4] Deep Reinforcement Learning for Cooperative Edge Caching in Vehicular Networks
    Xing, Yuping
    Sun, Yanhua
    Qiao, Lan
    Wang, Zhuwei
    Si, Pengbo
    Zhang, Yanhua
    2021 13TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2021), 2021, : 144 - 149
  • [5] A Survey on Reinforcement Learning-Aided Caching in Heterogeneous Mobile Edge Networks
    Nomikos, Nikolaos
    Zoupanos, Spyros
    Charalambous, Themistoklis
    Krikidis, Ioannis
    IEEE ACCESS, 2022, 10 : 4380 - 4413
  • [6] Collaborative Caching in Edge Computing via Federated Learning and Deep Reinforcement Learning
    Wang, Yali
    Chen, Jiachao
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [7] Partially Collaborative Edge Caching Based on Federated Deep Reinforcement Learning
    Lei, Meng
    Li, Qiang
    Ge, Xiaohu
    Pandharipande, Ashish
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 1389 - 1394
  • [8] Intelligent edge content caching: A deep recurrent reinforcement learning method
    Haitao Xu
    Yuejun Sun
    Jingnan Gao
    Jianbo Guo
    Peer-to-Peer Networking and Applications, 2022, 15 : 2619 - 2632
  • [9] Edge Caching for IoT Transient Data Using Deep Reinforcement Learning
    Sheng, Shuran
    Chen, Peng
    Chen, Zhimin
    Wu, Lenan
    Jiang, Hao
    IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 4477 - 4482
  • [10] Collaborative Edge Computing and Caching With Deep Reinforcement Learning Decision Agents
    Ren, Jianji
    Wang, Haichao
    Hou, Tingting
    Zheng, Shuai
    Tang, Chaosheng
    IEEE ACCESS, 2020, 8 : 120604 - 120612