TTL-Based Cache Utility Maximization Using Deep Reinforcement Learning

被引:0
|
作者
Cho, Chunglae [1 ]
Shin, Seungjae [1 ]
Jeon, Hongseok [1 ]
Yoon, Seunghyun [1 ]
机构
[1] Elect & Telecommun Res Inst, Daejeon, South Korea
关键词
caching; utility maximization; deep reinforcement learning; non-stationary traffic;
D O I
10.1109/GLOBECOM46510.2021.9685845
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Utility-driven caching opened up a new design opportunity for caching algorithms by modeling the admission and eviction control as a utility maximization process with essential support for service differentiation. Nevertheless, there is still to go in terms of adaptability to changing environment. Slow convergence to an optimal state may degrade actual user-experienced utility, which gets even worse in non-stationary scenarios where cache control should be adaptive to time-varying content request traffic. This paper proposes to exploit deep reinforcement learning (DRL) to enhance the adaptability of utility-driven time-to-live (TTL)-based caching. Employing DRL with long short-term memory helps a caching agent learn how it adapts to the temporal correlation of content popularities to shorten the transient-state before the optimal steady-state. In addition, we elaborately design the state and action spaces of DRL to overcome the curse of dimensionality, which is one of the most frequently raised issues in machine learning-based approaches. Experimental results show that policies trained by DRL can outperform the conventional utility-driven caching algorithm under some non-stationary environments where content request traffic changes rapidly.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Adaptive Action Selection Using Utility-based Reinforcement Learning
    Chen, Kunrong
    Lin, Fen
    Tan, Qing
    Shi, Zhongzhi
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 67 - 72
  • [42] Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning
    Shao, Zhifei
    Er, Meng Joo
    Huang, Guang-Bin
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS & VISION (ICARCV), 2012, : 1591 - 1596
  • [43] A cache-aware congestion control mechanism using deep reinforcement learning for wireless sensor networks
    Alipio, Melchizedek
    Bures, Miroslav
    AD HOC NETWORKS, 2025, 166
  • [44] Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics
    Mallick, Prakash
    Chen, Zhiyiong
    Zamani, Mohsen
    NEUROCOMPUTING, 2022, 484 : 79 - 88
  • [45] Cache Sharing in UAV-Enabled Cellular Network: A Deep Reinforcement Learning-Based Approach
    Muslih, Hamidullah
    Kazmi, S. M. Ahsan
    Mazzara, Manuel
    Baye, Gaspard
    IEEE ACCESS, 2024, 12 : 43422 - 43435
  • [46] Vision-based Navigation Using Deep Reinforcement Learning
    Kulhanek, Jonas
    Derner, Erik
    de Bruin, Tim
    Babuska, Robert
    2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
  • [47] INVERSE REINFORCEMENT LEARNING USING EXPECTATION MAXIMIZATION IN MIXTURE MODELS
    Hahn, Juergen
    Zoubir, Abdelhak M.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3721 - 3725
  • [48] Addressing Competitive Influence Maximization on Unknown Social Network with Deep Reinforcement Learning
    Ali, Khurshed
    Wang, Chih-Yu
    Yeh, Mi-Yen
    Chen, Yi-Shin
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, : 196 - 203
  • [49] ToupleGDD: A Fine-Designed Solution of Influence Maximization by Deep Reinforcement Learning
    Chen, Tiantian
    Yan, Siwen
    Guo, Jianxiong
    Wu, Weili
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2210 - 2221
  • [50] Cooperative cache update using multi-agent recurrent deep reinforcement learning for mobile edge networks
    Somesula, Manoj Kumar
    Rout, Rashmi Ranjan
    Somayajulu, D. V. L. N.
    COMPUTER NETWORKS, 2022, 209