TTL-Based Cache Utility Maximization Using Deep Reinforcement Learning

被引：0

作者：

Cho, Chunglae ^{[1
]}

Shin, Seungjae ^{[1
]}

Jeon, Hongseok ^{[1
]}

Yoon, Seunghyun ^{[1
]}

机构：

[1] Elect & Telecommun Res Inst, Daejeon, South Korea

来源：

2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2021年

关键词：

caching; utility maximization; deep reinforcement learning; non-stationary traffic;

D O I：

10.1109/GLOBECOM46510.2021.9685845

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Utility-driven caching opened up a new design opportunity for caching algorithms by modeling the admission and eviction control as a utility maximization process with essential support for service differentiation. Nevertheless, there is still to go in terms of adaptability to changing environment. Slow convergence to an optimal state may degrade actual user-experienced utility, which gets even worse in non-stationary scenarios where cache control should be adaptive to time-varying content request traffic. This paper proposes to exploit deep reinforcement learning (DRL) to enhance the adaptability of utility-driven time-to-live (TTL)-based caching. Employing DRL with long short-term memory helps a caching agent learn how it adapts to the temporal correlation of content popularities to shorten the transient-state before the optimal steady-state. In addition, we elaborately design the state and action spaces of DRL to overcome the curse of dimensionality, which is one of the most frequently raised issues in machine learning-based approaches. Experimental results show that policies trained by DRL can outperform the conventional utility-driven caching algorithm under some non-stationary environments where content request traffic changes rapidly.

引用

页数：6

共 50 条

[41] Adaptive Action Selection Using Utility-based Reinforcement Learning
Chen, Kunrong
Lin, Fen
Tan, Qing
Shi, Zhongzhi
2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 67 - 72
[42] Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning
Shao, Zhifei
Er, Meng Joo
Huang, Guang-Bin
2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS & VISION (ICARCV), 2012, : 1591 - 1596
[43] A cache-aware congestion control mechanism using deep reinforcement learning for wireless sensor networks
Alipio, Melchizedek
Bures, Miroslav
AD HOC NETWORKS, 2025, 166
[44] Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics
Mallick, Prakash
Chen, Zhiyiong
Zamani, Mohsen
NEUROCOMPUTING, 2022, 484 : 79 - 88
[45] Cache Sharing in UAV-Enabled Cellular Network: A Deep Reinforcement Learning-Based Approach
Muslih, Hamidullah
Kazmi, S. M. Ahsan
Mazzara, Manuel
Baye, Gaspard
IEEE ACCESS, 2024, 12 : 43422 - 43435
[46] Vision-based Navigation Using Deep Reinforcement Learning
Kulhanek, Jonas
Derner, Erik
de Bruin, Tim
Babuska, Robert
2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
[47] INVERSE REINFORCEMENT LEARNING USING EXPECTATION MAXIMIZATION IN MIXTURE MODELS
Hahn, Juergen
Zoubir, Abdelhak M.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3721 - 3725
[48] Addressing Competitive Influence Maximization on Unknown Social Network with Deep Reinforcement Learning
Ali, Khurshed
Wang, Chih-Yu
Yeh, Mi-Yen
Chen, Yi-Shin
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, : 196 - 203
[49] ToupleGDD: A Fine-Designed Solution of Influence Maximization by Deep Reinforcement Learning
Chen, Tiantian
Yan, Siwen
Guo, Jianxiong
Wu, Weili
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2210 - 2221
[50] Cooperative cache update using multi-agent recurrent deep reinforcement learning for mobile edge networks
Somesula, Manoj Kumar
Rout, Rashmi Ranjan
Somayajulu, D. V. L. N.
COMPUTER NETWORKS, 2022, 209

← 1 2 3 4 5 →