Caching in Dynamic Environments: A Near-Optimal Online Learning Approach

被引：7

作者：

Zhou, Shiji ^{[1
]}

Wang, Zhi ^{[2
,3
]}

Hu, Chenghao ^{[4
]}

Mao, Yinan ^{[5
]}

Yan, Haopeng ^{[6
]}

Zhang, Shanghang ^{[7
]}

Wu, Chuan ^{[8
]}

Zhu, Wenwu ^{[6
]}

机构：

[1] Tsinghua Berkeley Shenzhen Inst, Shenzhen 518000, Peoples R China

[2] Tsinghua Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China

[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[4] Univ Toronto, Elect & Comp Engn, Toronto, ON M4Y 1M7, Canada

[5] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China

[6] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100080, Peoples R China

[7] Univ Calif Berkeley, EECS, Berkeley, CA USA

[8] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

关键词：

Heuristic algorithms; Streaming media; Size measurement; Reinforcement learning; Proposals; Optimization; Area measurement; Dynamic environment; dynamic regret; online learning; POLICIES;

D O I：

10.1109/TMM.2021.3132156

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The rapid growth of rich multimedia data in today's Internet, especially video traffic, has challenged the content delivery networks (CDNs). Caching serves as an important means to reduce user access latency so as to enable faster content downloads. Motivated by the dynamic nature of the real-world edge traces, this paper introduces a provably well online caching policy in dynamic environments where: 1) the popularity is highly dynamic; 2) no regular stochastic pattern can model this dynamic evaluation process. First, we design an online optimization framework, which aims to minimize the dynamic regret that finds the distance between an online caching policy and the best dynamic policy in hindsight. Second, we propose a dynamic online learning method to solve the non-stationary caching problem formulated in the previous framework. Compared to the linear dynamic regret of previous methods, our proposal is proved to achieve a sublinear dynamic regret, from which it is guaranteed to be nearly optimal. We verify the design using both synthetic and real-world traces: the proposed policy achieves the best performance in the synthetic traces with different levels of dynamicity, which verifies the dynamic adaptation; our proposal consistently achieves at least 9.4% improvement than the baselines, including LRU, LFU, Static Online Learning based replacement, and Deep Reinforcement Learning based replacement, in random edge areas from real-world traces (from iQIYI), further verifying the effectiveness and robustness on the edge.

引用

页码：792 / 804

页数：13

共 50 条

[1] Near-Optimal Bounds for Online Caching with Machine Learned Advice
Rohatgi, Dhruv
[J]. PROCEEDINGS OF THE 2020 ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2020, : 1834 - 1845
[2] Near-Optimal Bounds for Online Caching with Machine Learned Advice
Rohatgi, Dhruv
[J]. PROCEEDINGS OF THE THIRTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA'20), 2020, : 1834 - 1845
[3] Near-optimal parallel prefetching and caching
Kimbrel, T
Karlin, AR
[J]. 37TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1996, : 540 - 549
[4] Near-optimal parallel prefetching and caching
Kimbrel, T
Karlin, AR
[J]. SIAM JOURNAL ON COMPUTING, 2000, 29 (04) : 1051 - 1082
[5] A Dynamic Near-Optimal Algorithm for Online Linear Programming
Agrawal, Shipra
Wang, Zizhuo
Ye, Yinyu
[J]. OPERATIONS RESEARCH, 2014, 62 (04) : 876 - 890
[6] Near-Optimal Online Auctions
Blum, Avrim
Hartline, Jason D.
[J]. PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2005, : 1156 - 1163
[7] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
Zhang, Junzhe
Bareinboim, Elias
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[8] Whether to Charge an Electric Vehicle or Not? A Near-Optimal Online Approach
Deng, Ruilong
Liang, Hao
[J]. 2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,
[9] Near-optimal online control of dynamic discrete-event systems
Grigorov, Lenko
Rudie, Karen
[J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2006, 16 (04): : 419 - 449
[10] Near-Optimal Online Control of Dynamic Discrete-Event Systems
Lenko Grigorov
Karen Rudie
[J]. Discrete Event Dynamic Systems, 2006, 16 : 419 - 449

← 1 2 3 4 5 →