Caching in Dynamic Environments: A Near-Optimal Online Learning Approach

被引:7
|
作者
Zhou, Shiji [1 ]
Wang, Zhi [2 ,3 ]
Hu, Chenghao [4 ]
Mao, Yinan [5 ]
Yan, Haopeng [6 ]
Zhang, Shanghang [7 ]
Wu, Chuan [8 ]
Zhu, Wenwu [6 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst, Shenzhen 518000, Peoples R China
[2] Tsinghua Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[4] Univ Toronto, Elect & Comp Engn, Toronto, ON M4Y 1M7, Canada
[5] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China
[6] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100080, Peoples R China
[7] Univ Calif Berkeley, EECS, Berkeley, CA USA
[8] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Heuristic algorithms; Streaming media; Size measurement; Reinforcement learning; Proposals; Optimization; Area measurement; Dynamic environment; dynamic regret; online learning; POLICIES;
D O I
10.1109/TMM.2021.3132156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid growth of rich multimedia data in today's Internet, especially video traffic, has challenged the content delivery networks (CDNs). Caching serves as an important means to reduce user access latency so as to enable faster content downloads. Motivated by the dynamic nature of the real-world edge traces, this paper introduces a provably well online caching policy in dynamic environments where: 1) the popularity is highly dynamic; 2) no regular stochastic pattern can model this dynamic evaluation process. First, we design an online optimization framework, which aims to minimize the dynamic regret that finds the distance between an online caching policy and the best dynamic policy in hindsight. Second, we propose a dynamic online learning method to solve the non-stationary caching problem formulated in the previous framework. Compared to the linear dynamic regret of previous methods, our proposal is proved to achieve a sublinear dynamic regret, from which it is guaranteed to be nearly optimal. We verify the design using both synthetic and real-world traces: the proposed policy achieves the best performance in the synthetic traces with different levels of dynamicity, which verifies the dynamic adaptation; our proposal consistently achieves at least 9.4% improvement than the baselines, including LRU, LFU, Static Online Learning based replacement, and Deep Reinforcement Learning based replacement, in random edge areas from real-world traces (from iQIYI), further verifying the effectiveness and robustness on the edge.
引用
收藏
页码:792 / 804
页数:13
相关论文
共 50 条
  • [1] Near-Optimal Bounds for Online Caching with Machine Learned Advice
    Rohatgi, Dhruv
    [J]. PROCEEDINGS OF THE 2020 ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2020, : 1834 - 1845
  • [2] Near-Optimal Bounds for Online Caching with Machine Learned Advice
    Rohatgi, Dhruv
    [J]. PROCEEDINGS OF THE THIRTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA'20), 2020, : 1834 - 1845
  • [3] Near-optimal parallel prefetching and caching
    Kimbrel, T
    Karlin, AR
    [J]. 37TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1996, : 540 - 549
  • [4] Near-optimal parallel prefetching and caching
    Kimbrel, T
    Karlin, AR
    [J]. SIAM JOURNAL ON COMPUTING, 2000, 29 (04) : 1051 - 1082
  • [5] A Dynamic Near-Optimal Algorithm for Online Linear Programming
    Agrawal, Shipra
    Wang, Zizhuo
    Ye, Yinyu
    [J]. OPERATIONS RESEARCH, 2014, 62 (04) : 876 - 890
  • [6] Near-Optimal Online Auctions
    Blum, Avrim
    Hartline, Jason D.
    [J]. PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2005, : 1156 - 1163
  • [7] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
    Zhang, Junzhe
    Bareinboim, Elias
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Whether to Charge an Electric Vehicle or Not? A Near-Optimal Online Approach
    Deng, Ruilong
    Liang, Hao
    [J]. 2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,
  • [9] Near-optimal online control of dynamic discrete-event systems
    Grigorov, Lenko
    Rudie, Karen
    [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2006, 16 (04): : 419 - 449
  • [10] Near-Optimal Online Control of Dynamic Discrete-Event Systems
    Lenko Grigorov
    Karen Rudie
    [J]. Discrete Event Dynamic Systems, 2006, 16 : 419 - 449