Characterizing the impact of last-level cache replacement policies on big-data workloads

被引:5
|
作者
Jamet, Alexandre Valentin [1 ]
Alvarez, Lluc [1 ,2 ]
Jimenez, Daniel A. [3 ]
Casas, Marc [1 ]
机构
[1] Barcelona Supercomp Ctr BSC, Barcelona, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
[3] Texas A&M Univ, College Stn, TX 77843 USA
基金
美国国家科学基金会; 欧盟地平线“2020”;
关键词
cache management; big data; graph processing; workload evaluation; micro-architecture;
D O I
10.1109/IISWC50251.2020.00022
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The vast disparity between Last Level Cache (LLC) and memory latencies has motivated the need for efficient cache management policies. The computer architecture literature abounds with work on LLC replacement policy. Although these works greatly improve over the least-recently-used (LRU) policy, they tend to focus only on the SPEC CPU 2006 benchmark suite - and more recently on the SPEC CPU 2017 benchmark suite - for evaluation. However, these workloads are representative for only a subset of current High-Performance Computing (HPC) workloads. In this paper we evaluate the behavior of a mix of graph processing, scientific and industrial workloads (GAP, XSBench and Qualcomm) along with the well-known SPEC CPU 2006 and SPEC CPU 2017 workloads on state-of-the-art LLC replacement policies such as Multiperspective Reuse Prediction (MPPPB), Glider, Hawkeye, SHiP, DRRIP and SRRIP. Our evaluation reveals that, even though current state-of-the-art LLC replacement policies provide a significant performance improvement over LRU for both SPEC CPU 2006 and SPEC CPU 2017 workloads, those policies are hardly able to capture the access patterns and yield sensible improvement on current HPC and big data workloads due to their highly complex behavior. In addition, this paper introduces two new LLC replacement policies derived from MPPPB. The first proposed replacement policy, Multi-Sampler Multiperspective (MS-MPPPB), uses multiple samplers instead of a single one and dynamically selects the best-behaving sampler to drive reuse distance predictions. The second replacement policy presented in this paper, Multiperspective with Dynamic Features Selector (DS-MPPPB), selects the best behaving features among a set of 64 features to improve the accuracy of the predictions. On a large set of workloads that stress the LLC, MS-MPPPB achieves a geometric mean speed-up of 8.3% over LRU, while DS-MPPPB outperforms LRU by a geometric mean speedup of 8.0%. For big data and HPC workloads, the two proposed techniques present higher performance benefits than state-of-the-art approaches such as MPPPB, Glider and Hawkeye, which yield geometric mean speedups of 7.0%, 5.0% and 4.8% over LRU, respectively.
引用
收藏
页码:134 / 144
页数:11
相关论文
共 50 条
  • [1] Performance and Energy Assessment of Last-Level Cache Replacement Policies
    Peneau, Pierre-Yves
    Novo, David
    Bruguier, Florent
    Sassatelli, Gilles
    Gamatie, Abdoulaye
    [J]. PROCEEDINGS OF 2017 FIRST INTERNATIONAL CONFERENCE ON EMBEDDED & DISTRIBUTED SYSTEMS (EDIS 2017), 2017, : 149 - 154
  • [2] Characterizing Multi-threaded Applications for Designing Sharing-aware Last-level Cache Replacement Policies
    Natarajan, Ragavendra
    Chaudhuri, Mainak
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2013), 2013, : 1 - +
  • [3] Dynamic Adaptive Replacement Policy in Shared Last-Level Cache of DRAM/PCM Hybrid Memory for Big Data Storage
    Jia, Gangyong
    Han, Guangjie
    Jiang, Jinfang
    Liu, Li
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (04) : 1951 - 1960
  • [4] Access Pattern Characterization of Last-level Cache for Effective Replacement
    Anik, Shafayat Mowla
    Lee, Byeong Kil
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1113 - 1116
  • [5] Last-level Cache Deduplication
    Tian, Yingying
    Khan, Samira M.
    Jimenez, Daniel A.
    Loh, Gabriel H.
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, (ICS'14), 2014, : 53 - 62
  • [6] The Virtual Write Queue: Coordinating DRAM and Last-Level Cache Policies
    Stuecheli, Jeffrey
    Kaseridis, Dimitris
    Daly, David
    Hunter, Hillery C.
    John, Lizy K.
    [J]. ISCA 2010: THE 37TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2010, : 72 - 82
  • [7] An Application-Aware Cache Replacement Policy for Last-Level Caches
    Warrier, Tripti S.
    Anupama, B.
    Mutyam, Madhu
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2013, 2013, 7767 : 207 - 219
  • [8] Locality-Aware Data Replication in the Last-Level Cache
    Kurian, George
    Devadas, Srinivas
    Khan, Omer
    [J]. 2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 1 - 12
  • [9] Cooperatively Managing Dynamic Writeback and Insertion Policies in a Last-level DRAM Cache
    Yin, Shouyi
    Li, Jiakun
    Liu, Leibo
    Wei, Shaojun
    Guo, Yike
    [J]. 2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 187 - 192
  • [10] Contention Tracking in GPU Last-Level Cache
    Barrera, Javier
    Kosmidis, Leonidas
    Tabani, Hamid
    Abella, Jaume
    Cazorla, Francisco J.
    [J]. 2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 76 - 79