Locality-aware data replication in the last-level cache for large scale multicores

被引:4
|
作者
Hijaz, Farrukh [1 ]
Shi, Qingchuan [1 ]
Kurian, George [2 ,3 ]
Devadas, Srinivas [2 ]
Khan, Omer [1 ]
机构
[1] Univ Connecticut, Storrs, CT USA
[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Google, Mountain View, CA USA
来源
JOURNAL OF SUPERCOMPUTING | 2016年 / 72卷 / 02期
基金
美国国家科学基金会;
关键词
Multicore; Cache hierarchy; Data management; Energy efficiency; CAPACITY ALLOCATION; CHIP; HIERARCHY; PLACEMENT; COHERENCE;
D O I
10.1007/s11227-015-1608-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Next generation large single-chip multicores will process massive data with varying degree of locality. Harnessing on-chip data locality to optimize the utilization of on-chip cache and network resources is of fundamental importance. We propose a locality-aware selective data replication protocol for the last-level cache (LLC). The goal is to lower memory access latency and energy by only replicating cache lines with high reuse in the LLC slice of the requesting core, while simultaneously keep the off-chip miss rate low. The approach relies on low-overhead yet highly accurate in hardware runtime cache line level classifier that only allows replication of cache lines with high reuse. Furthermore, a classifier captures the LLC pressure at the existing replica locations and adapts its replication decision accordingly. On a set of parallel benchmarks, the proposed protocol reduces overall energy by 14.7, 10.7, 10.5, and 16.7 % and completion time by 2.5, 6.5, 4.5, and 9.5 % when compared to the previously proposed Victim Replication, Adaptive Selective Replication, Reactive-NUCA, and Static-NUCA LLC management schemes. An efficient classifier implementation is evaluated with an overhead of 5.44 KB, which translates to only 1.58 % on top of the Static-NUCA baseline's cache related per-core storage.
引用
收藏
页码:718 / 752
页数:35
相关论文
共 50 条
  • [21] Cost aware cache replacement policy in shared last-level cache for hybrid memory based fog computing
    Jia, Gangyong
    Han, Guangjie
    Wang, Hao
    Wang, Feng
    ENTERPRISE INFORMATION SYSTEMS, 2018, 12 (04) : 435 - 451
  • [22] Avoiding Cache Thrashing due to Private Data Placement in Last-level Cache for Manycore Scaling
    Meng, Jiayuan
    Skadron, Kevin
    2009 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2009, : 282 - 288
  • [23] A cluster file system for high data availability using locality-aware partial replication
    Kim, Jinseok
    Sim, Sangman
    Park, Sungyong
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 345 - 350
  • [24] RExCache: Rapid Exploration of Unified Last-level Cache
    Shwe, Su Myat Min
    Javaid, Haris
    Parameswaran, Sri
    2013 18TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2013, : 582 - 587
  • [25] Availability of Data in Locality-Aware Unreliable Networks
    Geibig, Joanna
    MESH: 2009 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN MESH NETWORKS, 2009, : 163 - 166
  • [27] A Pragmatic Delineation on Cache Bypass Algorithm in Last-Level Cache (LLC)
    Dash, Banchhanidhi
    Swain, Debabala
    Swain, Debabrata
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM, VOL 2, 2016, 411 : 37 - 45
  • [28] LACS: A Locality-Aware Cost-Sensitive Cache Replacement Algorithm
    Kharbutli, Mazen
    Sheikh, Rami
    IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (08) : 1975 - 1987
  • [29] Premier: A Concurrency-Aware Pseudo-Partitioning Framework for Shared Last-Level Cache
    Lu, Xiaoyang
    Wang, Rujia
    Sun, Xian-He
    2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 391 - 394
  • [30] R2Cache: Reliability-Aware Reconfigurable Last-Level Cache Architecture for Multi-Cores
    Kriebel, Florian
    Subramaniyan, Arun
    Rehman, Semeen
    Ahandagbe, Segnon Jean Bruno
    Shafique, Muhammad
    Henkel, Joerg
    2015 INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS (CODES+ISSS), 2015, : 1 - 10