LDAC: Locality-Aware Data Access Control for Large-Scale Multicore Cache Hierarchies

被引:4
|
作者
Shi, Qingchuan [1 ]
Kurian, George [2 ]
Hijaz, Farrukh [3 ]
Devadas, Srinivas [4 ]
Khan, Omer [1 ]
机构
[1] Univ Connecticut, Elect & Comp Engn, Storrs, CT 06269 USA
[2] Google Inc, Mountain View, CA USA
[3] Qualcomm Inc, San Diegoi, CA USA
[4] MIT, Elect Engn & Comp Sci, Cambridge, MA USA
基金
美国国家科学基金会;
关键词
Multicore; cache; locality; REPLICATION;
D O I
10.1145/2983632
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The trend of increasing the number of cores to achieve higher performance has challenged efficient management of on-chip data. Moreover, many emerging applications process massive amounts of data with varying degrees of locality. Therefore, exploiting locality to improve on-chip traffic and resource utilization is of fundamental importance. Conventional multicore cache management schemes either manage the private cache (L1) or the Last-Level Cache (LLC), while ignoring the other. We propose a holistic locality-aware cache hierarchy management protocol for large-scale multicores. The proposed scheme improves on-chip data access latency and energy consumption by intelligently bypassing cache line replication in the L1 caches, and/or intelligently replicating cache lines in the LLC. The approach relies on low overhead yet highly accurate in-hardware runtime classification of data locality at both L1 cache and the LLC. The decision to bypass L1 and/or replicate in LLC is then based on the measured reuse at the fine granularity of cache lines. The locality tracking mechanism is decoupled from the sharer tracking structures that cause scalability concerns in traditional cache coherence protocols. Moreover, the complexity of the protocol is low since no additional coherence states are created. However, the proposed classifier incurs a 5.6KBper-core storage overhead. On a set of parallel benchmarks, the locality-aware protocol reduces average energy consumption by 26% and completion time by 16%, when compared to the state-of-the-art Reactive-NUCA multicore cache management scheme.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] DCSACA: distributed constraint service-aware collaborative access algorithm based on large-scale access to the Internet of Things
    Yi Meng
    Chen QingKui
    [J]. The Journal of Supercomputing, 2018, 74 : 6408 - 6427
  • [42] Multidimensional data organization and random access in large-scale DNA storage systems
    Song, Xin
    Shah, Shalin
    Reif, John
    [J]. THEORETICAL COMPUTER SCIENCE, 2021, 894 : 190 - 202
  • [43] Ntuple Wizard: An Application to Access Large-Scale Open Data from LHCb
    Aidala C.A.
    Burr C.
    Cattaneo M.
    Fitzgerald D.S.
    Morris A.
    Neubert S.
    Tropmann D.
    [J]. Computing and Software for Big Science, 2023, 7 (1)
  • [44] Multidimensional data organization and random access in large-scale DNA storage systems
    Song, Xin
    Shah, Shalin
    Reif, John
    [J]. Theoretical Computer Science, 2021, 894 : 190 - 202
  • [45] An architecture and implementation to support large-scale data access in scientific simulation environments
    Holmes, VP
    Kleban, SD
    Miller, DJ
    Pavlakos, C
    Poore, CA
    Vandewart, RL
    Crowley, CP
    [J]. 35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 169 - 176
  • [46] A Protocol-aware Network Control Method for Large-scale Data Transfer in Long-distance Broadband Networks
    Hirose, Jin
    Baba, Ken-ichi
    Abe, Hirotake
    [J]. 2011 20TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2011,
  • [47] Swarming Medium Access Control Protocol for Large-Scale Wireless Sensor Networks
    Mickus, Tautvydas
    Clarke, Tim
    Mitchell, Paul
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON NEXT GENERATION MOBILE APPLICATIONS, SERVICES AND TECHNOLOGIES (NGMAST 2015), 2015, : 102 - 107
  • [48] Low Latency and Resource-aware Program Composition for Large-scale Data Analysis
    Tanaka, Masahiro
    Taura, Kenjiro
    Torisawa, Kentaro
    [J]. 2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 325 - 330
  • [49] Salience-aware adaptive resonance theory for large-scale sparse data clustering
    Meng, Lei
    Tan, Ah-Hwee
    Miao, Chunyan
    [J]. NEURAL NETWORKS, 2019, 120 : 143 - 157
  • [50] Communication-Aware Container Placement and Reassignment in Large-Scale Internet Data Centers
    Lv, Liang
    Zhang, Yuchao
    Li, Yusen
    Xu, Ke
    Wang, Dan
    Wang, Wendong
    Li, Minghui
    Cao, Xuan
    Liang, Qingqing
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (03) : 540 - 555