LDAC: Locality-Aware Data Access Control for Large-Scale Multicore Cache Hierarchies

被引：4

作者：

Shi, Qingchuan ^{[1
]}

Kurian, George ^{[2
]}

Hijaz, Farrukh ^{[3
]}

Devadas, Srinivas ^{[4
]}

Khan, Omer ^{[1
]}

机构：

[1] Univ Connecticut, Elect & Comp Engn, Storrs, CT 06269 USA

[2] Google Inc, Mountain View, CA USA

[3] Qualcomm Inc, San Diegoi, CA USA

[4] MIT, Elect Engn & Comp Sci, Cambridge, MA USA

来源：

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION | 2016年 / 13卷 / 04期

基金：

美国国家科学基金会;

关键词：

Multicore; cache; locality; REPLICATION;

D O I：

10.1145/2983632

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The trend of increasing the number of cores to achieve higher performance has challenged efficient management of on-chip data. Moreover, many emerging applications process massive amounts of data with varying degrees of locality. Therefore, exploiting locality to improve on-chip traffic and resource utilization is of fundamental importance. Conventional multicore cache management schemes either manage the private cache (L1) or the Last-Level Cache (LLC), while ignoring the other. We propose a holistic locality-aware cache hierarchy management protocol for large-scale multicores. The proposed scheme improves on-chip data access latency and energy consumption by intelligently bypassing cache line replication in the L1 caches, and/or intelligently replicating cache lines in the LLC. The approach relies on low overhead yet highly accurate in-hardware runtime classification of data locality at both L1 cache and the LLC. The decision to bypass L1 and/or replicate in LLC is then based on the measured reuse at the fine granularity of cache lines. The locality tracking mechanism is decoupled from the sharer tracking structures that cause scalability concerns in traditional cache coherence protocols. Moreover, the complexity of the protocol is low since no additional coherence states are created. However, the proposed classifier incurs a 5.6KBper-core storage overhead. On a set of parallel benchmarks, the locality-aware protocol reduces average energy consumption by 26% and completion time by 16%, when compared to the state-of-the-art Reactive-NUCA multicore cache management scheme.

引用

页数：28

共 50 条

[41] DCSACA: distributed constraint service-aware collaborative access algorithm based on large-scale access to the Internet of Things
Yi Meng
Chen QingKui
[J]. The Journal of Supercomputing, 2018, 74 : 6408 - 6427
[42] Multidimensional data organization and random access in large-scale DNA storage systems
Song, Xin
Shah, Shalin
Reif, John
[J]. THEORETICAL COMPUTER SCIENCE, 2021, 894 : 190 - 202
[43] Ntuple Wizard: An Application to Access Large-Scale Open Data from LHCb
Aidala C.A.
Burr C.
Cattaneo M.
Fitzgerald D.S.
Morris A.
Neubert S.
Tropmann D.
[J]. Computing and Software for Big Science, 2023, 7 (1)
[44] Multidimensional data organization and random access in large-scale DNA storage systems
Song, Xin
Shah, Shalin
Reif, John
[J]. Theoretical Computer Science, 2021, 894 : 190 - 202
[45] An architecture and implementation to support large-scale data access in scientific simulation environments
Holmes, VP
Kleban, SD
Miller, DJ
Pavlakos, C
Poore, CA
Vandewart, RL
Crowley, CP
[J]. 35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 169 - 176
[46] A Protocol-aware Network Control Method for Large-scale Data Transfer in Long-distance Broadband Networks
Hirose, Jin
Baba, Ken-ichi
Abe, Hirotake
[J]. 2011 20TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2011,
[47] Swarming Medium Access Control Protocol for Large-Scale Wireless Sensor Networks
Mickus, Tautvydas
Clarke, Tim
Mitchell, Paul
[J]. 2015 9TH INTERNATIONAL CONFERENCE ON NEXT GENERATION MOBILE APPLICATIONS, SERVICES AND TECHNOLOGIES (NGMAST 2015), 2015, : 102 - 107
[48] Low Latency and Resource-aware Program Composition for Large-scale Data Analysis
Tanaka, Masahiro
Taura, Kenjiro
Torisawa, Kentaro
[J]. 2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 325 - 330
[49] Salience-aware adaptive resonance theory for large-scale sparse data clustering
Meng, Lei
Tan, Ah-Hwee
Miao, Chunyan
[J]. NEURAL NETWORKS, 2019, 120 : 143 - 157
[50] Communication-Aware Container Placement and Reassignment in Large-Scale Internet Data Centers
Lv, Liang
Zhang, Yuchao
Li, Yusen
Xu, Ke
Wang, Dan
Wang, Wendong
Li, Minghui
Cao, Xuan
Liang, Qingqing
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (03) : 540 - 555

← 1 2 3 4 5 →