LDAC: Locality-Aware Data Access Control for Large-Scale Multicore Cache Hierarchies

被引:4
|
作者
Shi, Qingchuan [1 ]
Kurian, George [2 ]
Hijaz, Farrukh [3 ]
Devadas, Srinivas [4 ]
Khan, Omer [1 ]
机构
[1] Univ Connecticut, Elect & Comp Engn, Storrs, CT 06269 USA
[2] Google Inc, Mountain View, CA USA
[3] Qualcomm Inc, San Diegoi, CA USA
[4] MIT, Elect Engn & Comp Sci, Cambridge, MA USA
基金
美国国家科学基金会;
关键词
Multicore; cache; locality; REPLICATION;
D O I
10.1145/2983632
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The trend of increasing the number of cores to achieve higher performance has challenged efficient management of on-chip data. Moreover, many emerging applications process massive amounts of data with varying degrees of locality. Therefore, exploiting locality to improve on-chip traffic and resource utilization is of fundamental importance. Conventional multicore cache management schemes either manage the private cache (L1) or the Last-Level Cache (LLC), while ignoring the other. We propose a holistic locality-aware cache hierarchy management protocol for large-scale multicores. The proposed scheme improves on-chip data access latency and energy consumption by intelligently bypassing cache line replication in the L1 caches, and/or intelligently replicating cache lines in the LLC. The approach relies on low overhead yet highly accurate in-hardware runtime classification of data locality at both L1 cache and the LLC. The decision to bypass L1 and/or replicate in LLC is then based on the measured reuse at the fine granularity of cache lines. The locality tracking mechanism is decoupled from the sharer tracking structures that cause scalability concerns in traditional cache coherence protocols. Moreover, the complexity of the protocol is low since no additional coherence states are created. However, the proposed classifier incurs a 5.6KBper-core storage overhead. On a set of parallel benchmarks, the locality-aware protocol reduces average energy consumption by 26% and completion time by 16%, when compared to the state-of-the-art Reactive-NUCA multicore cache management scheme.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Locality-aware data replication in the last-level cache for large scale multicores
    Farrukh Hijaz
    Qingchuan Shi
    George Kurian
    Srinivas Devadas
    Omer Khan
    [J]. The Journal of Supercomputing, 2016, 72 : 718 - 752
  • [2] Locality-aware data replication in the last-level cache for large scale multicores
    Hijaz, Farrukh
    Shi, Qingchuan
    Kurian, George
    Devadas, Srinivas
    Khan, Omer
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (02): : 718 - 752
  • [3] A Locality-Aware, Energy-Efficient Cache Design for Large-Scale Multi-Core Systems
    Alshegaifi, Abdulrahman
    Huang, Chun-Hsi
    [J]. IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 497 - 502
  • [4] Locality-Aware Data Replication in the Last-Level Cache
    Kurian, George
    Devadas, Srinivas
    Khan, Omer
    [J]. 2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 1 - 12
  • [5] A Locality-Aware Write Filter Cache for Energy Reduction of STTRAM-Based L1 Data Cache
    Kong, Joonho
    [J]. JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2016, 16 (01) : 80 - 90
  • [6] Constraint-aware large-scale CMP cache design
    Zhao, L.
    Lyer, R.
    Makineni, S.
    Illikkal, R.
    Moses, J.
    Newell, D.
    [J]. HIGH PERFORMANCE COMPUTING - HIPC 2007, PROCEEDINGS, 2007, 4873 : 161 - 171
  • [7] RCBAC: A risk-aware content-based access control model for large-scale text data
    Ma, Ke
    Yang, Geng
    Xiang, Yang
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2020, 167
  • [8] A Data Locality Optimization Algorithm for Large-scale Data Processing in Hadoop
    Zhao, Yanrong
    Wang, Weiping
    Meng, Dan
    Yang, Xiufeng
    Zhang, Shubin
    Li, Jun
    Guan, Gang
    [J]. 2012 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2012, : 655 - 661
  • [9] Distributed cache management for context-aware services in large-scale networks
    Takase, Masaaki
    Sano, Takeshi
    Fukuda, Kenichi
    Chugo, Akira
    [J]. MANAGING NEXT GENERATION NETWORKS AND SERVICES, PROCEEDINGS, 2007, 4773 : 31 - +
  • [10] A Comparison of Systems to Large-Scale Data Access
    Mesmoudi, Amin
    Hacid, Mohand-Said
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, 2014, 8505 : 161 - 175