Hierarchical Cache Directory for CMP

被引:21
|
作者
Guo, Song-Liu [1 ]
Wang, Hai-Xia [2 ]
Xue, Yi-Bo [2 ]
Li, Chong-Min [1 ]
Wang, Dong-Sheng [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
cache coherence protocol; hierarchical directory; chip multiprocessor; ARCHITECTURE;
D O I
10.1007/s11390-010-9321-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As more processing cores are integrated into one chip and feature size continues to shrink, the average access latency for remote nodes using directory-based coherence protocol becomes higher, which greatly impacts system performance. Previous techniques such as, data replication and data migration optimize the performance of the requesting core, but offer little improvement for neighbor nodes. Other techniques such as in-transit optimization try to reduce latency at the cost of increased storage. This paper introduces hierarchical cache directory into CMP (chip multiprocessor), which divides CMP tiles into multiple regions hierarchically, and combines it with data replication. A new directory organization is proposed to record the share status within a, region and assist the regional home to complete operation efficiently. Simulation results show that for a 16-core CMP, compared to traditional directory, hierarchical cache directory reduces average access latency by 9% and on-chip network traffic by 34% on average with less storage. Theoretical analyses show that for a 2(n) x 2(n) tiled CMP, the average access latency in hierarchical cache directory asymptotically approaches a function that is independent of n, hence the architecture is highly scalable.
引用
收藏
页码:246 / 256
页数:11
相关论文
共 50 条
  • [31] Seclusive Cache Hierarchy for Mitigating Cross-Core Cache and Coherence Directory Attacks
    Gupta, Vishal
    Ganesan, Vinod
    Panda, Biswabandan
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 637 - 640
  • [32] An Implementation of a Directory Protocol for a Cache Coherent System on FPGAs
    Mirian, Vincent
    Chow, Paul
    2012 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2012,
  • [33] A Directory Cache with Dynamic Private-Shared Partitioning
    Valls, Joan J.
    Gomez, Maria E.
    Ros, Alberto
    Sahuquillo, Julio
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 382 - 391
  • [34] Directory cache design for multi-core processor
    State Key Laboratory of High-End Server & Storage Technology , Beijing
    100085, China
    Jisuanji Yanjiu yu Fazhan, 6 (1242-1253):
  • [35] A hierarchical Internet object cache
    Chankhunthod, A
    Danzig, PB
    Neerdaels, C
    Schwartz, MF
    Worrell, KJ
    PROCEEDINGS OF THE USENIX 1996 ANNUAL TECHNICAL CONFERENCE, 1996, : 153 - 163
  • [36] Hierarchical cache consistency in a WAN
    Yin, J
    Alvisi, L
    Dahlin, M
    Lin, C
    USENIX ASSOCIATION PROCEEDINGS OF THE 2ND USENIX SYMPOSIUM ON INTERNET TECHNOLOGIES AND SYSTEMS (USITS'99), 1999, : 13 - 24
  • [37] Predicting Cache Needs and Cache Sensitivity for Applications in Cloud Computing on CMP Servers with Configurable Caches
    Machina, Jacob
    Sodan, Angela
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 3042 - 3049
  • [38] Hybrid Limited-Pointer Linked-List Cache Directory and Cache Coherence Protocol
    Mahmoud, Mostafa
    Wassal, Amr
    PROCEEDINGS OF THE 2013 SECOND INTERNATIONAL JAPAN-EGYPT CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (JEC-ECC), 2013, : 77 - 82
  • [39] NCID: A Non-inclusive Cache, Inclusive Directory Architecture for Flexible and Efficient Cache Hierarchies
    Zhao, Li
    Iyer, Ravi
    Makineni, Srihari
    Newell, Don
    Cheng, Liqun
    PROCEEDINGS OF THE 2010 COMPUTING FRONTIERS CONFERENCE (CF 2010), 2010, : 121 - 130
  • [40] Array Regrouping on CMP with Non-uniform Cache Sharing
    Jiang, Yunlian
    Zhang, Eddy Z.
    Shen, Xipeng
    Gao, Yaoqing
    Arehambault, Roch
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2011, 6548 : 92 - +