Hierarchical Cache Directory for CMP

被引:21
|
作者
Guo, Song-Liu [1 ]
Wang, Hai-Xia [2 ]
Xue, Yi-Bo [2 ]
Li, Chong-Min [1 ]
Wang, Dong-Sheng [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
cache coherence protocol; hierarchical directory; chip multiprocessor; ARCHITECTURE;
D O I
10.1007/s11390-010-9321-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As more processing cores are integrated into one chip and feature size continues to shrink, the average access latency for remote nodes using directory-based coherence protocol becomes higher, which greatly impacts system performance. Previous techniques such as, data replication and data migration optimize the performance of the requesting core, but offer little improvement for neighbor nodes. Other techniques such as in-transit optimization try to reduce latency at the cost of increased storage. This paper introduces hierarchical cache directory into CMP (chip multiprocessor), which divides CMP tiles into multiple regions hierarchically, and combines it with data replication. A new directory organization is proposed to record the share status within a, region and assist the regional home to complete operation efficiently. Simulation results show that for a 16-core CMP, compared to traditional directory, hierarchical cache directory reduces average access latency by 9% and on-chip network traffic by 34% on average with less storage. Theoretical analyses show that for a 2(n) x 2(n) tiled CMP, the average access latency in hierarchical cache directory asymptotically approaches a function that is independent of n, hence the architecture is highly scalable.
引用
收藏
页码:246 / 256
页数:11
相关论文
共 50 条
  • [41] Performance evaluation of a novel CMP cache structure for hybrid workloads
    Zhao, Xuemei
    Sammut, Karl
    He, Fangpo
    EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007, : 89 - 96
  • [42] SARC COHERENCE: SCALING DIRECTORY CACHE COHERENCE IN PERFORMANCE AND POWER
    Kaxiras, Stefanos
    Keramidas, Georgios
    IEEE MICRO, 2010, 30 (05) : 54 - 65
  • [43] Improving cache performance with Full-Map Block Directory
    Peir, JK
    Hsu, WW
    Young, H
    Ong, S
    JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (05) : 439 - 454
  • [44] A COHERENT DISTRIBUTED FILE CACHE WITH DIRECTORY WRITE-BEHIND
    MANN, T
    BIRRELL, A
    HISGEN, A
    JERIAN, C
    SWART, G
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1994, 12 (02): : 123 - 164
  • [45] A New Approach to Directory Based Solution for Cache Coherence Problem
    Mittal, Shaily
    Nitin
    2014 3RD INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS 2014), 2014, : 9 - 13
  • [46] WiDir: A Wireless-Enabled Directory Cache Coherence Protocol
    Franques, Antonio
    Kokolis, Apostolos
    Abadal, Sergi
    Fernando, Vimuth
    Misailovic, Sasa
    Torrellas, Josep
    2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 304 - 317
  • [47] Fast cache access with full-map block directory
    Peir, JK
    Hsu, WW
    Young, H
    Ong, S
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1997, : 578 - 586
  • [48] Directory Based Cache Coherence Modeller in Multiprocessors: Medium Insight
    Arora, Harsh
    Mukherjee, Rijubrata
    Bej, Abhijit
    Adak, Hillol
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2611 - 2617
  • [49] DS-Cache: A Refined Directory Entry Lookup Cache with Prefix-Awareness for Mobile Devices
    Han, Lei
    Xiao, Bin
    Dong, Xuwei
    Shen, Zhaoyan
    Shao, Zili
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 1052 - 1057
  • [50] Hierarchical Cache Optimization in IPTV Networks
    Krogfoss, Bill
    Sofman, Lev B.
    Agrawal, Anshul
    BMSB: 2009 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, VOLS 1 AND 2, 2009, : 341 - 350