A two-level directory architecture for highly scalable cc-NUMA multiprocessors

被引:27
|
作者
Acacio, ME
González, J
García, JM
Duato, J
机构
[1] Univ Murcia, Dept Ingn & Tecnol Comp, Fac Informat, E-30071 Murcia, Spain
[2] Intel Labs Barcelona, Intel Barcelona Res Ctr, Barcelona 08034, Spain
[3] Univ Politecn Valencia, Dept Informat Sistemas & Comp, Valencia 46010, Spain
关键词
scalability; directory memory overhead; two-level directory architecture; compressed sharing codes; unnecessary coherence messages; cc-NUMA multiprocessor;
D O I
10.1109/TPDS.2005.4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
One important issue the designer of a scalable shared-memory multiprocessor must deal with is the amount of extra memory required to store the directory information. It is desirable that the directory memory overhead be kept as low as possible, and that it scales very slowly with the size of the machine. Unfortunately, current directory architectures provide scalability at the expense of performance. This work presents a scalable directory architecture that significantly reduces the size of the directory for large-scale configurations of a multiprocessor without degrading performance. First, we propose multilayer clustering as an effective approach to reduce the width of directory entries. Based on this concept, we derive three new compressed sharing codes, some of them with a space complexity of O(log(2)(log(2)(N))) for an N-node system. Then, we present a novel two-level directory architecture to eliminate the penalty caused by compressed directories in general. The proposed organization consists of a small full-map first-level directory (which provides precise information for the most recently referenced lines) and a compressed second-level directory (which provides in-excess information for all the lines). The proposals are evaluated based on extensive execution-driven simulations (using RSIM) of a 64-node cc-NUMA multiprocessor. Results demonstrate that a system with a two-level directory architecture achieves the same performance as a multiprocessor with a big and nonscalable full-map directory, with a very significant reduction of the memory overhead.
引用
收藏
页码:67 / 79
页数:13
相关论文
共 50 条
  • [1] A two-level directory organization solution for CC-NUMA systems
    Pan, Guoteng
    Don, Qiang
    Xie, Lunguo
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2007, 4494 : 142 - +
  • [2] Implementation and evaluation of directory hints in CC-NUMA multiprocessors
    Hsiao, HC
    King, CT
    [J]. PARALLEL COMPUTING, 2002, 28 (01) : 107 - 132
  • [3] Design and evaluation of a switch cache architecture for CC-NUMA multiprocessors
    Iyer, RR
    Bhuyan, LN
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2000, 49 (08) : 779 - 797
  • [4] A split L2 data cache for scalable CC-NUMA multiprocessors
    Chung, SW
    Kim, HS
    Jhon, CS
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2005, 14 (03) : 605 - 617
  • [5] Exploiting Network Locality for CC-NUMA Multiprocessors
    Hung-Chang Hsiao
    Chung-Ta King
    [J]. The Journal of Supercomputing, 2001, 18 : 63 - 87
  • [6] Exploiting network locality for CC-NUMA multiprocessors
    Hsiao, HC
    King, CT
    [J]. JOURNAL OF SUPERCOMPUTING, 2001, 18 (01): : 63 - 87
  • [7] Coherence Controller Architectures for SMP-Based CC-NUMA Multiprocessors Architecture
    Michael, M. M.
    Nanda, A. K.
    Lim, B.-H.
    Scott, M. L.
    [J]. Computer Architecture News, 25 (02):
  • [8] Performance evaluation of cache depot on CC-NUMA multiprocessors
    Hsiao, HC
    King, CT
    [J]. 1998 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 519 - 526
  • [9] Page-mapping techniques for CC-NUMA multiprocessors
    Huang, J
    Jin, GH
    Li, ZY
    [J]. ICA(3)PP 97 - 1997 3RD INTERNATIONAL CONFERENCE ON ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, 1997, : 91 - 104
  • [10] Performance evaluation for CC-NUMA multiprocessors using an OLTP workload
    Chung, YW
    Kim, H
    Park, JW
    Lee, K
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2001, 25 (04) : 221 - 229