A new scalable directory architecture for large-scale multiprocessors

Cited by: 23
Authors
Acacio, ME [1]
González, J [1]
García, JM [1]
Duato, J [1]
Affiliations
[1] Univ Murcia, Departamento Ing & Tecnol Computadores, E-30071 Murcia, Spain
DOI
10.1109/HPCA.2001.903255
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The memory overhead introduced by directories constitutes a major hurdle in the scalability of cc-NUMA architectures, which makes the shared-memory paradigm unfeasible for very large-scale systems. This work focuses on improving the scalability of shared-memory multiprocessors by significantly reducing the size of the directory. We propose multilayer clustering as an effective approach to reduce the directory-entry width. Detailed evaluation for 64 processors shows that using this approach we can drastically reduce the memory overhead, while suffering a performance degradation very similar to previous compressed schemes (such as Coarse Vector). In addition, a novel two-level directory architecture is proposed in order to eliminate the penalty caused by these compressed directories. This organization consists of a small Full-Map first-level directory (which provides precise information for the most recently referenced lines) and a compressed second-level directory (which provides in-excess information). Results show that a system with this directory architecture can achieve the same performance as a multiprocessor with a big and non-scalable Full-Map directory, with a very significant reduction of the memory overhead.
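As a rough illustration of the two-level organization described in the abstract, the sketch below models a small precise first level backed by a coarse second level. It is a toy model under stated assumptions, not the authors' design: the class name `TwoLevelDirectory`, the parameters `group_size` and `l1_entries`, the LRU replacement policy, and the use of a plain coarse vector as the compressed level (the paper proposes multilayer clustering) are all hypothetical choices made for this example.

```python
from collections import OrderedDict


class TwoLevelDirectory:
    """Toy model: precise (full-map) first level, coarse second level.

    All names and policies here are illustrative assumptions.
    """

    def __init__(self, num_nodes=64, group_size=4, l1_entries=2):
        self.num_nodes = num_nodes
        self.group_size = group_size    # nodes covered by one coarse bit
        self.l1_entries = l1_entries    # capacity of the first level
        self.l1 = OrderedDict()         # line -> exact set of sharer nodes
        self.l2 = {}                    # line -> set of coarse group indices

    def add_sharer(self, line, node):
        first_sharer = line not in self.l2
        # The second level always records the group containing the node.
        self.l2.setdefault(line, set()).add(node // self.group_size)
        if line in self.l1:
            self.l1[line].add(node)
            self.l1.move_to_end(line)   # mark as most recently referenced
        elif first_sharer:
            # Precise tracking can only start while every sharer is known.
            self.l1[line] = {node}
            if len(self.l1) > self.l1_entries:
                self.l1.popitem(last=False)  # evict least recently used

    def sharers(self, line):
        """Return the exact sharers if known, else a superset of them."""
        if line in self.l1:
            self.l1.move_to_end(line)
            return set(self.l1[line])
        nodes = set()
        for group in self.l2.get(line, ()):
            start = group * self.group_size
            end = min(start + self.group_size, self.num_nodes)
            nodes.update(range(start, end))
        return nodes
```

While a line stays in the first level, `sharers` returns exactly the nodes that were added. Once the line is evicted, the coarse level can only name whole groups, so the result includes nodes that never cached the line; this is the "in-excess information" that would cost extra invalidation messages in a real protocol.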
Pages: 97-106
Number of pages: 10
Related papers
50 in total
  • [21] Architectural Exploration of Large-Scale Hierarchical Chip Multiprocessors
    Nikitin, Nikita
    de San Pedro, Javier
    Cortadella, Jordi
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2013, 32 (10) : 1569 - 1582
  • [22] SPARCLE - AN EVOLUTIONARY PROCESSOR DESIGN FOR LARGE-SCALE MULTIPROCESSORS
    AGARWAL, A
    KUBIATOWICZ, J
    KRANZ, D
    LIM, BH
    YEUNG, D
    DSOUZA, G
    PARKIN, M
    IEEE MICRO, 1993, 13 (03) : 48 - 61
  • [23] Clustered affinity scheduling on large-scale NUMA multiprocessors
    Wang, YM
    Wang, HH
    Chang, RC
    JOURNAL OF SYSTEMS AND SOFTWARE, 1997, 39 (01) : 61 - 70
  • [24] A reliable and scalable overlay multicast architecture for large-scale video surveillance applications
    Yang Hongyun
    Chen Xuhui
    Hu Ruiming
    Chen Jun
    2008 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1-3, 2008, : 526 - +
  • [25] SHIP: A Scalable Hierarchical Power Control Architecture for Large-Scale Data Centers
    Wang, Xiaorui
    Chen, Ming
    Lefurgy, Charles
    Keller, Tom W.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (01) : 168 - 176
  • [26] Elastic pointer directory organization for scalable shared memory multiprocessors
    Yuhang Liu
    Mingfa Zhu
    Limin Xiao
    Journal of Systems Engineering and Electronics, 2014, 25 (01) : 158 - 167
  • [27] Elastic pointer directory organization for scalable shared memory multiprocessors
    Liu, Yuhang
    Zhu, Mingfa
    Xiao, Limin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2014, 25 (01) : 158 - 167
  • [28] Load-balanced Resource Directory Architecture for Large-scale Internet of Things Local Networks
    Kwon, Jung-Hyok
    Park, Jaehoon
    Kim, Eui-Jik
    SENSORS AND MATERIALS, 2018, 30 (08) : 1817 - 1824
  • [29] Design and performance of directory caches for scalable shared memory multiprocessors
    Michael, MM
    Nanda, AK
    FIFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1999, : 142 - 151
  • [30] DESIGN OF AN ADAPTIVE CACHE COHERENCE PROTOCOL FOR LARGE-SCALE MULTIPROCESSORS
    YANG, Q
    THANGADURAI, G
    BHUYAN, LN
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (03) : 281 - 293