A new scalable directory architecture for large-scale multiprocessors

被引:23
|
作者
Acacio, ME [1 ]
González, J [1 ]
García, JM [1 ]
Duato, J [1 ]
机构
[1] Univ Murcia, Dipartimento Ing & Tecnol Computadores, E-30071 Murcia, Spain
关键词
D O I
10.1109/HPCA.2001.903255
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The memory overhead introduced by directories constitutes a major hurdle in the scalability of cc-NUMA architectures, which makes the shared-memory paradigm unfeasible for very large-scale systems. This work is focused on improving the scalability of shared-memory multiprocessors by significantly reducing the size of the director)! We propose multilayer clustering as an effective approach to reduce the directory-entry width. Detailed evaluation for 64 processors shows that using this approach we can drastically reduce the memory overhead, while suffering a performance degradation very similar to previous compressed schemes (such as Coarse Vector). In addition, a novel two-level directory architecture is proposed in order to eliminate the penalty caused by these compressed directories. This organization consists of a small Full-Map first-level directory (which provides precise information for the most recently referenced lines) and a compressed second-level directory (which provides in-excess information). Results show that a system with this directory architecture can achieve the same performance as a multiprocessor with a big and non-scalable Full-Map directory, with a very significant reduction of the memory overhead.
引用
收藏
页码:97 / 106
页数:10
相关论文
共 50 条
  • [31] Peer-to-Peer models for resource discovery in large-scale Grids: A scalable architecture
    Talia, Domenico
    Trunfio, Paolo
    Zeng, Jingdi
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2006, 2007, 4395 : 66 - +
  • [32] DISTRIBUTING HOT-SPOT ADDRESSING IN LARGE-SCALE MULTIPROCESSORS
    YEW, PC
    TZENG, NF
    LAWRIE, DH
    IEEE TRANSACTIONS ON COMPUTERS, 1987, 36 (04) : 388 - 395
  • [33] Directory support for large-scale, automated service composition
    Binder, W
    Constantinescu, I
    Faltings, B
    SOFTWARE COMPOSITION, 2005, 3628 : 57 - 66
  • [34] Physical Planning for the Architectural Exploration of Large-Scale Chip Multiprocessors
    de San Pedro, Javier
    Nikitin, Nikita
    Cortadella, Jordi
    Petit, Jordi
    2013 SEVENTH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2013), 2013,
  • [35] PERFORMANCE OF PRUNING-CACHE DIRECTORIES FOR LARGE-SCALE MULTIPROCESSORS
    SCOTT, SL
    GOODMAN, JR
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, 4 (05) : 520 - 534
  • [36] Evaluating the impact of locality on the performance of large-scale SCI multiprocessors
    Al-Rousan, M
    Archibald, JK
    Bearnson, L
    PERFORMANCE EVALUATION, 2001, 46 (04) : 275 - 302
  • [37] Asynchronous parallel algorithm of large-scale system of equations for multiprocessors
    Bi, He-ping
    Feng, Guo-huan
    Proceedings of the International Symposium on Space Technology and Science, 1990,
  • [38] Scalable Algorithms for Bayesian Inference of Large-Scale Models from Large-Scale Data
    Ghattas, Omar
    Isaac, Tobin
    Petra, Noemi
    Stadler, Georg
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 3 - 6
  • [39] A scalable architecture for directory assistance automation
    Natarajan, P
    Prasad, R
    Schwartz, RM
    Makhoul, J
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 21 - 24
  • [40] An adaptive limited pointers directory scheme for cache coherence of scalable multiprocessors
    Park, CH
    Choi, JH
    Park, KH
    Park, D
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 753 - 756