A new scalable directory architecture for large-scale multiprocessors

Cited by: 23

Authors:

Acacio, ME [1]

González, J [1]

García, JM [1]

Duato, J [1]

Affiliations:

[1] Univ Murcia, Departamento Ing & Tecnol Computadores, E-30071 Murcia, Spain

Source:

HPCA: SEVENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTING ARCHITECTURE, PROCEEDINGS | 2001

Keywords:

DOI:

10.1109/HPCA.2001.903255

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

The memory overhead introduced by directories constitutes a major hurdle to the scalability of cc-NUMA architectures, which makes the shared-memory paradigm infeasible for very large-scale systems. This work focuses on improving the scalability of shared-memory multiprocessors by significantly reducing the size of the directory. We propose multilayer clustering as an effective approach to reduce the directory-entry width. Detailed evaluation for 64 processors shows that this approach drastically reduces the memory overhead, while suffering a performance degradation very similar to that of previous compressed schemes (such as Coarse Vector). In addition, a novel two-level directory architecture is proposed in order to eliminate the penalty caused by these compressed directories. This organization consists of a small Full-Map first-level directory (which provides precise information for the most recently referenced lines) and a compressed second-level directory (which provides in-excess information). Results show that a system with this directory architecture can achieve the same performance as a multiprocessor with a large, non-scalable Full-Map directory, with a very significant reduction in memory overhead.
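To make the memory-overhead argument in the abstract concrete, the following is a minimal sketch (not the authors' code) comparing the sharing-code width of a Full-Map entry with that of a Coarse Vector entry, and showing why a compressed entry yields "in-excess" sharer information. The function names, the group size of 4, and the 128-byte line size are assumptions chosen for illustration; state bits are ignored, and the paper's multilayer clustering scheme is not modeled here.

```python
def full_map_bits(num_nodes: int) -> int:
    """Full-Map: one presence bit per node."""
    return num_nodes


def coarse_vector_bits(num_nodes: int, group_size: int) -> int:
    """Coarse Vector: one presence bit per group of `group_size` nodes."""
    return -(-num_nodes // group_size)  # ceiling division


def overhead_percent(entry_bits: int, line_bytes: int) -> float:
    """Sharing-code bits as a percentage of the memory line they describe."""
    return 100.0 * entry_bits / (line_bytes * 8)


def coarse_encode(sharers: set[int], group_size: int) -> int:
    """Set the bit of every group that contains at least one sharer."""
    vec = 0
    for node in sharers:
        vec |= 1 << (node // group_size)
    return vec


def coarse_decode(vec: int, num_nodes: int, group_size: int) -> set[int]:
    """Return every node in a marked group: a superset of the true sharers."""
    return {n for n in range(num_nodes) if (vec >> (n // group_size)) & 1}


if __name__ == "__main__":
    nodes, group, line = 64, 4, 128  # illustrative values (assumed)
    print(full_map_bits(nodes), coarse_vector_bits(nodes, group))
    print(overhead_percent(full_map_bits(nodes), line))
    vec = coarse_encode({5, 33}, group)
    print(sorted(coarse_decode(vec, nodes, group)))
```

With these assumed values the Full-Map entry needs 64 bits per line and the Coarse Vector entry 16, but decoding the compressed entry for sharers {5, 33} names all eight nodes in the two marked groups, so invalidations would reach nodes that never cached the line.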


Pages: 97-106

Page count: 10


