EXPERIMENTAL COMPARISON OF MEMORY MANAGEMENT POLICIES FOR NUMA MULTIPROCESSORS

Cited by: 31

Authors:

LAROWE, RP [1]

ELLIS, CS [1]

Affiliation:

[1] DUKE UNIV,DEPT COMP SCI,DURHAM,NC 27706

Source:

ACM TRANSACTIONS ON COMPUTER SYSTEMS | 1991 / Vol. 9 / No. 4

Keywords:

EXPERIMENTATION; MANAGEMENT; MEASUREMENT; PERFORMANCE

DOI:

10.1145/118544.118546

Chinese Library Classification:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Nonuniformity of memory access is an almost inevitable feature of the memory architecture in shared memory multiprocessor designs that can scale to large numbers of processors. One implication of NUMA architectures is that the placement and movement of code and data are crucial to performance. As memory architectures become more complex and the nonuniformity becomes less well hidden, system software must assume a larger role in providing memory management support for the programmer. This paper investigates the role of the operating system. We take an experimental approach to evaluating a wide range of memory management policies. The target NUMA environment is BBN's GP-1000 multiprocessor. Extensive local modifications have been made to the memory management subsystem of BBN's nX operating system to support multiple policy implementations. Policy comparisons are based on the measured performance of real parallel applications. Our results show that there are memory management policies implemented in our system that can improve the performance of programs written using the simpler uniform memory access (UMA) programming model. While achieving the level of performance of a highly tuned NUMA program is still a difficult problem, some examples come close. There appears to be no single policy that can be considered the best over our set of test applications. Investigations into the contributions made by individual policy features toward overall behavior of the workload provide some insight into the design of a set of effective policies.

Pages: 319-363

Page count: 45

