EXPERIMENTAL COMPARISON OF MEMORY MANAGEMENT POLICIES FOR NUMA MULTIPROCESSORS

Cited by: 31
Authors
LAROWE, RP [1]
ELLIS, CS [1]
Affiliation
[1] DUKE UNIV,DEPT COMP SCI,DURHAM,NC 27706
Source
Keywords
EXPERIMENTATION; MANAGEMENT; MEASUREMENT; PERFORMANCE
DOI
10.1145/118544.118546
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Nonuniformity of memory access is an almost inevitable feature of the memory architecture in shared memory multiprocessor designs that can scale to large numbers of processors. One implication of NUMA architectures is that the placement and movement of code and data are crucial to performance. As memory architectures become more complex and the nonuniformity becomes less well hidden, system software must assume a larger role in providing memory management support for the programmer. This paper investigates the role of the operating system. We take an experimental approach to evaluating a wide range of memory management policies. The target NUMA environment is BBN's GP-1000 multiprocessor. Extensive local modifications have been made to the memory management subsystem of BBN's nX operating system to support multiple policy implementations. Policy comparisons are based on the measured performance of real parallel applications. Our results show that there are memory management policies implemented in our system that can improve the performance of programs written using the simpler uniform memory access (UMA) programming model. While achieving the level of performance of a highly tuned NUMA program is still a difficult problem, some examples come close. There appears to be no single policy that can be considered the best over our set of test applications. Investigations into the contributions made by individual policy features toward overall behavior of the workload provide some insight into the design of a set of effective policies.
Pages: 319 - 363
Page count: 45
Related papers
50 in total
  • [11] POSTER: NUMA-aware Power Management for Chip Multiprocessors
    Ahn, Changmin
    Guzman, Camilo. A. Celis
    Egger, Bernhard
    2017 26TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2017, : 160 - 161
  • [12] Impact of CC-NUMA memory management policies on the application performance of multistage switching networks
    Bhuyan, LN
    Iyer, R
    Wang, HJ
    Kumar, A
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (03) : 230 - 246
  • [13] Challenges of memory management: On modern numa systems
    Gaud, Fabien
    Lepers, Baptiste
    Funston, Justin
    Dashti, Mohammad
    Fedorova, Alexandra
    Quéma, Vivien
    Lachaize, Renaud
    Roth, Mark
    2015, Association for Computing Machinery (13): 99 - 124
  • [14] SIMPLE BUT EFFECTIVE TECHNIQUES FOR NUMA MEMORY MANAGEMENT
    BOLOSKY, WJ
    FITZGERALD, RP
    SCOTT, ML
    OPERATING SYSTEMS REVIEW, VOL 23, NO 5, SPECIAL ISSUE: PROCEEDINGS OF THE TWELFTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 1989, : 19 - 31
  • [15] NUMA Awareness: Improving Thread and Memory Management
    Patrou, Maria
    Kent, Kenneth B.
    Dueck, Gerhard W.
    Gracie, Charlie
    Micic, Aleksandar
    44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 119 - 123
  • [16] Challenges of Memory Management on Modern NUMA Systems
    Gaud, Fabien
    Lepers, Baptiste
    Funston, Justin
    Dashti, Mohammad
    Fedorova, Alexandra
    Quema, Vivien
    Lachaize, Renaud
    Roth, Mark
    COMMUNICATIONS OF THE ACM, 2015, 58 (12) : 59 - 66
  • [17] Integrating dynamic memory placement with adaptive load-balancing for parallel codes on NUMA multiprocessors
    Slavin, Paul
    Freeman, Len
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 274 - 284
  • [18] PERFORMANCE BENEFITS AND LIMITATIONS OF LARGE NUMA MULTIPROCESSORS
    SEVCIK, KC
    ZHOU, SN
    PERFORMANCE EVALUATION, 1994, 20 (1-3) : 185 - 205
  • [19] Switch cache: A framework for improving the remote memory access latency of CC-NUMA multiprocessors
    Iyer, R
    Bhuyan, LN
    FIFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1999, : 152 - 160
  • [20] Exploiting Network Locality for CC-NUMA Multiprocessors
    Hung-Chang Hsiao
    Chung-Ta King
    The Journal of Supercomputing, 2001, 18 : 63 - 87