Performance analysis of four parallel programming models on NUMA architectures

被引:0
|
作者
Mohamed, AS [1 ]
Cantonnet, F [1 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
关键词
NAS; OpenMP; MPI; SHMEM; and DSM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient parallel implementation of the NAS NPB benchmark is a challenging task. In this paper, We compare the performance of and the programming effort required for coding the NAS NPB benchmark under four leading parallel programming models: MPI, OpenMP, SHMEM, and DSM on an SGI NUMA Origin 3800 system, a machine which supports all four models efficiently. We make use of the spectrum of performance analysis and profiler tools within the SGI NUMA environment to monitor various low-level physical parameters to analyze the efficiency and performance of each of the programming models. Our objective is to be able to compare the physically monitored parameters across the four programming models. Using this visualized information, we will be able to better understand the communication, data/threads layouts, and I/O bottlenecks in these parallel programming models. Results indicate that the four models deliver comparable performance; however, the implementations differ significantly beyond merely using explicit messages versus implicit loads/stores even though the basic parallel algorithms are similar. Compared with the message-passing (using MPI) and SHMEM programming models, the cache-coherent distributed shared address space DSM-UPC and shared OpenMP models provide substantial ease of programming at both the conceptual and program orchestration levels, often accompanied by performance gains. However, DSM-UPC currently has portability limitations and may suffer from poor spatial locality of physically distributed shared data on large numbers of processors.
引用
收藏
页码:119 / 125
页数:7
相关论文
共 50 条
  • [1] Task-Parallel Programming on NUMA Architectures
    Terboven, Christian
    Schmidl, Dirk
    Cramer, Tim
    Mey, Dieter An
    [J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 638 - 649
  • [2] Performance oriented programming for NUMA architectures
    Chapman, B
    Patil, A
    Prabhakar, A
    [J]. OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2001, 2104 : 137 - 154
  • [3] A Survey on Parallel Architectures and Programming Models
    Pervan, Branimir
    Knezovic, Josip
    [J]. 2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 999 - 1005
  • [4] PARALLEL PROGRAMMING MODELS FOR HETEROGENEOUS MULTICORE ARCHITECTURES
    Ferrer, Roger
    Bellens, Pieter
    Beltran, Vicenc
    Gonzalez, Marc
    Martorell, Xavier
    Badia, Rosa M.
    Ayguade, Eduard
    Yeom, Jae-Seung
    Schneider, Scott
    Koukos, Konstantinos
    Alvanos, Michail
    Nikolopoulos, Dimitrios S.
    Bilas, Angelos
    [J]. IEEE MICRO, 2010, 30 (05) : 42 - 53
  • [5] PARALLEL SUPERCOMPUTER ARCHITECTURES AND THEIR PROGRAMMING-MODELS
    GILOI, WK
    [J]. PARALLEL COMPUTING, 1994, 20 (10-11) : 1443 - 1470
  • [6] Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures
    Catalan, Sandra
    Igual, Francisco D.
    Herrero, Jose R.
    Rodriguez-Sanchez, Rafael
    Quintana-Orti, Enrique S.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 175 : 51 - 65
  • [7] On the performance of BWA on NUMA architectures
    Lenis, Josefina
    Senar, Miquel Angel
    [J]. 2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 3, 2015, : 236 - 241
  • [8] A Methodology Approach to Compare Performance of Parallel Programming Models for Shared-Memory Architectures
    Utrera, Gladys
    Gil, Marisa
    Martorell, Xavier
    [J]. NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS, PT I, 2020, 11973 : 318 - 325
  • [9] Performance and availability evaluation of NUMA architectures
    Geist, R
    Westall, J
    [J]. IEEE INTERNATIONAL COMPUTER PERFORMANCE AND DEPENDABILITY SYMPOSIUM - IPDS'96, PROCEEDINGS, 1996, : 271 - 280
  • [10] Performance bounds for modeling NUMA architectures
    Geist, R
    [J]. INFORMATION PROCESSING LETTERS, 1997, 63 (02) : 113 - 117