OpenMP-oriented applications for distributed shared memory architectures

被引:6
|
作者
Marowka, A [1 ]
Liu, ZY [1 ]
Chapman, B [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
来源
关键词
OpenMP; data locality; NAS parallel benchmarks; programming model;
D O I
10.1002/cpe.752
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The rapid rise of OpenMP as the preferred parallel programming paradigm for small-to-medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model-of-choice for large scale high-performance parallel computing in the coming decade. The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc-NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance. This paper presents a simple software programming approach called copy-inside-copy-back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automat and implicit approaches. Moreover, the CC approach improves on the OpenMP-SPMD style of programming that makes the development process of an OpenMP application more structured and simpler. The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse-grained parallelism, although a fast copy mechanism is essential. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:371 / 384
页数:14
相关论文
共 50 条
  • [21] Gecko: Hierarchical Distributed View of Heterogeneous Shared Memory Architectures
    Ghane, Millad
    Chandrasekaran, Sunita
    Cheung, Margaret S.
    PROCEEDINGS OF THE TENTH INTERNATIONAL WORKSHOP ON PROGRAMMING MODELS AND APPLICATIONS FOR MULTICORES AND MANYCORES (PMAM 2019), 2019, : 21 - 30
  • [22] Parallelizing the ZSWEEP algorithm for distributed-shared memory architectures
    Farias, R
    Silva, CT
    VOLUME GRAPHICS 2001, 2001, : 181 - +
  • [23] Improving Hash Distributed A* for Shared Memory Architectures Using Abstraction
    Sanz, Victoria
    De Giusti, Armando
    Naiouf, Marcelo
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 431 - 439
  • [24] Visualizing the memory access behavior of shared memory applications on NUMA architectures
    Tao, J
    Karl, W
    Schulz, M
    COMPUTATIONAL SCIENCE -- ICCS 200, PROCEEDINGS PT 2, 2001, 2074 : 861 - 870
  • [25] Evaluation of SMP Shared Memory Machines for Use With In-Memory and OpenMP Big Data Applications
    Younge, Andrew J.
    Reidy, Christopher
    Henschel, Robert
    Fox, Geoffrey C.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1597 - 1606
  • [26] Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system
    Ojima, Y
    Sato, M
    Harada, H
    Ishikawa, Y
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 450 - 456
  • [27] DVSA and SHOB: Support to shared data structures on distributed memory architectures
    Baiardi, F
    Guerri, D
    Mori, P
    Moroni, L
    Ricci, L
    NINTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2001, : 165 - 172
  • [28] A SHARED ENVIRONMENT PARALLEL LOGIC PROGRAMMING SYSTEM ON DISTRIBUTED MEMORY ARCHITECTURES
    DELGADORANNAURO, SA
    DOROCHEVSKY, M
    SCHUERMAN, K
    VERON, A
    XU, JY
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 487 : 371 - 380
  • [29] A Message-Passing Microcoded Synchronization for Distributed Shared Memory Architectures
    Tasoulas, Zois-Gerasimos
    Anagnostopoulos, Iraklis
    Papadopoulos, Lazaros
    Soudris, Dimitrios
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (05) : 975 - 979
  • [30] Exploration of distributed shared memory architectures for NoC-based multiprocessors
    Monchiero, Matteo
    Palermo, Gianluca
    Silvano, Cristina
    Villa, Oreste
    2006 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION, PROCEEDINGS, 2006, : 144 - +