OpenMP-oriented applications for distributed shared memory architectures

被引:6
|
作者
Marowka, A [1 ]
Liu, ZY [1 ]
Chapman, B [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
来源
关键词
OpenMP; data locality; NAS parallel benchmarks; programming model;
D O I
10.1002/cpe.752
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The rapid rise of OpenMP as the preferred parallel programming paradigm for small-to-medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model-of-choice for large scale high-performance parallel computing in the coming decade. The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc-NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance. This paper presents a simple software programming approach called copy-inside-copy-back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automat and implicit approaches. Moreover, the CC approach improves on the OpenMP-SPMD style of programming that makes the development process of an OpenMP application more structured and simpler. The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse-grained parallelism, although a fast copy mechanism is essential. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:371 / 384
页数:14
相关论文
共 50 条
  • [1] Parallel PIC codes for distributed and shared memory architectures with HPF and OpenMP
    Di Martino, B
    Briguglio, S
    Fogaccia, G
    Vlad, G
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2233 - 2239
  • [2] OpenMP compiler for distributed memory architectures
    WANG Jue
    Science China(Information Sciences), 2010, 53 (05) : 932 - 944
  • [3] OpenMP compiler for distributed memory architectures
    Jue Wang
    ChangJun Hu
    JiLin Zhang
    JianJiang Li
    Science China Information Sciences, 2010, 53 : 932 - 944
  • [4] OpenMP compiler for distributed memory architectures
    Wang Jue
    Hu ChangJun
    Zhang JiLin
    Li JianJiang
    SCIENCE CHINA-INFORMATION SCIENCES, 2010, 53 (05) : 932 - 944
  • [5] MPI plus OpenMP implementation of memory-saving parallel PIC applications on hierarchical distributed-shared memory architectures
    Briguglio, S
    Fogaccia, G
    Vlad, G
    Di Martino, B
    MODELLING AND SIMULATION 2003, 2003, : 247 - 251
  • [6] Coding Productivity in MapReduce Applications for Distributed and Shared Memory Architectures
    Adornes, Daniel
    Griebler, Dalvan
    Ledur, Cleverson
    Fernandes, Luiz Gustavo
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2015, 25 (9-10) : 1739 - 1741
  • [7] Characterization of OpenMP applications on the InfiniBand-based distributed virtual shared memory system
    Park, I
    Kim, SW
    Park, K
    HIGH PERFORMANCE COMPUTING - HIPC 2004, 2004, 3296 : 430 - 439
  • [8] Study of OpenMP applications on the InfiniBand-based software distributed shared-memory system
    Park, I
    Kim, SW
    PARALLEL COMPUTING, 2005, 31 (10-12) : 1099 - 1113
  • [9] Optimizing OpenMP Programs on Software Distributed Shared Memory Systems
    Seung-Jai Min
    Ayon Basumallik
    Rudolf Eigenmann
    International Journal of Parallel Programming, 2003, 31 : 225 - 249
  • [10] A fully compliant OpenMP implementation on software distributed shared memory
    Karlsson, S
    Lee, SW
    Brorsson, M
    HIGH PERFORMANCE COMPUTING - HIPC 2002, PROCEEDINGS, 2002, 2552 : 195 - 206