OpenMP-oriented applications for distributed shared memory architectures

Cited by: 6
Authors
Marowka, A [1]
Liu, ZY [1]
Chapman, B [1]
Affiliation
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
Source
Keywords
OpenMP; data locality; NAS parallel benchmarks; programming model
DOI
10.1002/cpe.752
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
The rapid rise of OpenMP as the preferred parallel programming paradigm for small-to-medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model-of-choice for large scale high-performance parallel computing in the coming decade. The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc-NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance. This paper presents a simple software programming approach called copy-inside-copy-back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automatic and implicit approaches. Moreover, the CC approach improves on the OpenMP-SPMD style of programming that makes the development process of an OpenMP application more structured and simpler. The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse-grained parallelism, although a fast copy mechanism is essential. Copyright (C) 2004 John Wiley & Sons, Ltd.
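The abstract describes the copy-inside-copy-back idea only in prose. The following is a minimal sketch of how such a pattern might look in C with OpenMP. It is an illustration written for this record, not the authors' code. The function name `cc_scale`, the block partitioning, and the use of a per-thread `malloc` buffer as the private copy are all assumptions. The sketch also assumes a system where memory first touched by a thread is placed near that thread, which is what would make the private copy local on a cc-NUMA machine.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Hypothetical helper (not from the paper): multiply every element of
 * `shared` by `factor` using a copy-in / compute / copy-back pattern.
 * Returns 0 on success, -1 if any thread failed to allocate its buffer. */
int cc_scale(double *shared, int n, double factor)
{
    int failed = 0;
    assert(shared != NULL && n >= 0);

    #pragma omp parallel reduction(|:failed)
    {
        int nthreads = 1, tid = 0;
#ifdef _OPENMP
        nthreads = omp_get_num_threads();
        tid = omp_get_thread_num();
#endif
        /* SPMD-style block partition: each thread owns [lo, hi). */
        int chunk = (n + nthreads - 1) / nthreads;
        int lo = tid * chunk;
        int hi = lo + chunk;
        if (lo > n) lo = n;
        if (hi > n) hi = n;
        int len = hi - lo;

        if (len > 0) {
            /* Private buffer, allocated and first touched by this thread. */
            double *local = malloc((size_t)len * sizeof *local);
            if (local == NULL) {
                failed = 1;
            } else {
                /* Copy inside: shared data -> private copy. */
                memcpy(local, shared + lo, (size_t)len * sizeof *local);

                /* Compute only on the private copy. */
                for (int i = 0; i < len; i++)
                    local[i] *= factor;

                /* Copy back: private copy -> shared data. */
                memcpy(shared + lo, local, (size_t)len * sizeof *local);
                free(local);
            }
        }
    }
    return failed ? -1 : 0;
}
```

Calling `cc_scale(a, n, 2.0)` doubles the `n` elements of `a` in place. Compiled without OpenMP support, the pragma is ignored and the function runs on a single thread with the same result. For a single cheap pass like this one the two copies would likely cost more than they save. The pattern is plausible only when each thread reuses its private copy many times, which is consistent with the abstract's remarks on coarse-grained parallelism and the need for a fast copy mechanism.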
Pages: 371-384
Page count: 14