Performance oriented programming for NUMA architectures

被引:0
|
作者
Chapman, B [1 ]
Patil, A
Prabhakar, A
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
[2] NASA, Langley Res Ctr, ICASE, Hampton, VA 23665 USA
关键词
shared memory parallelism; parallel programming models; OpenMP; ccNUMA Architectures; restructuring; data locality; data distribution; Software Distributed Shared Memory;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, it is hard to obtain high performance on such systems, particularly when large numbers of threads are involved. Moreover, it is applicable to NUMA machines only if a software DSM system is present. In this paper, we discuss: various ways in which OpenMP may be used on ccNUMA and NUMA architectures, and evaluate several programming styles on the SGI Origin 2000, and on TreadMarks, a Software Distributed Shared Memory System from Rice University. These results have encouraged us to begin work on a compiler that accepts an extended OpenMP and translates such code to an equivalent version that provides superior performance on both of these platforms.
引用
收藏
页码:137 / 154
页数:18
相关论文
共 50 条
  • [1] Performance analysis of four parallel programming models on NUMA architectures
    Mohamed, AS
    Cantonnet, F
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2003, : 119 - 125
  • [2] On the performance of BWA on NUMA architectures
    Lenis, Josefina
    Senar, Miquel Angel
    [J]. 2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 3, 2015, : 236 - 241
  • [3] Task-Parallel Programming on NUMA Architectures
    Terboven, Christian
    Schmidl, Dirk
    Cramer, Tim
    Mey, Dieter An
    [J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 638 - 649
  • [4] Performance and availability evaluation of NUMA architectures
    Geist, R
    Westall, J
    [J]. IEEE INTERNATIONAL COMPUTER PERFORMANCE AND DEPENDABILITY SYMPOSIUM - IPDS'96, PROCEEDINGS, 1996, : 271 - 280
  • [5] Performance bounds for modeling NUMA architectures
    Geist, R
    [J]. INFORMATION PROCESSING LETTERS, 1997, 63 (02) : 113 - 117
  • [6] A hybrid tool for the performance evaluation of NUMA architectures
    Westall, J
    Geist, R
    [J]. PROCEEDINGS OF THE 1997 WINTER SIMULATION CONFERENCE, 1997, : 1029 - 1036
  • [7] A Tool to Analyze the Performance of Multithreaded Programs on NUMA Architectures
    Liu, Xu
    Mellor-Crummey, John
    [J]. ACM SIGPLAN NOTICES, 2014, 49 (08) : 259 - 271
  • [8] Optimizing operating system performance for CC-NUMA architectures
    Chang, MS
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2003, 15 (14): : 1257 - 1274
  • [9] ARCHITECTURES, PROGRAMMING AND PERFORMANCE OF SUPERCOMPUTERS
    VOLKERT, J
    [J]. KERNTECHNIK, 1988, 52 (02) : 112 - 119
  • [10] Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures
    Catalan, Sandra
    Igual, Francisco D.
    Herrero, Jose R.
    Rodriguez-Sanchez, Rafael
    Quintana-Orti, Enrique S.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 175 : 51 - 65