Performance oriented programming for NUMA architectures

被引:0
|
作者
Chapman, B [1 ]
Patil, A
Prabhakar, A
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
[2] NASA, Langley Res Ctr, ICASE, Hampton, VA 23665 USA
关键词
shared memory parallelism; parallel programming models; OpenMP; ccNUMA Architectures; restructuring; data locality; data distribution; Software Distributed Shared Memory;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, it is hard to obtain high performance on such systems, particularly when large numbers of threads are involved. Moreover, it is applicable to NUMA machines only if a software DSM system is present. In this paper, we discuss: various ways in which OpenMP may be used on ccNUMA and NUMA architectures, and evaluate several programming styles on the SGI Origin 2000, and on TreadMarks, a Software Distributed Shared Memory System from Rice University. These results have encouraged us to begin work on a compiler that accepts an extended OpenMP and translates such code to an equivalent version that provides superior performance on both of these platforms.
引用
收藏
页码:137 / 154
页数:18
相关论文
共 50 条
  • [41] Achieving High Performance with TCP over 40GbE on NUMA Architectures for CMS Data Acquisition
    Bawej, Tomasz
    Behrens, Ulf
    Branson, James
    Chaze, Olivier
    Cittolin, Sergio
    Darlea, Georgiana-Lavinia
    Deldicque, Christian
    Dobson, Marc
    Dupont, Aymeric
    Erhan, Samim
    Forrest, Andrew
    Gigi, Dominique
    Glege, Frank
    Gomez-Ceballos, Guillelmo
    Gomez-Reino, Robert
    Hegeman, Jeroen
    Holzner, Andre
    Masetti, Lorenzo
    Meijers, Frans
    Meschi, Emilio
    Mommsen, Remigius K.
    Morovic, Srecko
    Nunez-Barranco-Fernandez, Carlos
    O'Dell, Vivian
    Orsini, Luciano
    Paus, Christoph
    Petrucci, Andrea
    Pieri, Marco
    Racz, Attila
    Sakulin, Hannes
    Schwick, Christoph
    Stieger, Benjamin
    Sumorok, Konstanty
    Veverka, Jan
    Wakefield, Christopher C.
    Zejdl, Petr
    [J]. 2014 19TH IEEE-NPSS REAL TIME CONFERENCE (RT), 2014,
  • [42] On the Performance and Energy Efficiency of the PGAS Programming Model on Multicore Architectures
    Lagraviere, Jeremie
    Langguth, Johannes
    Sourouri, Mohammed
    Ha, Phuong H.
    Cai, Xing
    [J]. 2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), 2016, : 800 - 807
  • [43] Analysis of Security and Performance Aspects in Service-Oriented Architectures
    Rodrigues, Douglas
    Estrella, Julio C.
    Branco, Kalinka R. L. J. C.
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2011, 5 (01): : 13 - 30
  • [44] PERFORMANCE-ORIENTED DESIGN OF FAULT TOLERANT CACHE ARCHITECTURES
    Novac Ovidiu Constantin
    Vidiu-Kakas Stefan
    Novac Cornelia Mihaela
    Gordan Cornelia Emilia
    [J]. ANNALS OF DAAAM FOR 2012 & PROCEEDINGS OF THE 23RD INTERNATIONAL DAAAM SYMPOSIUM - INTELLIGENT MANUFACTURING AND AUTOMATION - FOCUS ON SUSTAINABILITY, 2012, 23 : 627 - 630
  • [45] Performance annotated business processes in service-oriented architectures
    Rud, Dmytro
    Schmietendorf, Andreas
    Dumke, Reiner
    [J]. International Journal of Simulation: Systems, Science and Technology, 2007, 8 (03): : 61 - 71
  • [46] StReAm:: Object-oriented programming of stream architectures using PAM-Blox
    Mencer, O
    Hübert, H
    Morf, M
    Flynn, MJ
    [J]. 2000 IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2000, : 309 - 310
  • [47] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
    Zhang, Kaifang
    Su, Huayou
    Dou, Yong
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (11): : 13584 - 13600
  • [48] Case Studies on the Impact and Challenges of Heterogeneous NUMA Architectures for HPC
    Zaourar, Lilia
    Benazouz, Mohamed
    Mouhagir, Ayoub
    Falquez, Carlos
    Portero, Antoni
    Ho, Nam
    Suarez, Estela
    Petrakis, Polydoros
    Marazakis, Manolis
    Sgherzi, Francesco
    Fernandez, Ivan
    Dolbeau, Romain
    Pleiter, Dirk
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2024, 2024, 14842 : 251 - 265
  • [49] Resolving Load Balancing Issues in BWA on NUMA Multicore Architectures
    Herzeel, Charlotte
    Ashby, Thomas J.
    Costanza, Pascal
    De Meuter, Wolfgang
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 227 - 236
  • [50] Optimising MPI tree-based communication for NUMA architectures
    Karlsson, Christer
    Chen, Zizhong
    [J]. International Journal of Autonomous and Adaptive Communications Systems, 2015, 8 (04) : 407 - 423