Performance oriented programming for NUMA architectures

Cited by: 0
Authors
Chapman, B [1]
Patil, A
Prabhakar, A
Affiliations
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
[2] NASA, Langley Res Ctr, ICASE, Hampton, VA 23665 USA
Keywords
shared memory parallelism; parallel programming models; OpenMP; ccNUMA architectures; restructuring; data locality; data distribution; software distributed shared memory
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, it is hard to obtain high performance on such systems, particularly when large numbers of threads are involved. Moreover, it is applicable to NUMA machines only if a software DSM system is present. In this paper, we discuss various ways in which OpenMP may be used on ccNUMA and NUMA architectures, and we evaluate several programming styles on the SGI Origin 2000 and on TreadMarks, a software distributed shared memory system from Rice University. These results have encouraged us to begin work on a compiler that accepts an extended OpenMP and translates such code into an equivalent version that provides superior performance on both of these platforms.
Pages: 137 - 154
Page count: 18