Optimizing UPC programs for multi-core systems

Cited by: 0
Authors
Zheng, Yili [1]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, Berkeley, CA 94720 USA
Keywords
UPC; PGAS
DOI
10.1155/2010/646829
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
The Partitioned Global Address Space (PGAS) model of Unified Parallel C (UPC) can help users express and manage application data locality on non-uniform memory access (NUMA) multi-core shared-memory systems to achieve good performance. First, we describe several UPC program optimization techniques that are important for achieving good performance on NUMA multi-core computers, illustrated with examples and quantitative performance results. Second, we use two numerical computing kernels, parallel matrix-matrix multiplication and parallel 3-D FFT, to demonstrate end-to-end development and optimization of UPC applications. Our results show that the optimized UPC programs achieve very good, scalable performance on current multi-core systems and can even outperform vendor-optimized libraries in some cases.
Pages: 183-191
Page count: 9