Improving compiler and run-time support for irregular reductions using local writes

被引:1
|
作者
Han, HS [1 ]
Tseng, CW [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
D O I
10.1007/3-540-48319-5_12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current compilers for distributed-memory multiprocessors parallelize irregular reductions either by generating calls to sophisticated run-time systems (CHAOS) or by relying on replicated buffers and the shared-memory interface supported by software DSMs (TreadMarks). We introduce LOCALWRITE, a new technique for parallelizing irregular reductions based on the owner-computes rule. It eliminates the need for buffers or synchronized writes, but may replicate computation. We investigate the impact of connectivity (node/edge ratio), locality (accesses to local data) and adaptivity (edge modifications) on their relative performance. LOCALWRITE improves performance by 50-150% compared to using replicated buffers, and can match or exceed gather/scatter for applications with low locality or high adaptivity.
引用
收藏
页码:181 / 196
页数:16
相关论文
共 50 条
  • [1] Efficient compiler and run-time support for parallel irregular reductions
    Han, H
    Tseng, CW
    [J]. PARALLEL COMPUTING, 2000, 26 (13-14) : 1861 - 1887
  • [2] Improving compiler and run-time support for adaptive irregular codes
    Han, HS
    Tseng, CW
    [J]. 1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 393 - 400
  • [3] Compiler and run-time support for exploiting regularity within irregular applications
    Lain, A
    Chakrabarti, DR
    Banerjee, P
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (02) : 119 - 135
  • [4] Run-time characterization of irregular accesses applied to parallelization of irregular reductions
    Singh, DE
    Rivera, FF
    Martín, MJ
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2001, : 17 - 22
  • [5] Techniques to provide run-time support for solving irregular problems
    No, J
    Choudhary, A
    [J]. 1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 73 - 80
  • [6] Adaptive Parallel Matrix Computing through Compiler and Run-time Support
    Buenabad-Chavez, Jorge
    Alfonso Castro-Garcia, Miguel
    Angelica Rosales-Camacho, Rosa
    Dominguez-Dominguez, Santiago
    Peralta, Julio C.
    Aguilar-Cornejo, Manuel
    [J]. PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 359 - 368
  • [7] Efficient run-time support for irregular block-structured applications
    Fink, SJ
    Baden, SB
    Kohn, SR
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1998, 50 (1-2) : 61 - 82
  • [8] Efficient run-time support for irregular task computations with mixed granularities
    Fu, C
    Yang, T
    [J]. 10TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM - PROCEEDINGS OF IPPS '96, 1996, : 823 - 830
  • [9] Interprocedural Compiler Optimization for Partial Run-Time Reconfiguration
    Elena Moscu Panainte
    Koen Bertels
    Stamatis Vassiliadis
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2006, 43 : 161 - 172
  • [10] A Compiler and Run-time System for Network Programming Languages
    Monsanto, Christopher
    Foster, Nate
    Harrison, Rob
    Walker, David
    [J]. POPL 12: PROCEEDINGS OF THE 39TH ANNUAL ACM SIGPLAN-SIGACT SYMPOSIUM ON PRINCIPLES OF PROGRAMMING LANGUAGES, 2012, : 217 - 230