Compiler and run-time support for exploiting regularity within irregular applications

被引:1
|
作者
Lain, A
Chakrabarti, DR
Banerjee, P
机构
[1] Hewlett Packard Corp, Bristol BS34 8QZ, Avon, England
[2] Northwestern Univ, Ctr Parallel & Distributed Comp, ECE Dept, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
irregular applications; iterative; runtime support; compiler support; distributed memory multicomputers; runtime compilation;
D O I
10.1109/71.841749
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper starts from a well-known idea. that structure in irregular problems improves sequential performance, and tries to show that the same structure can also be exploited for parallelization of irregular problems on a distributed-memory multicomputer. In particular, we extend a well-known parallelization technique called run-time compilation to use structure information that is explicit on the array subscripts. This paper presents a number of internal representations suited to particular access patterns and shows how various preprocessing structures such as translation tables, trace arrays, and interprocessor communication schedules can be encoded in terms of one or more of these representations. We show how loop and index normalization are important for detection of irregularity in array references, as well as the presence of locality in such references. This paper presents methods for detection of irregularity, feasibility of inspection, and finally, placement of inspectors and interprocessor communication schedules. We show that this process can be automated through extensions to an HPF/Fortran-77 distributed-memory compiler (PARADIGM) and a new runtime support for irregular problems (PILAR) that uses a variety of internal representations of communication patterns. We devise performance measures which consider the relationship between the inspection cost, the execution cost, and the number of times the executor is invoked so that a comparison of the competing schemes can be performed independent of the number of iterations. Finally, we show experimental results on an IBM SP-2 that validate our approach. These results show that dramatic improvements in both memory requirements and execution time can be achieved by using these techniques.
引用
收藏
页码:119 / 135
页数:17
相关论文
共 50 条
  • [1] Efficient compiler and run-time support for parallel irregular reductions
    Han, H
    Tseng, CW
    [J]. PARALLEL COMPUTING, 2000, 26 (13-14) : 1861 - 1887
  • [2] Improving compiler and run-time support for adaptive irregular codes
    Han, HS
    Tseng, CW
    [J]. 1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 393 - 400
  • [3] Improving compiler and run-time support for irregular reductions using local writes
    Han, HS
    Tseng, CW
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 1999, 1656 : 181 - 196
  • [4] Exploiting locality in the run-time parallelization of irregular loops
    Martín, MJ
    Singh, DE
    Touriño, J
    Rivera, FF
    [J]. 2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDING, 2002, : 27 - 34
  • [5] Efficient run-time support for irregular block-structured applications
    Fink, SJ
    Baden, SB
    Kohn, SR
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1998, 50 (1-2) : 61 - 82
  • [6] Techniques to provide run-time support for solving irregular problems
    No, J
    Choudhary, A
    [J]. 1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 73 - 80
  • [7] Adaptive Parallel Matrix Computing through Compiler and Run-time Support
    Buenabad-Chavez, Jorge
    Alfonso Castro-Garcia, Miguel
    Angelica Rosales-Camacho, Rosa
    Dominguez-Dominguez, Santiago
    Peralta, Julio C.
    Aguilar-Cornejo, Manuel
    [J]. PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 359 - 368
  • [8] Run-time library for parallel I/O for irregular applications
    No, J
    Choudhary, A
    [J]. PARALLEL COMPUTING: FUNDAMENTALS, APPLICATIONS AND NEW DIRECTIONS, 1998, 12 : 437 - 440
  • [9] Run-time techniques for exploiting irregular task parallelism on distributed memory architectures
    Fu, C
    Yang, T
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 42 (02) : 143 - 156
  • [10] Efficient run-time support for irregular task computations with mixed granularities
    Fu, C
    Yang, T
    [J]. 10TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM - PROCEEDINGS OF IPPS '96, 1996, : 823 - 830