SUPPLE: An efficient run-time support for non-uniform parallel loops

被引:0
|
作者
Orlando, S
Perego, R
机构
[1] CNR, CNUCE, I-56126 Pisa, Italy
[2] Univ Ca Foscari Venezia, Dipartimento Matemat Applicata & Informat, I-30173 Venezia Mestre, Italy
关键词
data parallelism; parallel loop scheduling; load balancing; run-time supports; compiler optimizations;
D O I
10.1016/S1383-7621(98)00071-X
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents SUPPLE (SUPort for Parallel Loop Execution), an innovative run-time support for the execution of parallel loops with regular stencil data references and non-uniform iteration costs. SUPPLE relies upon a static block data distribution to exploit locality, and combines static and dynamic policies for scheduling non-uniform iterations. It adopts, as far as possible, a static scheduling policy derived from the owner computes rule, and moves data and iterations among processors only if a load imbalance actually occurs. SUPPLE always tries to overlap communications with useful computations by reordering loop iterations and prefetching remote ones in the case of workload imbalance. The SUPPLE approach has been validated by many experimental results obtained by running a multidimensional flame simulation kernel on a 64-node Gray T3D. We have fed the benchmark code with several synthetic input data sets built on the basis of a load imbalance model. We have compared our results with those obtained with a CRAFT Fortran implementation of the benchmark. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1323 / 1343
页数:21
相关论文
共 50 条
  • [1] Efficient run-time scheduling for parallelizing partially parallel loops
    Huang, TC
    Hsu, PH
    Sheng, TN
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1998, 14 (01) : 255 - 264
  • [2] Run-time parallelization for partially parallel loops
    Yang, CT
    Tseng, SS
    Kao, SH
    Hsieh, MH
    Jiang, MF
    [J]. 1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 308 - 313
  • [3] Efficient compiler and run-time support for parallel irregular reductions
    Han, H
    Tseng, CW
    [J]. PARALLEL COMPUTING, 2000, 26 (13-14) : 1861 - 1887
  • [4] Efficient run-time parallelization for DO loops
    Yang, CT
    Tseng, SS
    Hsieh, MH
    Kao, SH
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1998, 14 (01) : 237 - 253
  • [5] Run-time Non-uniform Quantization for Dynamic Neural Networks in Wireless Communication
    Allwin, Priscilla Sharon
    Gomony, Manil Dev
    Geilen, Marc
    [J]. 29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 915 - 920
  • [6] A support for non-uniform parallel loops and its application to a flame simulation code
    Orlando, S
    Perego, R
    [J]. SOLVING IRREGULARLY STRUCTURED PROBLEMS IN PARALLEL, 1997, 1253 : 186 - 197
  • [7] RUN-TIME PARALLELIZATION AND SCHEDULING OF LOOPS
    SALTZ, JH
    MIRCHANDANEY, R
    BAXTER, D
    [J]. SPAA 89: PROCEEDINGS OF THE 1989 ACM SYMPOSIUM ON PARALLEL ALGORITHMS AND ARCHITECTURES, 1989, : 303 - 312
  • [8] RUN-TIME PARALLELIZATION AND SCHEDULING OF LOOPS
    SALTZ, JH
    MIRCHANDANEY, R
    CROWLEY, K
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1991, 40 (05) : 603 - 612
  • [9] Run-time support for scheduling parallel applications in heterogeneous NOWs
    Weissman, JB
    Zhao, X
    [J]. SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, : 347 - 355
  • [10] Efficient run-time scheduling for parallelizing partially parallel loop
    Huang, TC
    Hsu, PH
    Sheng, TN
    [J]. ICA(3)PP 97 - 1997 3RD INTERNATIONAL CONFERENCE ON ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, 1997, : 397 - 403