Predecessor/successor approach for high-performance run-time wavefront scheduling

被引:1
|
作者
Huang, TC [1 ]
Hsu, PH
机构
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
[2] Cheng Shiu Inst Technol, Dept Elect Engn, Kaohsiung 833, Taiwan
关键词
parallelizing compiler; dependence chain; loop parallelization; inspector/executor; wavefront scheduling;
D O I
10.1016/j.ins.2005.01.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most scientific applications rely on parallel Multiprocessor computing to enhance Performance. However, the irregular loops within these applications obstruct the parallefism analysis at compile-time. Rauchwerger et al. presented a run-time method to extract the hidden parallelism in a program using dependence chains. The relative overhead degrades this approach's performance due to the mass storage requirement and huge array reference processing. In this Study, a new predecessor/successor approach is developed in which high-level predecessor/successor information is recorded and processed efficiently. A predecessor/successor table is constructed first in the inspector phase so that only the successor iterations in the current wavefront need to be examined, instead of the entire loop iterations during wavefront scheduling. Usually, the performance of dependence chain approach degrades dramatically for a hot-spot access pattern, but Our scheme works very efficiently in this case. The experimental results using synthetic code and real programs are presented to prove the superiority of the proposed approach. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:845 / 860
页数:16
相关论文
共 50 条
  • [41] Run-Time Reference Clustering for cache performance optimization
    Kaplow, WK
    Szymanski, BK
    Tannenbaum, P
    Viktor, K
    [J]. SECOND AIZU INTERNATIONAL SYMPOSIUM ON PARALLEL ALGORITHMS/ARCHITECTURE SYNTHESIS, PROCEEDINGS, 1997, : 42 - 49
  • [42] A Performance Model for Run-Time Reconfigurable Hardware Accelerator
    Wang, Gang
    Chen, Du
    Chen, Jian
    Ma, Jianliang
    Chen, Tianzhou
    [J]. ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2009, 5737 : 54 - 66
  • [43] RUN-TIME BEHAVIOR OF SASL PROGRAMS - A PERFORMANCE STUDY
    SARWAR, SM
    [J]. COMPUTER LANGUAGES, 1993, 19 (03): : 205 - 211
  • [44] Run-time versus compile-time instruction scheduling in superscalar (RISC) processors: Performance and trade-off
    Leung, A
    Palem, KV
    Ungureanu, C
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 45 (01) : 13 - 28
  • [45] Cluster scheduling for real-time systems: utilization bounds and run-time overhead
    Qi, Xuan
    Zhu, Dakai
    Aydin, Hakan
    [J]. REAL-TIME SYSTEMS, 2011, 47 (03) : 253 - 284
  • [46] Cluster scheduling for real-time systems: utilization bounds and run-time overhead
    Xuan Qi
    Dakai Zhu
    Hakan Aydin
    [J]. Real-Time Systems, 2011, 47 : 253 - 284
  • [47] ON OFF FAN CLUTCH FOR HIGH RUN-TIME APPLICATIONS
    不详
    [J]. DIESEL PROGRESS ENGINES & DRIVES, 1994, 60 (10): : 68 - 68
  • [48] The Interplay of Compile-time and Run-time Options for Performance Prediction
    Lesoil, Luc
    Acher, Mathieu
    Ternava, Xhevahire
    Blouin, Arnaud
    Jezequel, Jean-Marc
    [J]. SPLC '21: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE, VOL A, 2021,
  • [49] Run-time monitoring for dependable systems: an approach and a case study
    Rota, SR
    de Almeida, JR
    [J]. 23RD IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2004, : 41 - 49
  • [50] An incremental and practical approach to enable the component run-time evolution
    Sun, CA
    [J]. 12TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2005, : 603 - 610