Predecessor/successor approach for high-performance run-time wavefront scheduling

Cited by: 1
Authors
Huang, TC [1]
Hsu, PH
Affiliations
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
[2] Cheng Shiu Inst Technol, Dept Elect Engn, Kaohsiung 833, Taiwan
Keywords
parallelizing compiler; dependence chain; loop parallelization; inspector/executor; wavefront scheduling
DOI
10.1016/j.ins.2005.01.003
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Most scientific applications rely on parallel multiprocessor computing to enhance performance. However, the irregular loops within these applications obstruct parallelism analysis at compile time. Rauchwerger et al. presented a run-time method that extracts the hidden parallelism in a program using dependence chains. The overhead of that approach, caused by its large storage requirement and the processing of a huge number of array references, degrades its performance. In this study, a new predecessor/successor approach is developed in which high-level predecessor/successor information is recorded and processed efficiently. A predecessor/successor table is constructed first, in the inspector phase, so that during wavefront scheduling only the successor iterations of the current wavefront need to be examined instead of all loop iterations. The performance of the dependence chain approach usually degrades dramatically for a hot-spot access pattern, but our scheme works very efficiently in this case. Experimental results using synthetic code and real programs are presented to demonstrate the superiority of the proposed approach. © 2005 Elsevier Inc. All rights reserved.
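The inspector and wavefront-scheduling steps described in the abstract can be illustrated roughly as follows. This is a minimal sketch of the general idea only, not the authors' implementation: it assumes a loop in which iteration i reads one array element `reads[i]` and then writes one element `writes[i]`, and the names `build_ps_table` and `wavefronts` are hypothetical.

```python
from collections import defaultdict


def build_ps_table(reads, writes):
    """Inspector (assumed form): return (pred_count, succ) for a loop where
    iteration i reads element reads[i] and then writes element writes[i]."""
    n = len(writes)
    succ = [set() for _ in range(n)]
    last_writer = {}                        # element -> last iteration that wrote it
    readers_since_write = defaultdict(list)  # element -> readers since that write

    def add_edge(src, dst):
        if src != dst:                      # an iteration never depends on itself
            succ[src].add(dst)

    for i in range(n):
        r, w = reads[i], writes[i]
        if r in last_writer:                # flow dependence (read after write)
            add_edge(last_writer[r], i)
        readers_since_write[r].append(i)
        if w in last_writer:                # output dependence (write after write)
            add_edge(last_writer[w], i)
        for j in readers_since_write[w]:    # anti dependence (write after read)
            add_edge(j, i)
        last_writer[w] = i
        readers_since_write[w] = []

    pred_count = [0] * n
    for targets in succ:
        for j in targets:
            pred_count[j] += 1
    return pred_count, [sorted(t) for t in succ]


def wavefronts(pred_count, succ):
    """Scheduler: only successors of the current wavefront are examined."""
    remaining = list(pred_count)
    current = [i for i, c in enumerate(remaining) if c == 0]
    schedule = []
    while current:
        schedule.append(current)
        nxt = []
        for i in current:
            for j in succ[i]:
                remaining[j] -= 1
                if remaining[j] == 0:       # all predecessors are scheduled
                    nxt.append(j)
        current = sorted(nxt)
    return schedule


pred, succ = build_ps_table([0, 1, 2, 1, 3], [1, 2, 3, 4, 1])
print(wavefronts(pred, succ))  # [[0], [1, 3], [2], [4]]
```

In this sketch the scheduler never rescans the whole iteration space: each dependence edge is visited once, when its source iteration is placed in a wavefront. With a read-only hot spot (every iteration reading the same element and writing a distinct one), no edges are created and all iterations fall into a single wavefront.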
Pages: 845-860
Page count: 16