Speculative parallel threading architecture and compilation

被引:0
|
作者
Li, XF [1 ]
Du, ZH [1 ]
Yang, C [1 ]
Lim, CC [1 ]
Ngai, TF [1 ]
机构
[1] Intel China Res Ctr, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Thread-level speculation is a technique that brings thread-level parallelism beyond the data flow limit by executing a piece of code ahead of time speculatively before all its input data are ready. This technique appears particularly appealing for speeding up hard-to-parallelize applications. Although various thread-level speculation architectures and compilation techniques have been proposed by the research community, scalar applications remain difficult to be parallelized. It has not yet shown how well these applications can actually be benefited from thread-level speculation and if the performance gain is significant enough to justify the required hardware support. In an attempt to understand and realize the potential gain with thread-level speculation especially for scalar applications, we proposed an SPT (Speculative Parallel Threading) architecture and developed an SPT compiler to generate optimal speculatively parallelized code. Our evaluation showed that with our SPT approach 10 SPECint2000 programs can achieve an average of 15.6% speedup on a two-core SPT processor by exploiting only loop parallelism. This paper describes the SPT architecture and the SPT compiler which performs aggressive cost-driven loop selection and transformation, and presents our performance evaluation results.
引用
收藏
页码:285 / 294
页数:10
相关论文
共 50 条
  • [31] PARALLEL CONTEXT-SENSITIVE COMPILATION
    ASTHAGIRI, CR
    POTTER, JL
    SOFTWARE-PRACTICE & EXPERIENCE, 1994, 24 (09): : 801 - 822
  • [32] Multi-threading parallel reinforcement learning
    Fu, Qiming
    Kang, Yiyi
    Gao, Zhen
    Wu, Hongjie
    Hu, Fuyuan
    Chen, Jianping
    Zhong, Shan
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2019, 61 (04) : 278 - 286
  • [33] Protein threading using parallel evolution strategy
    Islam, Rafiqul
    Ngom, Alioune
    2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 2332 - 2339
  • [34] Compilation to parallel programs from constraints
    John, A
    Browne, JC
    3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 1996, : 73 - 79
  • [35] A compilation framework for a dynamically reconfigurable architecture
    David, R
    Chillet, D
    Pillement, S
    Sentieys, O
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 1058 - 1067
  • [36] Parallel Lisp compilation for distributed systems
    Feng, MD
    Yuen, CK
    AUSTRALIAN COMPUTER JOURNAL, 1995, 27 (03): : 77 - 91
  • [37] iThreads: A Threading Library for Parallel Incremental Computation
    Bhatotia, Pramod
    Fonseca, Pedro
    Acar, Umut A.
    Brandenburg, Bjoern B.
    Rodrigues, Rodrigo
    ACM SIGPLAN NOTICES, 2015, 50 (04) : 645 - 659
  • [38] PARALLEL ONE-PASS COMPILATION
    KLEIN, E
    KOSKIMIES, K
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 461 : 76 - 90
  • [39] OPTIMIZING VHDL COMPILATION FOR PARALLEL SIMULATION
    WILLIS, JC
    SIEWIOREK, DP
    IEEE DESIGN & TEST OF COMPUTERS, 1992, 9 (03): : 42 - 53
  • [40] Flexible constraint compilation in the parity architecture
    ter Hoeven, Roeland
    Messinger, Anette
    Lechner, Wolfgang
    PHYSICAL REVIEW A, 2023, 108 (04)