Globally Precise-restartable Execution of Parallel Programs

被引:0
|
作者
Gupta, Gagan [1 ]
Sridharan, Srinath [1 ]
Sohi, Gurindar S. [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
关键词
Design; Experimentation; Measurement; Performance; Reliability; Deterministic Multithreading; Precise Exceptions; ROLLBACK-RECOVERY; COST; SAFE;
D O I
10.1145/2666356.2594306
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Emerging trends in computer design and use are likely to make exceptions, once rare, the norm, especially as the system size grows. Due to exceptions, arising from hardware faults, approximate computing, dynamic resource management, etc., successful and error-free execution of programs may no longer be assured. Yet, designers will want to tolerate the exceptions so that the programs execute completely, efficiently and without external intervention. Modern computers easily handle exceptions in sequential programs, using precise interrupts. But they are ill-equipped to handle exceptions in parallel programs, which are growing in prevalence. In this work we introduce the notion of globally precise-restartable execution of parallel programs, analogous to precise-interruptible execution of sequential programs. We present a software runtime recovery system based on the approach to handle exceptions in suitably-written parallel programs. Qualitative and quantitative analyses show that the proposed system scales with the system size, especially when exceptions are frequent, unlike the conventional checkpoint-and-recovery method.
引用
收藏
页码:181 / 192
页数:12
相关论文
共 50 条
  • [1] Efficient, Precise-Restartable Program Execution on Future Multicores
    Gupta, Gagan
    Sridharan, Srinath
    Sohi, Gurindar S.
    [J]. 2012 IEEE HOT CHIPS 24 SYMPOSIUM (HCS), 2012,
  • [2] Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs
    Hayashi, Akihiro
    Grossman, Max
    Zhao, Jisheng
    Shirako, Jun
    Sarkar, Vivek
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2013, 2014, 8664 : 342 - 356
  • [3] Adaptive, Efficient, Parallel Execution of Parallel Programs
    Sridharan, Srinath
    Gupta, Gagan
    Sohi, Gurindar S.
    [J]. ACM SIGPLAN NOTICES, 2014, 49 (06) : 169 - 180
  • [4] Execution replay of parallel procedural programs
    de Kergommeaux, JC
    Fagot, A
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (10) : 835 - 849
  • [5] Compiling lisp programs for parallel execution
    Larus, James R.
    [J]. LISP and Symbolic Computation, 1991, 4 (01):
  • [6] A PARALLEL EXECUTION MODEL OF LOGIC PROGRAMS
    CHEN, AC
    WU, CL
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1991, 2 (01) : 79 - 92
  • [7] Visualizing parallel execution of FORTRAN programs
    [J]. Szelenyi, F., 1600, (35): : 1 - 2
  • [8] Parallel execution of prolog programs: A survey
    Gupta, G
    Pontelli, E
    Ali, KAM
    Carlsson, M
    Hermenegildo, MV
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2001, 23 (04): : 472 - 602
  • [9] VISUALIZING PARALLEL EXECUTION OF FORTRAN PROGRAMS
    SZELENYI, F
    ZECCA, V
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1991, 35 (1-2) : 270 - 282
  • [10] SUPPORTING REVERSE EXECUTION OF PARALLEL PROGRAMS
    PAN, DZ
    LINTON, MA
    [J]. SIGPLAN NOTICES, 1989, 24 (01): : 124 - 129