Compiler-Enhanced Incremental Checkpointing for OpenMP Applications

Authors
Bronevetsky, Greg [1]
Marques, Daniel [2]
Pingali, Keshav [3]
McKee, Sally [4]
Rugina, Radu [5]
Affiliations
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[2] Ballista Secur, New York, NY 10005 USA
[3] Univ Texas Austin, Austin, TX 78712 USA
[4] Chalmers Univ Technol, S-41258 Gothenburg, Sweden
[5] VMWare, Palo Alto, CA 94304 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
As modern supercomputing systems reach the petaflop performance range, they grow in both size and complexity. This makes them increasingly vulnerable to failures from a variety of causes. Checkpointing is a popular technique for tolerating such failures, enabling applications to periodically save their state and restart computation after a failure. Although many automated system-level checkpointing solutions are currently available to HPC users, manual application-level checkpointing remains more popular due to its superior performance. This paper improves the performance of automated checkpointing via a compiler analysis for incremental checkpointing. This analysis, which works with both sequential and OpenMP applications, reduces checkpoint sizes by as much as 80% and enables asynchronous checkpointing.
Pages: 189 / +
Page count: 3
Related papers
  • [1] Compiler-Enhanced Incremental Checkpointing for OpenMP Applications
    Bronevetsky, Greg
    Marques, Daniel
    Pingali, Keshav
    Rugina, Radu
    McKee, Sally A.
    [J]. PPOPP'08: PROCEEDINGS OF THE 2008 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2008, : 275 - 276
  • [2] Compiler-enhanced incremental checkpointing
    Bronevetsky, Greg
    Marques, Daniel
    Pingali, Keshav
    Rugina, Radu
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2008, 5234 : 1 - +
  • [3] Compiler-Enhanced Reliability for Network-on-Chip Architectures
    Sasongko, Muhammad Aditya
    Topcuoglu, Haluk Rahmi
    Arslan, Sanem
    Kandemir, Mahmut Taylan
    [J]. 2017 25TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2017), 2017, : 584 - 588
  • [4] I/O optimization in the checkpointing of OpenMP parallel applications
    Losada, Nuria
    Martin, Maria J.
    Rodriguez, Gabriel
    Gonzalez, Patricia
    [J]. 23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 222 - 229
  • [5] An OpenMP based Parallelization Compiler for C Applications
    Arabnejad, Hamid
    Bispo, Joao
    Barbosa, Jorge G.
    Cardoso, Joao M. P.
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 915 - 923
  • [6] Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption
    Ji, Zhuoran
    Wang, Cho-Li
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 751 - 761
  • [7] An incremental compilation approach for OpenMP applications
    Giordano, M
    Furnari, MM
    [J]. NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 249 - 252
  • [8] Performance instrumentation and compiler optimizations for MPI/OpenMP applications
    Hernandez, Oscar
    Song, Fengguang
    Chapman, Barbara
    Dongarra, Jack
    Mohr, Bernd
    Moore, Shirley
    Wolf, Felix
    [J]. OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2008, 4315 : 267 - +
  • [9] Compiler Optimizations for OpenMP
    Doerfert, Johannes
    Finkel, Hal
    [J]. EVOLVING OPENMP FOR EVOLVING ARCHITECTURES, 2018, 11128 : 113 - 127
  • [10] An Architecture-Independent CGRA Compiler enabling OpenMP Applications
    Kojima, Takuya
    Adhi, Boma
    Cortes, Carlos
    Tan, Yiyu
    Sano, Kentaro
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 631 - 638