I/O optimization in the checkpointing of OpenMP parallel applications

被引:2
|
作者
Losada, Nuria [1 ]
Martin, Maria J. [1 ]
Rodriguez, Gabriel [1 ]
Gonzalez, Patricia [1 ]
机构
[1] Univ A Coruna, Comp Architecture Grp, La Coruna, Spain
关键词
OpenMP; Fault Tolerance; Checkpointing; FAULT-TOLERANCE; CPPC; TOOL;
D O I
10.1109/PDP.2015.39
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the increasing popularity of shared-memory systems, there is a lack of tools for providing fault tolerance support to shared-memory applications. Checkpointing is one of the most popular fault tolerance techniques. However, checkpointing cost in terms of computing time, network utilization or storage resources can be a limitation for its practical use. This work proposes different techniques for the optimization of the I/O cost in the checkpointing of shared-memory parallel applications. The proposals are extensively evaluated using the OpenMP NAS Parallel Benchmarks. Results show a significant decrease of the checkpointing overhead.
引用
收藏
页码:222 / 229
页数:8
相关论文
共 50 条
  • [21] Parallel I/O Aware Query Optimization
    Ghodsnia, Pedram
    Bowman, Ivan T.
    Nica, Anisoara
    [J]. SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 349 - 360
  • [22] Parallel I/O Performance for Application-Level Checkpointing on the Blue Gene/P System
    Fu, Jing
    Min, Misun
    Latham, Robert
    Carothers, Christopher D.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 465 - 473
  • [23] I/O in parallel applications: The weakest link
    Thakur, Rajeev
    Lusk, Ewing
    Gropp, William
    [J]. International Journal of High Performance Computing Applications, 12 (04): : 389 - 395
  • [24] Passion: Optimized I/O for parallel applications
    Thakur, R
    Choudhary, A
    Bordawekar, R
    More, S
    Kuditipudi, S
    [J]. COMPUTER, 1996, 29 (06) : 70 - &
  • [25] I/O in parallel applications: The weakest link
    Thakur, R
    Lusk, E
    Gropp, W
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 1998, 12 (04): : 389 - 395
  • [26] Programming Parallel Embedded and Consumer Applications in OpenMP Superscalar
    Andersch, Michael
    Chi, Chi Ching
    Juurlink, Ben
    [J]. ACM SIGPLAN NOTICES, 2012, 47 (08) : 281 - 282
  • [27] Parallel data reuse theories for OpenMP and OpenTM applications
    Wu J.-J.
    Yang X.-J.
    Liu G.-H.
    Tang Y.-H.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2010, 21 (12): : 3011 - 3028
  • [28] Extending an Application-Level Checkpointing Tool to Provide Fault Tolerance Support to OpenMP Applications
    Losada, Nuria
    Martin, Maria J.
    Rodriguez, Gabriel
    Gonzalez, Patricia
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2014, 20 (09) : 1352 - 1372
  • [29] Automatic Cloud I/O Configurator for I/O Intensive Parallel Applications
    Zhai, Jidong
    Liu, Mingliang
    Jin, Ye
    Ma, Xiaosong
    Chen, Wenguang
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (12) : 3275 - 3288
  • [30] Analyzing the Parallel I/O Severity of MPI Applications
    Mendez, Sandra
    Rexachs, Dolores
    Luque, Emilio
    [J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 953 - 962