Fault-tolerant file-I/O for portable checkpointing systems

被引:0
|
作者
机构
来源
Lyubashevskiy, Igor | 2000年 / Kluwer Academic Publishers, Dordrecht, Netherlands卷 / 16期
关键词
Algorithms - Binary codes - Input output programs - Program compilers - User interfaces;
D O I
暂无
中图分类号
学科分类号
摘要
The ftIO-system provides portable and fault-tolerant file-I/O by enhancing the functionality of the ANSI C file system without changing its application programmer interface and without depending on system-specific implementations of the standard file operations. The ftIO-system is an extension of the porch compiler and its runtime system. The porch compiler automatically generates code to save bookkeeping information about ftIO's transactional file operations in portable checkpoints. These portable checkpoints can be recovered on a binary incompatible architecture. We developed a new algorithm for supporting transactional file operations in ftIO. Rather than using the well-known two-phase commit protocol, this algorithm uses only a single bit of information and an atomic rename file operation to guarantee fault tolerance. In this paper, we describe our new ftIO algorithm, discuss design choices for ftIO, and provide experimental data of our ftIO prototype.
引用
收藏
页码:1 / 2
相关论文
共 50 条
  • [1] Fault-Tolerant File-I/O for Portable Checkpointing Systems
    Igor Lyubashevskiy
    Volker Strumpen
    The Journal of Supercomputing, 2000, 16 : 69 - 92
  • [2] Fault-tolerant file-I/O for portable checkpointing systems
    Lyubashevskiy, I
    Strumpen, V
    JOURNAL OF SUPERCOMPUTING, 2000, 16 (1-2): : 69 - 92
  • [3] Synthesis of fault-tolerant embedded systems with checkpointing and replication
    Izosimov, V
    Pop, P
    Eles, P
    Peng, Z
    DELTA 2006: THIRD IEEE INTERNATIONAL WORKSHOP ON ELECTRONIC DESIGN, TEST AND APPLICATIONS, 2006, : 440 - +
  • [4] Portable and fault-tolerant software systems
    Strumpen, V
    IEEE MICRO, 1998, 18 (05) : 22 - 32
  • [5] EFFICIENT CHECKPOINTING PROCEDURES FOR FAULT-TOLERANT DISTRIBUTED SYSTEMS
    SALEH, K
    AGARWAL, A
    MICROPROCESSING AND MICROPROGRAMMING, 1994, 40 (06): : 427 - 438
  • [6] I/O response time in a fault-tolerant parallel virtual file system
    Feng, D
    Jiang, H
    Zhu, YF
    NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2004, 3222 : 248 - 251
  • [7] Experiences with Formal Specification of Fault-Tolerant File Systems
    Geambasu, Roxana
    Birrell, Andrew
    MacCormick, John
    2008 IEEE INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS & NETWORKS WITH FTCS & DCC, 2008, : 96 - +
  • [8] Application controlled checkpointing coordination for fault-tolerant distributed computing systems
    Park, T
    Yeom, HY
    PARALLEL COMPUTING, 2000, 26 (04) : 467 - 482
  • [9] A retrial queue for modeling fault-tolerant systems with checkpointing and rollback recovery
    Dimitriou, Ioannis
    COMPUTERS & INDUSTRIAL ENGINEERING, 2015, 79 : 156 - 167
  • [10] Fault-tolerant CORBA, using checkpointing and recovery
    ComTec, 1997, 75 (08): : 20 - 25