FAULT TOLERANCE IN DISTRIBUTED UNIX

Cited by: 0
Authors
BORG, A [1 ]
BLAU, W [1 ]
OBERLE, W [1 ]
GRAETSCH, W [1 ]
Affiliation
[1] NIXDORF COMP, PADERBORN, GERMANY
Keywords
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
An initial design for a fault-tolerant, distributed version of UNIX was presented in an earlier paper [2]. That design left a number of open questions in two particular areas: fault tolerance for server processes through which peripherals are accessed, and recovery after a crash, including the re-backup of processes. Since then, the fundamental design involving three-way message transmission has remained unchanged. However, server fault tolerance has been redesigned and is now more consistent with the fault tolerance of normal user processes. Recovery and re-backup have been completed in a more efficient manner than previously envisioned. In addition, important changes in the implementation have occurred. In this paper, we review the original design, borrowing heavily from the earlier paper in sections 1-3, and explain additions and modifications in later sections.
Pages: 224-243
Page count: 20
Related papers
50 in total
  • [31] LITE FAULT-TOLERANCE KEEPS UNIX PCS FROM GOING OUT
    SEITHER, M
    [J]. MINI-MICRO SYSTEMS, 1989, 22 (03): : 32 - 35
  • [32] Ensuring Fault-Tolerance in Distributed Media
    A. G. Tormasov
    M. A. Khasin
    Yu. I. Pakhomov
    [J]. Programming and Computer Software, 2001, 27 : 245 - 251
  • [33] Ensuring fault-tolerance in distributed media
    Tormasov, AG
    Khasin, MA
    Pakhomov, YI
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2001, 27 (05) : 245 - 251
  • [34] IMPROVED DISTRIBUTED FAULT TOLERANT CLUSTERING ALGORITHM FOR FAULT TOLERANCE IN WSN
    Kaur, Mandeep
    Garg, Parul
    [J]. 2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 197 - 201
  • [35] LBFT: Load Balancing and Fault Tolerance in distributed controllers
    Mahjoubi, Ayeh
    Zeynalpour, Omid
    Eslami, Benyamin
    Yazdani, Nasser
    [J]. 2019 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC 2019), 2019,
  • [36] A Fault-tolerance Framework for Distributed Component Systems
    Hamid, Brahim
    Radermacher, Ansgar
    Vanuxeem, Patrick
    Lanusse, Agnes
    Gerard, Sebastien
    [J]. PROCEEDINGS OF THE 34TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS, 2008, : 84 - 91
  • [37] On the fault (in)tolerance of coordination mechanisms for distributed investment decisions
    Leitner, Stephan
    Behrens, Doris A.
    [J]. CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2015, 23 (01) : 251 - 278
  • [38] A framework for fault tolerance in distributed real time systems
    Malik, S
    Rehman, MJ
    [J]. IEEE: 2005 International Conference on Emerging Technologies, Proceedings, 2005, : 505 - 510
  • [39] Software fault tolerance for distributed object based computing
    Kim, HC
    Nair, VSS
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 1997, 39 (02) : 103 - 117
  • [40] Fault Tolerance on Improved Distributed Spanning Tree Structure
    Wang, Tiejun
    Liu, Heng
    Sun, Ming
    Liu, Zhen
    Zhou, Mingtian
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 5, 2010, : 296 - 300