FAULT TOLERANCE IN DISTRIBUTED UNIX

被引:0
|
作者
BORG, A [1 ]
BLAU, W [1 ]
OBERLE, W [1 ]
GRAETSCH, W [1 ]
机构
[1] NIXDORF COMP, PADERBORN, GERMANY
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An initial design for a fault tolerant, distributed version of UNIX was presented in an earlier paper [2]. That design left a number of open questions in two particular areas: Fault tolerance for server processes through which peripherals are accessed; recovery after a crash including the re-backup of processes. Since then, the fundamental design involving three-way message transmission has remained unchanged. However, server fault tolerance has been redesigned and is now more consistent with the fault tolerance of normal user processes. Recovery and re-backup have been completed in a more efficient manner than previously envisioned. In addition, important changes in the implementation have occurred. In this paper, we review the original design, borrowing heavily from the earlier paper in sections 1-3, and explain additions and modifications in later sections.
引用
收藏
页码:224 / 243
页数:20
相关论文
共 50 条
  • [1] A distributed fault-tolerance mechanism in UNIX
    Gantenbein, RE
    Yu, ZJ
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1996, : 146 - 149
  • [2] UNIX NETWORKS AND FAULT TOLERANCE
    HOGAN, J
    [J]. INTECH, 1992, 39 (11) : 19 - 20
  • [3] FAULT TOLERANCE UNDER UNIX
    BORG, A
    BLAU, W
    GRAETSCH, W
    HERRMANN, F
    OBERLE, W
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1989, 7 (01): : 1 - 24
  • [4] FAULT TOLERANCE IN DISTRIBUTED SYSTEMS
    SCHMITTER, E
    [J]. SIEMENS FORSCHUNGS-UND ENTWICKLUNGSBERICHTE-SIEMENS RESEARCH AND DEVELOPMENT REPORTS, 1983, 12 (01): : 34 - 37
  • [5] Fault Tolerance in Distributed Paradigms
    Haider, Sajjad
    Ansari, Naveed Riaz
    Akbar, Muhammad
    Perwez, Mohammad Raza
    Ghori, Khawaja MoyeezUllah
    [J]. COMPUTER COMMUNICATION AND MANAGEMENT, 2011, 5 : 587 - 592
  • [6] Incorporating fault tolerance in distributed applications
    Ouyang, J
    Maheshwari, P
    [J]. PROCEEDINGS OF THE 21ST AUSTRALASIAN COMPUTER SCIENCE CONFERENCE, ACSC'98, 1998, 20 (01): : 121 - 132
  • [7] THE MAFT ARCHITECTURE FOR DISTRIBUTED FAULT TOLERANCE
    KIECKHAFER, RM
    WALTER, CJ
    FINN, AM
    THAMBIDURAI, PM
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (04) : 398 - 405
  • [8] Fault Tolerance in Heterogeneous Distributed Systems
    Wang, Zhe
    Minsky, Naftaly H.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING (COLLABORATECOM), 2014, : 539 - 545
  • [9] SYNCHRONIZATION AND FAULT TOLERANCE IN A DISTRIBUTED TRACKER
    LEIGHTON, DA
    HANSEN, BK
    [J]. SIGNAL AND DATA PROCESSING OF SMALL TARGETS 1989, 1989, 1096 : 224 - 230
  • [10] An architecture for rapid distributed fault tolerance
    Russ, SH
    [J]. PARALLEL AND DISTRIBUTED PROCESSING, 1998, 1388 : 925 - 930