A flexible clustered approach to high availability

Author
HughesFenchel, G
Source
TWENTY-SEVENTH ANNUAL INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING, DIGEST OF PAPERS | 1997
DOI
Not available
Chinese Library Classification (CLC)
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The Reliable Clustered Computing project created a system that enables applications to improve the reliability of off-the-shelf computers from a typical 99% (about 90 hours of downtime per year) to 99.99% (under one hour of downtime per year) in a cost-effective manner. The chief constraints were the need to achieve high reliability while minimizing cost and maintaining vendor independence. This was realized by creating a vendor-independent clustered configuration composed of two or more computers capable of recovering from hardware or software errors by restarting one or more processes on the current machine or by failing over one or more processes to another machine. Only two inexpensive custom hardware components were required for this solution: a WatchDog, to monitor component status, and a PowerDog, to control electrical power to processing elements (and optional peripherals). The bulk of the functionality was provided by software.
Pages: 314-318
Page count: 5
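
The downtime figures quoted in the abstract follow from simple arithmetic on the availability percentages. The sketch below is illustrative only and is not taken from the paper: the function name `annual_downtime_hours` and the constant `HOURS_PER_YEAR` are hypothetical, and a 365-day year is assumed.

```python
# Assumption: a non-leap year of 365 days, i.e. 8760 hours.
HOURS_PER_YEAR = 365 * 24


def annual_downtime_hours(availability_percent: float) -> float:
    """Return the expected downtime in hours per year for a given availability."""
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability must be between 0 and 100 percent")
    # The unavailable fraction of the year, multiplied by the hours in a year.
    return (1.0 - availability_percent / 100.0) * HOURS_PER_YEAR


if __name__ == "__main__":
    # The two availability levels named in the abstract.
    for pct in (99.0, 99.99):
        hours = annual_downtime_hours(pct)
        print(f"{pct}% availability: {hours:.3f} hours of downtime per year")
```

Under these assumptions, 99% availability corresponds to roughly 87.6 hours of downtime per year (the abstract's "about 90 hours"), and 99.99% corresponds to roughly 0.876 hours, or about 53 minutes.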