Measurement-based analysis of system dependability using fault injection and field failure data

被引:0
|
作者
Iyer, RK [1 ]
Kalbarczyk, Z [1 ]
机构
[1] Univ Illinois, Ctr Reliable & High Performance Comp, Urbana, IL 61801 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discussion in this paper focuses on the issues involved in analyzing the availability of networked systems using fault injection and the failure data collected by the logging mechanisms built into the system. In particular we address: (1) analysis in the prototype phase using physical fault injection to an actual system. We use example of fault injection-based evaluation of a software-implemented fault tolerance (SIFT) environment (built around a set of self-checking processes called ARMORS) that provides error detection and recovery services to spaceborne scientific applications and (2) measurement-based analysis of systems in the field. We use example of LAN of Windows NT based computers to present methods for collecting and analyzing failure data to characterize network system dependability. Both, fault injection and failure data analysis enable us to study naturally occurring errors and to provide feedback to system designers on potential availability bottlenecks. For example, the study of failures in a network of Windows NT machines reveals that most of the problems that lead to reboots are software related and that though the average availability evaluates to over 99%, a typical machine, on average, provides acceptable service only about 92% of the time.
引用
收藏
页码:290 / 317
页数:28
相关论文
共 50 条
  • [1] Measurement-based analysis: A key to experimental research in dependability
    Kalbarczyk, Zbigniew
    [J]. EDCC 2006: Sixth European Dependable Computing Conference, Proceedings, 2006, : 69 - 70
  • [2] Role of fault injection techniques in system dependability analysis
    Benso, A
    Corno, F
    Prinetto, P
    Rebaudengo, M
    Reorda, MS
    [J]. AEI AUTOMAZIONE ENERGIA INFORMAZIONE, 1996, 83 (10): : 63 - 69
  • [3] Dependability analysis using a fault injection tool based on synthesizability of HDL models
    Zarandi, HR
    Miremadi, SG
    Ejlali, A
    [J]. 18TH IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI SYSTEMS, PROCEEDINGS, 2003, : 485 - 492
  • [4] Fault injection stress strategies in dependability analysis
    Sosnowski, J
    Gawkowski, P
    Lesiak, A
    [J]. CONTROL AND CYBERNETICS, 2004, 33 (04): : 679 - 699
  • [5] MEASUREMENT-BASED EVALUATION OF OPERATING SYSTEM FAULT-TOLERANCE
    LEE, I
    TANG, D
    IYER, RK
    HSUEH, MC
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 1993, 42 (02) : 238 - 249
  • [6] Wide-area measurement-based fault tolerant control of power system during sensor failure
    Khosravani, Saeid
    Moghaddam, Iman Naziri
    Afshar, Ahmad
    Karrari, Mahdi
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2016, 137 : 66 - 75
  • [7] Measurement-based Analysis of Fault and Error Sensitivities of Dynamic Memory
    Yim, Keun Soo
    Kalbarczyk, Zbigniew
    Iyer, Ravishankar K.
    [J]. 2010 IEEE-IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS DSN, 2010, : 431 - 436
  • [8] Measurement-based analysis of networked system availability
    Iyer, RK
    Kalbarczyk, Z
    Kalyanakrishnan, M
    [J]. PERFORMANCE EVALUATION: ORIGINS AND DIRECTIONS, 2000, 1769 : 161 - 199
  • [9] A Novel Simulation Fault Injection Method for Dependability Analysis
    Lee, Dongwoo
    Na, Jongwhoa
    [J]. IEEE DESIGN & TEST OF COMPUTERS, 2009, 26 (06): : 50 - 60
  • [10] Dependability analysis of a high-speed network using software-implemented fault injection and simulated fault injection
    Stott, DT
    Ries, G
    Hsueh, MC
    Iyer, RK
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1998, 47 (01) : 108 - 119