Dependability analysis of a high-speed network using software-implemented fault injection and simulated fault injection

被引:16
|
作者
Stott, DT
Ries, G
Hsueh, MC
Iyer, RK
机构
[1] Univ Illinois, Ctr Reliable & High Performance Comp, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] Chromat Res Inc, Sunnyvale, CA 94089 USA
[3] Digital Equipment Corp, Marlborough, MA 01752 USA
关键词
dependability; fault simulation; Myrinet; SWIFI fault effect; embedded system;
D O I
10.1109/12.656094
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a dependability study of high-speed, switched Local Area Networks (LANs) using Myrinet as an example testbed (with theoretical speeds of 2.56 Gbps). The study uses results of two fault injection methods, simulated fault injection and software-implemented fault injection (SWIFI), to analyze the application-level impact of transient faults injected into the network interface hardware. These results include a number of errors, such as dropped or corrupt messages, host interface or host resets, and local or remote host interface hangs. The paper presents the study in two parts: First, the results from the SWIFI method in the real system are used as a basis to validate the simulation and identify the major factors leading to differences between the methods. A comparison between the two injection methods shows that they agree for 83 percent of the fault injections. The results, however, vary greatly, depending on the fault type considered, The study also presents an analysis of the effects of varying workload intensity, host platform, and interface function targeted by the injection. An example of this analysis is to show that the function targeted has a significant impact on the fault activation rate. Finally, the study identifies two mechanisms by which faults may propagate from the interface to other parts of the network; in one example, this propagation caused the interface's host computer to reboot, while another caused a remote interface in the network to hang.
引用
收藏
页码:108 / 119
页数:12
相关论文
共 50 条
  • [31] Reduced instrumentation and optimized fault injection control for dependability analysis
    Vanhauwaert, P.
    Leveugle, R.
    Roche, P.
    IFIP VLSI-SOC 2006: IFIP WG 10.5 INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION & SYSTEM-ON-CHIP, 2006, : 391 - +
  • [32] An Evaluation Method for Embedded Software Dependability Using QEMU-Based Fault Injection Framework
    Metawie, Haytham
    Safar, Mona
    El-Kharashi, M. Watheq
    2022 6TH INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY, ICSRS, 2022, : 548 - 555
  • [33] Using simulated fault injection for fault tolerance assessment of quantum circuits
    Boncalo, Oana
    Udrescu, Mihai
    Prodan, Lucian
    Vladutiu, Mircea
    Amaricai, Alexandru
    40TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2007, : 213 - +
  • [34] Deterministic High-Speed Simulation of Complex Systems Including Fault-Injection
    Sand, Matthias
    Potyra, Stefan
    Sieh, Volkmar
    2009 IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS & NETWORKS (DSN 2009), 2009, : 211 - 216
  • [35] Dependability analysis of a commercial high-speed network
    Stott, DT
    Hsueh, MC
    Ries, GL
    Iyer, RK
    TWENTY-SEVENTH ANNUAL INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING, DIGEST OF PAPERS, 1997, : 248 - 257
  • [36] Simulation-based Fault Injection with QEMU for Speeding-up Dependability Analysis of Embedded Software
    Ferraretto, Davide
    Pravadelli, Graziano
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2016, 32 (01): : 43 - 57
  • [37] Dependability assessment of by-wire control systems using fault injection
    Blanc, S.
    Bonastre, A.
    Gil, P. J.
    JOURNAL OF SYSTEMS ARCHITECTURE, 2009, 55 (02) : 102 - 113
  • [38] Simulation-based Fault Injection with QEMU for Speeding-up Dependability Analysis of Embedded Software
    Davide Ferraretto
    Graziano Pravadelli
    Journal of Electronic Testing, 2016, 32 : 43 - 57
  • [39] Dependability Analysis on OpenStack IaaS Cloud: Bug Anaysis and Fault Injection
    Yuan Xiaoyong
    Li Ying
    Wu Zhonghai
    Liu Tiancheng
    2014 IEEE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2014, : 18 - 25
  • [40] Simulated Fault Injection Using Simulator Modification Technique
    Na, Jongwhoa
    Lee, Dongwoo
    ETRI JOURNAL, 2011, 33 (01) : 50 - 59