Dependability analysis of a high-speed network using software-implemented fault injection and simulated fault injection

Cited by: 16
Authors
Stott, DT
Ries, G
Hsueh, MC
Iyer, RK
Affiliations
[1] Univ Illinois, Ctr Reliable & High Performance Comp, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] Chromat Res Inc, Sunnyvale, CA 94089 USA
[3] Digital Equipment Corp, Marlborough, MA 01752 USA
Keywords
dependability; fault simulation; Myrinet; SWIFI fault effect; embedded system
DOI
10.1109/12.656094
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a dependability study of high-speed, switched Local Area Networks (LANs) using Myrinet as an example testbed (with theoretical speeds of 2.56 Gbps). The study uses results of two fault injection methods, simulated fault injection and software-implemented fault injection (SWIFI), to analyze the application-level impact of transient faults injected into the network interface hardware. These results include a number of errors, such as dropped or corrupt messages, host interface or host resets, and local or remote host interface hangs. The paper presents the study in two parts: First, the results from the SWIFI method in the real system are used as a basis to validate the simulation and identify the major factors leading to differences between the methods. A comparison between the two injection methods shows that they agree for 83 percent of the fault injections. The results, however, vary greatly, depending on the fault type considered. The study also presents an analysis of the effects of varying workload intensity, host platform, and interface function targeted by the injection. An example of this analysis is to show that the function targeted has a significant impact on the fault activation rate. Finally, the study identifies two mechanisms by which faults may propagate from the interface to other parts of the network; in one example, this propagation caused the interface's host computer to reboot, while another caused a remote interface in the network to hang.
Pages: 108-119
Page count: 12
Related papers
  • [1] Comparison of physical and software-implemented fault injection techniques
    Arlat, J
    Crouzet, Y
    Karlsson, J
    Folkesson, P
    Fuchs, E
    Leber, GH
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2003, 52 (09) : 1115 - 1133
  • [2] A software-implemented fault injection methodology for design and validation of system fault tolerance
    Some, RR
    Kim, WS
    Khanoyan, G
    Callum, L
    Agrawal, A
    Beahan, JJ
    [J]. INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2001, : 501 - 506
  • [3] Non-intrusive software-implemented fault injection in embedded systems
    Yuste, P
    Ruiz, JC
    Lemus, L
    Gil, P
    [J]. DEPENDABLE COMPUTING, 2003, 2847 : 23 - 38
  • [4] Software-implemented fault injection at machine code level for ARM architecture
    Jin, A
    Jiang, JH
    [J]. ICEMI 2005: Conference Proceedings of the Seventh International Conference on Electronic Measurement & Instruments, Vol 8, 2005, : 173 - 178
  • [5] Control Flow Aware Software-Implemented Fault Injection for Embedded CPUs
    Vankeirsbilck, Jens
    Thati, Venu Babu
    Van Waes, Jonas
    Hallez, Hans
    Boydens, Jeroen
    [J]. 2017 XXVI INTERNATIONAL SCIENTIFIC CONFERENCE ELECTRONICS (ET), 2017,
  • [6] Software-implemented Fault Injection in Operating System Kernel Mutex Data Structure
    Montrucchio, Bartolomeo
    Rebaudengo, Maurizio
    Velasco, Alejandro David
    [J]. 2014 IEEE 5TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2014,
  • [7] FAIL*: An Open and Versatile Fault-Injection Framework for the Assessment of Software-Implemented Hardware Fault Tolerance
    Schirmeier, Horst
    Hoffmann, Martin
    Dietrich, Christian
    Lenz, Michael
    Lohmann, Daniel
    Spinczyk, Olaf
    [J]. 2015 ELEVENTH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC), 2015, : 245 - 255
  • [8] Using software implemented fault inserter in dependability analysis
    Gawkowski, P
    Sosnowski, J
    [J]. 2002 PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2002, : 81 - 88
  • [9] A Tutorial on Hardware-Implemented Fault Injection and Online Fault Diagnosis for High-Speed Trains
    Yang, Xiaoyue
    Qiao, Xinyu
    Cheng, Chao
    Zhong, Kai
    Chen, Hongtian
    [J]. SENSORS, 2021, 21 (17)
  • [10] A methodology for dependability evaluation of the time-triggered architecture using software implemented fault injection
    Ademaj, A
    [J]. DEPENDABLE COMPUTING: EDCC-4, PROCEEDINGS, 2002, 2485 : 172 - 190