A PARALLEL PROBABILISTIC SYSTEM-LEVEL FAULT DIAGNOSIS APPROACH FOR LARGE MULTIPROCESSOR SYSTEMS

被引:9
|
作者
Elhadef, Mourad [1 ]
Abrougui, Kaouther [1 ]
Das, Shantanu [1 ]
Nayak, Amiya [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON, Canada
关键词
Fault tolerance; System-level diagnosis; Multiprocessor systems; Probabilistic diagnosis; Parallel genetic algorithms; Parallel virtual machine (PVM);
D O I
10.1142/S0129626406002472
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we present a system-level fault identification algorithm, using a parallel genetic algorithm, for diagnosing faulty nodes in large heterogeneous systems. The algorithm is based on a probabilistic model where individual node fails with an a priori probability p. The assumptions concerning test outcomes are the same as in the PMC model, that is, fault-free testers always give correct test outcomes and faulty testers are totally unpredictable. The parallel diagnosis algorithm was implemented and simulated on randomly generated large systems. The proposed parallelization is intended to speed up the performance of the evolutionary diagnosis approach, hence reducing the computation time by evolving various sub-populations in parallel. Simulation results are provided showing that the parallel diagnosis did improve the efficiency of the evolutionary diagnosis approach, in that it allowed faster diagnosis of faulty situations, making it a viable alternative to existing techniques of diagnosis. Moreover, the evolutionary approach still provide good results even when extreme non-diagnosable faulty situations are considered.
引用
收藏
页码:63 / 79
页数:17
相关论文
共 50 条
  • [1] An Evolutionary Approach to System-Level Fault Diagnosis
    Yang, Hui
    Elhadef, Mourad
    Nayak, Amiya
    Yang, Xiaofan
    [J]. 2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 1406 - +
  • [2] Probabilistic cluster fault diagnosis for multiprocessor systems
    Niu, Baohua
    Zhou, Shuming
    Zhang, Hong
    Zhang, Qifan
    [J]. THEORETICAL COMPUTER SCIENCE, 2024, 1020
  • [3] On the system-level fault diagnosis
    Xuan, H.N.
    [J]. 2001, Shanghai Computer Society (27):
  • [4] Probabilistic Fault Diagnosis of Clustered Faults for Multiprocessor Systems
    Sun, Xue-Li
    Fan, Jian-Xi
    Cheng, Bao-Lei
    Wang, Yan
    Zhang, Li
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (04) : 821 - 833
  • [5] Probabilistic Fault Diagnosis of Clustered Faults for Multiprocessor Systems
    Xue-Li Sun
    Jian-Xi Fan
    Bao-Lei Cheng
    Yan Wang
    Li Zhang
    [J]. Journal of Computer Science and Technology, 2023, 38 : 821 - 833
  • [6] System-level operational diagnosability analysis in quasi real-time fault diagnosis: The probabilistic approach
    Cui, Yiqian
    Shi, Junyou
    Wang, Zili
    [J]. JOURNAL OF PROCESS CONTROL, 2014, 24 (09) : 1444 - 1453
  • [7] A fault-injection methodology for the system-level dependability analysis of multiprocessor embedded systems
    Miele, Antonio
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2014, 38 (06) : 567 - 580
  • [8] SYSTEM-LEVEL FAULT-DIAGNOSIS
    FRIEDMAN, AD
    SIMONCINI, L
    [J]. COMPUTER, 1980, 13 (03) : 47 - 53
  • [9] Probabilistic diagnosis of large systems using a parallel genetic approach
    Elhadef, M
    Abrougui, K
    Das, S
    Nayak, A
    [J]. PDPTA '05: Proceedings of the 2005 International Conference on Parallel and Distributed Processing Techniques and Applications, Vols 1-3, 2005, : 1010 - 1016
  • [10] Probabilistic system-level fault diagnostic algorithms for multiprocessors
    Bartha, T
    Selenyi, E
    [J]. PARALLEL COMPUTING, 1997, 22 (13) : 1807 - 1821