A new fault-tolerance framework for grid computing

Cited by: 7
Author
Derbal, Youcef [1]
Affiliation
[1] Ryerson Univ, Sch Informat Technol Management, 350 Victoria St, Toronto, ON M5B 2K3, Canada
Keywords
Computational grid; fault-tolerance; fault detector; reliability; service request
DOI
10.3233/MGS-2006-2203
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Fault detection and propagation in a computational grid require a comprehensive framework that takes into consideration the various grid environmental conditions, such as the asynchronous nature of communication and the uncertainty of the disseminated fault information. The paper presents a fault-tolerance framework that provides the necessary models to manage the local faulty behavior associated with the operation of hosted services. The framework includes a mechanism for quantifying the fault vulnerability of grid nodes and their hosted services. The resulting measures of fault vulnerability are globally disseminated to enable the synthesis of decentralized fault-tolerant decision-making strategies.
Pages: 115-133
Number of pages: 19