A Scalability Hierarchical Fault Tolerance Strategy: Community Fault Tolerance

被引:0
|
作者
Chen, Jianping [2 ]
Lu, Yao [1 ]
Comsa, Ioan [1 ]
Kuonen, Pierre [1 ]
机构
[1] Univ Appl Sci Western Switzerland, Inst Complex Syst, CH-1705 Fribourg, Switzerland
[2] Univ Neuchatel, Fac Sci, Inst Informat, CH-2000 Neuchatel, Switzerland
关键词
distributed system; scalability; dynamic programming; hierarchical fault tolerance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of hierarchical fault tolerance strategies did not pay much attention to scalability of fault tolerance. In distributed system, scalability is a very important feature. To tolerant failures when the scale of the system changing is a normal and important scenario. Especially in nowadays, almost all the cloud computing companies provide their computing services elastically. To add extra devices or remove devices in order to provide different services happens all the time. In such a scenario, it is very important that the fault tolerance strategy is scalable. In this paper, we introduce dynamic programming thoughts to build hierarchical regions as communities for fault tolerance strategy and apply different strategies based on communities instead of a single process. We call this fault tolerance strategy as Community Fault Tolerance. It cannot only reduce the memory overload by eliminating the number of records of messages inside the community region, but also provides a good characteristic of scalability. The scalability property of our strategy makes it handle with the scenario of adding devices or removing devices in the distributed system easily.
引用
收藏
页码:212 / +
页数:3
相关论文
共 50 条
  • [31] Strategy for soft fault diagnosis on analog circuits with tolerance
    Dong Haidi
    Liu Gang
    Wang Junti
    Pan Dianheng
    Xie Hui
    [J]. PROCEEDINGS OF 2017 13TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), VOL 1, 2017, : 331 - 335
  • [32] Factorizing fault tolerance
    Prasetya, ISWB
    Swierstra, SD
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 290 (02) : 1201 - 1222
  • [33] Fault Tolerance as a Service
    Nandi, Bipin B.
    Paul, Himadri Sekhar
    Banerjee, Ansuman
    Ghosh, Sasthi C.
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 446 - 453
  • [34] FPGAs and fault tolerance
    Doumar, A
    Ito, H
    [J]. ICM 2001: 13TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS, PROCEEDINGS, 2001, : 222 - 225
  • [35] Fault tolerance in the brain
    Yu, Byron M.
    [J]. NATURE, 2016, 532 (7600) : 449 - 450
  • [36] Principles of fault tolerance
    White, RV
    Miles, FM
    [J]. APEC '96 - ELEVENTH ANNUAL APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITIONS, VOLS 1 & 2, CONFERENCE PROCEEDINGS, 1996, : 18 - 25
  • [37] FAULT TOLERANCE AND TESTING
    DISTANTE, F
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1990, 30 (1-5): : 507 - 507
  • [38] Efficient fault tolerance
    Daniel Gottesman
    [J]. Nature, 2016, 540 : 44 - 45
  • [39] DESIGN FAULT TOLERANCE
    KNIGHT, JC
    AMMANN, PE
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 1991, 32 (1-2) : 25 - 49
  • [40] FAULT TOLERANCE LIVES
    STEVENS, L
    [J]. COMPUTER DECISIONS, 1987, 19 (03): : 60 - 62