A Scalability Hierarchical Fault Tolerance Strategy: Community Fault Tolerance

被引:0
|
作者
Chen, Jianping [2 ]
Lu, Yao [1 ]
Comsa, Ioan [1 ]
Kuonen, Pierre [1 ]
机构
[1] Univ Appl Sci Western Switzerland, Inst Complex Syst, CH-1705 Fribourg, Switzerland
[2] Univ Neuchatel, Fac Sci, Inst Informat, CH-2000 Neuchatel, Switzerland
关键词
distributed system; scalability; dynamic programming; hierarchical fault tolerance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of hierarchical fault tolerance strategies did not pay much attention to scalability of fault tolerance. In distributed system, scalability is a very important feature. To tolerant failures when the scale of the system changing is a normal and important scenario. Especially in nowadays, almost all the cloud computing companies provide their computing services elastically. To add extra devices or remove devices in order to provide different services happens all the time. In such a scenario, it is very important that the fault tolerance strategy is scalable. In this paper, we introduce dynamic programming thoughts to build hierarchical regions as communities for fault tolerance strategy and apply different strategies based on communities instead of a single process. We call this fault tolerance strategy as Community Fault Tolerance. It cannot only reduce the memory overload by eliminating the number of records of messages inside the community region, but also provides a good characteristic of scalability. The scalability property of our strategy makes it handle with the scenario of adding devices or removing devices in the distributed system easily.
引用
收藏
页码:212 / +
页数:3
相关论文
共 50 条
  • [1] On Fault Tolerance and Scalability of Swarm Robotic Systems
    Bjerknes, Jan Dyre
    Winfield, Alan F. T.
    [J]. DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS, 2013, 83 : 431 - 444
  • [2] Hierarchical fault tolerance for nanoscale memories
    Jeffery, Casey A.
    Figueiredo, Renato J. O.
    [J]. IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2006, 5 (04) : 407 - 414
  • [3] On conditional fault tolerance of hierarchical cubic networks
    Li, Xiang-Jun
    Liu, Min
    Yan, Zheng
    Xu, Jun-Ming
    [J]. THEORETICAL COMPUTER SCIENCE, 2019, 761 : 1 - 6
  • [4] Fault Tolerance Management for a Hierarchical GridRPC Middleware
    Bouteiller, Aurelien
    Desprez, Frederic
    [J]. CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 484 - 491
  • [5] Fault tolerance analysis of hierarchical folded cube
    Sun, Xueli
    Dong, Qingfeng
    Zhou, Shuming
    Lv, Mengjie
    Lian, Guanqin
    Liu, Jiafei
    [J]. THEORETICAL COMPUTER SCIENCE, 2019, 790 : 117 - 130
  • [6] Fault tolerance of hierarchical cubic networks based on cluster fault pattern
    Lv, Mengjie
    Fan, Weibei
    Dong, Hui
    Wang, Guijuan
    [J]. COMPUTER JOURNAL, 2024,
  • [7] Balancing Scalability, Performance and Fault Tolerance for Structured Data (BSPF)
    Khalid, Amna
    Afzal, Hammad
    Aftab, Shoohira
    [J]. 2014 16TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2014, : 725 - 732
  • [8] A Verification Strategy for Fault-Detection and Fault-Tolerance Circuits
    Boschi, Gabriele
    Mariani, Riccardo
    Lorenzini, Stefano
    [J]. 2011 IEEE 17TH INTERNATIONAL ON-LINE TESTING SYMPOSIUM (IOLTS), 2011,
  • [9] FAULT TOLERANCE
    不详
    [J]. COMPUTER DECISIONS, 1984, 16 (14): : 34 - 34
  • [10] FAULT TOLERANCE
    DALCIN, M
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1989, 27 (1-5): : 695 - 695