A Scalability Hierarchical Fault Tolerance Strategy: Community Fault Tolerance

被引:0
|
作者
Chen, Jianping [2 ]
Lu, Yao [1 ]
Comsa, Ioan [1 ]
Kuonen, Pierre [1 ]
机构
[1] Univ Appl Sci Western Switzerland, Inst Complex Syst, CH-1705 Fribourg, Switzerland
[2] Univ Neuchatel, Fac Sci, Inst Informat, CH-2000 Neuchatel, Switzerland
关键词
distributed system; scalability; dynamic programming; hierarchical fault tolerance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of hierarchical fault tolerance strategies did not pay much attention to scalability of fault tolerance. In distributed system, scalability is a very important feature. To tolerant failures when the scale of the system changing is a normal and important scenario. Especially in nowadays, almost all the cloud computing companies provide their computing services elastically. To add extra devices or remove devices in order to provide different services happens all the time. In such a scenario, it is very important that the fault tolerance strategy is scalable. In this paper, we introduce dynamic programming thoughts to build hierarchical regions as communities for fault tolerance strategy and apply different strategies based on communities instead of a single process. We call this fault tolerance strategy as Community Fault Tolerance. It cannot only reduce the memory overload by eliminating the number of records of messages inside the community region, but also provides a good characteristic of scalability. The scalability property of our strategy makes it handle with the scenario of adding devices or removing devices in the distributed system easily.
引用
收藏
页码:212 / +
页数:3
相关论文
共 50 条
  • [21] Hyper Star Fault Tolerance of Hierarchical Star Networks
    Yang, Lulu
    Hua, Xiaohui
    [J]. JOURNAL OF INTERCONNECTION NETWORKS, 2023, 23 (04)
  • [22] SOFTWARE FAULT TOLERANCE IN ARCHITECTURES WITH HIERARCHICAL PROTECTION LEVELS
    OZAKI, BM
    FERNANDEZ, EB
    GUDES, E
    [J]. IEEE MICRO, 1988, 8 (04) : 30 - 43
  • [23] MODELING OF HIERARCHICAL DISTRIBUTED SYSTEMS WITH FAULT-TOLERANCE
    SHIEH, YB
    GHOSAL, D
    CHINTAMANENI, PR
    TRIPATHI, SK
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1990, 16 (04) : 444 - 457
  • [24] Scalability and Fault Tolerance of the Alternating Direction Method of Multipliers for Sparse Grids
    Khakhutskyy, Valeriy
    Pflueger, Dirk
    Hegland, Markus
    [J]. PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 603 - 612
  • [25] Aspen Trees: Balancing Data Center Fault Tolerance, Scalability and Cost
    Walraed-Sullivan, Meg
    Vandat, Amin
    Marzullo, Keith
    [J]. PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES (CONEXT '13), 2013, : 85 - 96
  • [26] Database Sharding: To Provide Fault Tolerance and Scalability of Big Data on the Cloud
    Bagui, Sikha
    Nguyen, Loi Tang
    [J]. INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2015, 5 (02) : 36 - 52
  • [27] Optimal Fault Tolerance Strategy Selection for Web Services
    Zheng, Zibin
    Lyu, Michael R.
    [J]. INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2010, 7 (04) : 21 - 40
  • [28] A Replication Strategy for Fault Tolerance in Data Grid Environment
    Li, Jing
    [J]. ACC 2009: ETP/IITA WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING, 2009, : 363 - 366
  • [29] Failover strategy for fault tolerance in cloud computing environment
    Mohammed, Bashir
    Kiran, Mariam
    Maiyama, Kabiru M.
    Kamala, Mumtaz M.
    Awan, Irfan-Ullah
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (09): : 1243 - 1274
  • [30] Multisensor switching control strategy with fault tolerance guarantees
    Seron, Maria M.
    Zhuo, Xiang W.
    De Dona, Jose A.
    Martinez, John J.
    [J]. AUTOMATICA, 2008, 44 (01) : 88 - 97