A practical cross-datacenter fault-tolerance algorithm in the cloud storage system

Cited by: 1
Authors
Yuxia Cheng
Xinjie Yu
Wenzhi Chen
Rui Chang
Yang Xiang
Affiliations
[1] Deakin University
[2] Zhejiang University
[3] State Key Laboratory of Mathematical Engineering and Advanced Computing
Source
Cluster Computing | 2017 / Vol. 20
Keywords
Fault-tolerance; Cloud storage; Cross-datacenter; Distributed system
DOI
Not available
Abstract
In most cloud storage systems, fault tolerance is designed within the scope of a single datacenter. An entire datacenter, however, may become unreachable or crash because of severe problems such as broken network links, power supply interruptions, and natural disasters. An effective cross-datacenter fault-tolerant storage system is therefore important for protecting data security in the cloud. Building such a system is challenging because links between datacenters have high latency, low throughput, and high bandwidth costs. In this paper, we propose a practical cross-datacenter fault-tolerant (CDFT) algorithm for cloud storage systems. The algorithm design considers the difficult tradeoffs among fault tolerance, latency, throughput, and network and storage costs. We propose Domain Fault Codes (DFC) and topology-aware scheduling techniques, which together tolerate the breakdown of an entire datacenter. We implemented the DFC-CDFT algorithm in a prototype cloud storage system. The experimental results show that the proposed DFC-CDFT algorithm can effectively recover data blocks after a single-datacenter failure while achieving low storage and bandwidth costs.
Pages: 1801-1813
Page count: 12