A practical cross-datacenter fault-tolerance algorithm in the cloud storage system

被引:1
|
作者
Yuxia Cheng
Xinjie Yu
Wenzhi Chen
Rui Chang
Yang Xiang
机构
[1] Deakin University,
[2] Zhejiang University,undefined
[3] State Key Laboratory of Mathematical Engineering and Advanced Computing,undefined
来源
Cluster Computing | 2017年 / 20卷
关键词
Fault-tolerance; Cloud storage; Cross-datacenter; Distributed system;
D O I
暂无
中图分类号
学科分类号
摘要
The fault-tolerance property in most cloud storage systems are designed within the scale of a single datacenter. The single datacenter as a whole may be unreachable or crashed due to severe problems, such as broken network links, power supply interruptions, and natural disasters, etc. Therefore, the design of an effective cross-datacenter fault-tolerant storage system is important to protect data security in the cloud. However, building a cross-datacenter fault-tolerant system faces great challenges, such as high latency, low throughput, high costs of bandwidth resources between datacenters. In this paper, we propose a practical cross-datacenter fault-tolerant (CDFT) algorithm in the cloud storage system. Our fault-tolerant algorithm design considers the difficult tradeoffs among fault tolerance, latency, throughput, network and storage costs. We propose the Domain Fault Codes (DFC) and the topology-aware scheduling techniques, which can tolerate the whole datacenter breakdown. We implemented the DFC-CDFT algorithm in a prototype cloud storage system. The experimental results showed that the proposed DFC-CDFT algorithm can effectively recover data blocks from the single datacenter failure while achieves low storage and bandwidth costs.
引用
收藏
页码:1801 / 1813
页数:12
相关论文
共 50 条
  • [41] Adding fault-tolerance to a hierarchical DRE system
    Rubel, Paul
    Loyall, Joseph
    Schantz, Richard
    Gillen, Matthew
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, PROCEEDINGS, 2006, 4025 : 303 - 308
  • [42] REPLICATION AND FAULT-TOLERANCE IN THE ISIS SYSTEM.
    Birman, Kenneth P.
    Operating Systems Review (ACM), 1985, 19 (05): : 79 - 86
  • [43] Flexible Supervision System: A Fast Fault-Tolerance Strategy for Cloud Applications in Cloud-Edge Collaborative Environments
    Cai, Weilin
    Chen, Heng
    Zhuo, Zhimin
    Wang, Ziheng
    An, Ninggang
    NETWORK AND PARALLEL COMPUTING, NPC 2022, 2022, 13615 : 108 - 113
  • [44] Proactive Fault-Tolerance Technique to Enhance Reliability of Cloud Service in Cloud Federation Environment
    Ray, Benay Kumar
    Saha, Avirup
    Khatua, Sunirmal
    Roy, Sarbani
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) : 957 - 971
  • [45] Using Proactive Fault-Tolerance Approach to Enhance Cloud Service Reliability
    Liu, Jialei
    Wang, Shangguang
    Zhou, Ao
    Kumar, Sathish A. P.
    Yang, Fangchun
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (04) : 1191 - 1202
  • [46] Efficacy and Efficiency of Algorithm-Based Fault-Tolerance on GPUs
    Wunderlich, Hans-Joachim
    Braun, Claus
    Raider, Sebastian
    PROCEEDINGS OF THE 2013 IEEE 19TH INTERNATIONAL ON-LINE TESTING SYMPOSIUM (IOLTS), 2013, : 240 - 243
  • [47] CONSTRUCTION OF CHECK SETS FOR ALGORITHM-BASED FAULT-TOLERANCE
    GU, DC
    ROSENKRANTZ, DJ
    RAVI, SS
    IEEE TRANSACTIONS ON COMPUTERS, 1994, 43 (06) : 641 - 650
  • [48] Dynamic Fault-tolerance and Mobility Provisioning for Services on Mobile Cloud Platforms
    Stahl, Philip
    Broberg, Jonatan
    Landfeldt, Bjorn
    2017 5TH IEEE INTERNATIONAL CONFERENCE ON MOBILE CLOUD COMPUTING, SERVICES, AND ENGINEERING (MOBILECLOUD), 2017, : 131 - 138
  • [49] Non-intrusive system level fault-tolerance
    Lundqvist, K
    Srinivasan, J
    Gorelov, S
    RELIABLE SOFTWARE TECHNOLOGY ADA-EUROPE 2005, PROCEEDINGS, 2005, 3555 : 156 - 166
  • [50] Service Based Software Fault-Tolerance for Manufacturing System
    Jeong, HwaYoung
    Hong, BongHwa
    COMPUTER APPLICATIONS FOR SOFTWARE ENGINEERING, DISASTER RECOVERY, AND BUSINESS CONTINUITY, 2012, 340 : 171 - +