Geo-aware erasure coding for high-performance erasure-coded storage clusters

被引:3
|
作者
Mohan, Lakshmi J. [1 ]
Caneleo, Pablo Ignacio Serrano [1 ]
Parampalli, Udaya [1 ]
Harwood, Aaron [1 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, Parkville, Vic, Australia
关键词
Distributed storage; Erasure codes; Geographical diversity; XOR based codes; Code aware placement; Hadoop distributed file system; HDFS-RAID;
D O I
10.1007/s12243-017-0623-2
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Erasure code-based distributed storage systems are increasingly being used by storage providers for big data storage since they offer the same reliability as replication with a significant decrease in the amount of storage required. But, when it comes to a storage system with data nodes spread across a very large geographical area, the node's recovery performance is affected by various factors that are both network and computation related. In this paper, we present a XOR-based code supplemented with the ideas of parity duplication and rack awareness that could be adopted in such storage clusters to improve the recovery performance during node failures and compare it with popular implementations of erasure codes, namely Facebook's Reed-Solomon codes and XORBAS local recovery codes. The code performance along with the proposed ideas are evaluated on a geo-diverse cluster deployed on the NeCTAR research cloud. We also present a scheme for intelligently placing blocks of coded storage depending on the design of the code, inspired by local reconstruction codes. The sum of all these propositions could offer a better solution for applications that are deployed on coded storage systems that are geographically distributed, in which storage constraints make triple replication not affordable, at the same time ensuring minimal recovery time is a strict requirement.
引用
收藏
页码:139 / 152
页数:14
相关论文
共 50 条
  • [1] Geo-aware erasure coding for high-performance erasure-coded storage clusters
    Lakshmi J. Mohan
    Pablo Ignacio Serrano Caneleo
    Udaya Parampalli
    Aaron Harwood
    Annals of Telecommunications, 2018, 73 : 139 - 152
  • [2] LaRS: A Load-aware Recovery Scheme for Heterogeneous Erasure-Coded Storage Clusters
    Luo, Haibing
    Huang, Jianzhong
    Cao, Qiang
    Xie, Changsheng
    2014 9TH IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE, AND STORAGE (NAS), 2014, : 168 - 175
  • [3] Network-Aware Feasible Repairs for Erasure-Coded Storage
    Sipos, Marton
    Gahm, Josh
    Venkat, Narayan
    Oran, Dave
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (03) : 1404 - 1417
  • [4] Repair Pipelining for Erasure-Coded Storage
    Li, Runhui
    Li, Xiaolu
    Lee, Patrick P. C.
    Huang, Qun
    2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 567 - 579
  • [5] An Erasure-Coded Data Update Method for Distributed Storage Clusters
    Zhang Z.
    Liu D.
    Tan Y.
    Wu Y.
    Luo L.
    Wang W.
    Qiao L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (11): : 2451 - 2466
  • [6] An Efficient Parallel Coding Scheme in Erasure-Coded Storage Systems
    Dong, Wenrui
    Liu, Guangming
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (03): : 627 - 643
  • [7] Making MapReduce Scheduling Effective in Erasure-Coded Storage Clusters
    Li, Runhui
    Lee, Patrick P. C.
    2015 IEEE 21ST INTERNATIONAL WORKSHOP ON LOCAL & METROPOLITAN AREA NETWORKS (LANMAN), 2015,
  • [8] Optimistic Erasure-Coded Distributed Storage
    Dutta, Partha
    Guerraoui, Rachid
    Levy, Ron R.
    DISTRIBUTED COMPUTING, PROCEEDINGS, 2008, 5218 : 182 - +
  • [9] A popularity-aware reconstruction technique in erasure-coded storage systems
    Cao, Ting
    Peng, Xiaopu
    Zhang, Chaowei
    Al Tekreeti, Taha Khalid
    Mao, Jianzhou
    Qin, Xiao
    Huang, Jianzhong
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 146 : 122 - 138
  • [10] PUSH: A Pipelined Reconstruction I/O for Erasure-Coded Storage Clusters
    Huang, Jianzhong
    Liang, Xianhai
    Qin, Xiao
    Cao, Qiang
    Xie, Changsheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (02) : 516 - 526