Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques

被引:20
|
作者
Zhou, Tianli [1 ]
Tian, Chao [1 ]
机构
[1] Texas A&M Univ, Wisenbaker Engn Bldg 3128,188 Bizzell St, College Stn, TX 77843 USA
关键词
Erasure code; performance; SCHEME; RAID;
D O I
10.1145/3375554
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various techniques have been proposed in the literature to improve erasure code computation efficiency, including optimizing bitmatrix design and computation schedule, common XOR (exclusive-OR) operation reduction, caching management techniques, and vectorization techniques. These techniques were largely proposed individually, and, in this work, we seek to use them jointly. To accomplish this task, these techniques need to be thoroughly evaluated individually and their relation better understood. Building on extensive testing, we develop methods to systematically optimize the computation chain together with the underlying bitmatrix. This led to a simple design approach of optimizing the bitmatrix by minimizing a weighted computation cost function, and also a straightforward coding procedure-follow a computation schedule produced from the optimized bitmatrix to apply XOR-level vectorization. This procedure provides better performances than most existing techniques (e.g., those used in ISA-L and Jerasure libraries), and sometimes can even compete against well-known but less general codes such as EVENODD, RDP, and STAR codes. One particularly important observation is that vectorizing the XOR operations is a better choice than directly vectorizing finite field operations, not only because of the flexibility in choosing finite field size and the better encoding throughput, but also its minimal migration efforts onto newer CPUs.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Minimizing Network and Storage Costs for Consensus with Flexible Erasure Coding
    Zhang, Mi
    Kang, Qihan
    Lee, Patrick P. C.
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 41 - 50
  • [32] Erasure Coding for Small Objects in In-Memory KV Storage
    Yiu, Matt M. T.
    Chan, Helen H. W.
    Lee, Patrick P. C.
    SYSTOR'17: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, 2017,
  • [33] Cost analysis of erasure coding for exa-scale storage
    Dong-Oh Kim
    Hong-Yeon Kim
    Young-Kyun Kim
    Jeong-Joon Kim
    The Journal of Supercomputing, 2019, 75 : 4638 - 4656
  • [34] Demand-Aware Erasure Coding for Distributed Storage Systems
    Li, Jun
    Li, Baochun
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2021, 9 (02) : 532 - 545
  • [35] In-network block repairing for erasure coding storage systems
    Xia, Junxu
    Guo, Deke
    Cheng, Geyao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (24):
  • [36] ProCode: A Proactive Erasure Coding Scheme for Cloud Storage Systems
    Li, Peng
    Li, Jing
    Stones, Rebecca J.
    Wang, Gang
    Li, Zhongwei
    Liu, Xiaoguang
    PROCEEDINGS OF 2016 IEEE 35TH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2016, : 219 - 228
  • [37] Hierarchical Erasure Coding for P2P storage
    Charles, Denis
    Puri, Sidd
    2009 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 926 - 931
  • [38] Composite Extension Finite Fields for Distributed Storage Erasure Coding
    Heide, Janus
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [39] MICS : Mingling Chained Storage Combining Replication and Erasure Coding
    Tang, Yan
    Yin, Jianwei
    Lo, Wei
    Li, Ying
    Deng, Shuiguang
    Dong, Kexiong
    Pu, Calton
    2015 IEEE 34TH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2015, : 192 - 201
  • [40] Maintenance-Efficient Erasure Coding for Distributed Archival Storage
    Cuong Pham
    Zhang, Feng
    Tran, Duc A.
    2011 20TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2011,