Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques

被引:20
|
作者
Zhou, Tianli [1 ]
Tian, Chao [1 ]
机构
[1] Texas A&M Univ, Wisenbaker Engn Bldg 3128,188 Bizzell St, College Stn, TX 77843 USA
关键词
Erasure code; performance; SCHEME; RAID;
D O I
10.1145/3375554
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various techniques have been proposed in the literature to improve erasure code computation efficiency, including optimizing bitmatrix design and computation schedule, common XOR (exclusive-OR) operation reduction, caching management techniques, and vectorization techniques. These techniques were largely proposed individually, and, in this work, we seek to use them jointly. To accomplish this task, these techniques need to be thoroughly evaluated individually and their relation better understood. Building on extensive testing, we develop methods to systematically optimize the computation chain together with the underlying bitmatrix. This led to a simple design approach of optimizing the bitmatrix by minimizing a weighted computation cost function, and also a straightforward coding procedure-follow a computation schedule produced from the optimized bitmatrix to apply XOR-level vectorization. This procedure provides better performances than most existing techniques (e.g., those used in ISA-L and Jerasure libraries), and sometimes can even compete against well-known but less general codes such as EVENODD, RDP, and STAR codes. One particularly important observation is that vectorizing the XOR operations is a better choice than directly vectorizing finite field operations, not only because of the flexibility in choosing finite field size and the better encoding throughput, but also its minimal migration efforts onto newer CPUs.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques
    Zhou, Tianli
    Tian, Chao
    PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 317 - 329
  • [2] On Data Parallelism of Erasure Coding in Distributed Storage Systems
    Li, Jun
    Li, Baochun
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 45 - 56
  • [3] Sparsity exploiting erasure coding for distributed storage of versioned data
    Harshan, J.
    Oggier, Frederique
    Datta, Anwitaman
    COMPUTING, 2016, 98 (12) : 1305 - 1329
  • [4] Sparsity exploiting erasure coding for distributed storage of versioned data
    J. Harshan
    Frédérique Oggier
    Anwitaman Datta
    Computing, 2016, 98 : 1305 - 1329
  • [5] Exploring Erasure Coding Techniques for High Availability of Intermediate Data
    Zhang, Zhe
    Bockelman, Brian
    Weitzel, Derek
    Swanson, David
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 865 - 872
  • [6] Erasure Coding-Oriented Data Update for Cloud Storage: A Survey
    Xiao, Yifei
    Zhou, Shijie
    Zhong, Linpeng
    IEEE ACCESS, 2020, 8 (08): : 227982 - 227998
  • [7] Secure Data Storage Using Erasure-Coding In Cloud Environment
    Vasanthi, G.
    Chinnasamy, P.
    Kanagavalli, N.
    Ramalingam, M.
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [8] Distributed Erasure Coding in Data Centric Storage for Wireless Sensor Networks
    Albano, Michele
    Chessa, Stefano
    ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 22 - 27
  • [9] Erasure coding for distributed storage: an overview
    Balaji, S. B.
    Krishnan, M. Nikhil
    Vajha, Myna
    Ramkumar, Vinayak
    Sasidharan, Birenjith
    Kumar, P. Vijay
    SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (10)
  • [10] Erasure coding for distributed storage: an overview
    S.B.BALAJI
    M.Nikhil KRISHNAN
    Myna VAJHA
    Vinayak RAMKUMAR
    Birenjith SASIDHARAN
    P.Vijay KUMAR
    Science China(Information Sciences), 2018, 61 (10) : 7 - 51