Robot: An Efficient Model For Big Data Storage Systems Based On Erasure Coding

被引:0
|
作者
Yin, Chao [1 ]
Wang, Jianzong [3 ]
Xie, Changsheng [1 ,2 ]
Wan, Jiguang [1 ]
Long, Changlin [1 ]
Bi, Wenjuan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Wuhan Natl Lab Optoelect, Wuhan, Peoples R China
[3] NetEase Inc, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
distributed file system; erasure coding; big data; robustness; availiabilty; cloud storage; MDS ARRAY CODES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
it is well-known that with the explosive growth of data, the age of big data has arrived. How to save huge amounts of data is of great importance to both industry and academia. This paper puts forward a solution based on coding technologies in big data system that store a lot of cold data. By studying existing coding technologies and big data systems, we can not only maintain the system's reliability, but also improve the security and the utilization of storage systems. Due to the remarkable reliability and space saving rate of coding technologies, importing coding schema in to big data systems becomes prerequisite. In our presented schema, the storage node is divided into several virtual nodes to keep load balancing. By setting up different virtual node storage groups for different codec server, we can ensure system availability. And by utilizing the parallel decoding computing of the node and the block of data, we can also reduce the system recovery time when data is corrupted. Additionally, different users set different coding parameters can improve the robustness of big data storage systems. We configure various data block m and calibration block k to improve the utilization rate in the quantitative experiments. The results shows that parallel decoding speed can rise up two times than the past serial decoding speed. The encoding efficiency with ICRS coding is 34.2% higher than using CRS and 56.5% more than using RS coding equally. The decoding rate by using ICRS is 18.1% higher than using CRS and 31.1% higher than using RS averagely.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] On Data Parallelism of Erasure Coding in Distributed Storage Systems
    Li, Jun
    Li, Baochun
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 45 - 56
  • [2] Efficient Scheduling for Multi-Block Updates in Erasure Coding Based Storage Systems
    Shen, Jiajie
    Zhang, Kai
    Gu, Jiazhen
    Zhou, Yangfan
    Wang, Xin
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (04) : 573 - 581
  • [3] An Efficient Parallel Coding Scheme in Erasure-Coded Storage Systems
    Dong, Wenrui
    Liu, Guangming
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (03): : 627 - 643
  • [4] P-Schedule: Erasure Coding Schedule Strategy in Big Data Storage System
    Yin, Chao
    Lv, Haitao
    Li, Tongfang
    Liu, Yan
    Qu, Xiaoping
    Yuan, Sihao
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT III, 2018, 11336 : 270 - 279
  • [5] Erasure-Coding-Based Storage and Recovery for Distributed Exascale Storage Systems
    Kim, Jeong-Joon
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [6] UHUM: An Efficient Hybrid Update Mechanism in Distributed Storage Systems with Erasure Coding
    Luo, Qian
    Wang, Yun
    [J]. PROCEEDINGS OF THE 2019 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2019, : 158 - 163
  • [7] RAPID: A Fast Data Update Protocol in Erasure Coded Storage Systems for Big Data
    Akash, G. J.
    Lee, Ojus Thomas
    Kumar, S. D. Madhu
    Chandran, Priya
    Cuzzocrea, Alfredo
    [J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 890 - 897
  • [8] Erasure Coding for Cloud Storage Systems: A Survey
    Li, Jun
    Li, Baochun
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2013, 18 (03) : 259 - 272
  • [9] Erasure Coding for Cloud Storage Systems: A Survey
    Jun Li
    Baochun Li
    [J]. Tsinghua Science and Technology, 2013, 18 (03) : 259 - 272
  • [10] An In-network Aggregation Scheme for Erasure Coding Storage Systems in Data Centers
    Xia, Junxu
    Yao, Chendie
    Li, Jiangfan
    [J]. 2018 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2018, : 36 - 41