A Cross-Datacenter Erasure Code Writing Method Based on Generator Matrix Transformation

被引:0
|
作者
Bao H. [1 ,2 ]
Wang Y. [1 ,2 ]
Xu F. [2 ]
机构
[1] National Laboratory for Parallel and Distributed Processing (National University of Defense Technology), Changsha
[2] College of Computer, National University of Defense Technology, Changsha
基金
中国国家自然科学基金; 国家教育部科学基金资助;
关键词
Cross-datacenter storage; Disaster tolerance; Erasure code; Fault tolerance technology; Writing method;
D O I
10.7544/issn1000-1239.2020.20190542
中图分类号
学科分类号
摘要
In cross-datacenter storage systems, existing writing methods of erasure code usually has low encoding efficiency, low transmission efficiency, and large network resource consumption. Therefore, cross-datacenters erasure code usually has a low writing rate. This paper proposes a cross-datacenter erasure code writing method based on generator matrix transformation called CREW. Specifically, we first propose a greedy strategy-based transmission topology construction algorithm called GBTC, which can construct a tree-structured transmission topology with incremental weights (the weights are set to the network distances between datacenters) from top to bottom to organize data transmission between datacenters. Then, we propose a generator matrix transformation algorithm called GMT. Without changing the linear relationship of coded blocks, GMT can transform the generator matrix so that the number of data blocks related to a coded block is negatively correlated with the network distance between the datacenter where the coded block is located and the root of the tree-structured topology. Therefore, CREW only needs to transfer a small number of data blocks through a long network distance to write data. Thus, the network resource consumption is reduced. Finally, we propose a distributed pipelined writing algorithm called DPW to distribute encoding operations to different nodes for parallel execution and limit the number of forwards of data blocks, thereby improving encoding efficiency and transmission efficiency. Experiments show that compared with writing methods of traditional erasure code, the write rate of CREW is increased by 36.3%~57.9%. And compared with the existing writing method of cross-datacenter erasure code (IncEncoding), the writing rate of CREW is increased by 32.4%. © 2020, Science Press. All right reserved.
引用
收藏
页码:291 / 305
页数:14
相关论文
共 35 条
  • [1] Pierre M., Alexandru C., Gabriel A., Et al., Towards efficient location and placement of dynamic replicas for geo-distributed data stores, Proc of the 7th ACM Workshop on Scientific Cloud Computing, pp. 3-9, (2016)
  • [2] Wyatt L., Michael F., Michael K., Et al., Don't settle for eventual: Scalable causal consistency for wide-area storage with COPS, Proc of the 23rd ACM Symp on Operating Systems Principles, pp. 401-416, (2011)
  • [3] Yu B., Pan J., Location-aware associated data placement for geo-distributed data-intensive applications, Proc of the 34th IEEE Conf on Computer Communications, pp. 603-611, (2015)
  • [4] Cable smashing affects Alipay
  • [5] Yevgeniy S., UPDATE: Explosion in downtown Los Angeles disrupts data center operations
  • [6] Official response to large-scale failure of Amazon China cloud service: Affected by the construction party to cut fiber
  • [7] Japan earthquake threatens data centers of several IT giants in Tokyo
  • [8] Amazon AWS confirms the downtime in night
  • [9] Wang Y., Li S., Research and performance evaluation of data replication technology in distributed storage systems, Computers & Mathematics with Applications, 51, 11, pp. 1625-1632, (2006)
  • [10] Wang Y., Pei X., Ma X., Et al., TA-Update: An adaptive update scheme with tree-structured transmission in erasure-coded storage systems, IEEE Transactions on Parallel and Distributed Systems, 29, 8, pp. 1893-1906, (2018)