SUORA: A Scalable and Uniform Data Distribution Algorithm for Heterogeneous Storage Systems

Citations: 0
Authors
Zhou, Jiang [1]
Xie, Wei [1]
Noble, Jason [2]
Echo, Kace [1]
Chen, Yong [1]
Affiliations
[1] Texas Tech Univ, Lubbock, TX 79409 USA
[2] NIMBOXX Inc, Austin, TX USA
Keywords
Data distribution algorithm; data placement; data management; heterogeneous storage; data centers;
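DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract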
The data scale in many data centers is growing explosively with emerging applications and uses of big data technologies. Data distribution is a key issue in large-scale distributed storage systems, which must place petabytes of data or more across tens or hundreds of thousands of storage devices. Meanwhile, heterogeneous storage systems, such as those combining hard disk drives (HDDs) and storage class memories (SCMs), have become increasingly popular for massive data storage because they balance performance, capacity, and cost. Current data distribution algorithms can achieve efficient, scalable, and balanced mapping, but they do not distinguish the different characteristics of heterogeneous devices well. This paper presents a novel data distribution algorithm called SUORA (Scalable and Uniform storage via Optimally-adaptive and Random number Addressing) that takes full advantage of heterogeneous devices. SUORA is a pseudo-random algorithm that uniformly distributes data across a hybrid and tiered storage cluster. It divides heterogeneous devices into groups, maps them onto different buckets, and assigns them to various segments in each bucket. A pseudo-random, deterministic number sequence is generated to map data among segments and devices. Data movement is performed to achieve better read throughput while keeping the load balanced, according to data hotness and a bucket threshold. By taking the distinct characteristics of heterogeneous storage devices into account, the SUORA algorithm achieves highly efficient, adaptive data distribution for data centers and heterogeneous storage systems.
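The bucket, segment, and pseudo-random mapping idea in the abstract can be illustrated with a small sketch. This is not the SUORA implementation: the bucket names, device names, capacity weights, and the functions `build_segments`, `seeded_rng`, and `place` are all hypothetical, and the use of SHA-256 to seed the number sequence is an assumption made only for illustration. The sketch covers deterministic, capacity-weighted placement inside one bucket and leaves out hotness-based data movement.

```python
import hashlib
import random

# Hypothetical cluster layout: bucket name -> {device name: capacity weight}.
# Devices of the same type share a bucket (an assumption for this sketch).
BUCKETS = {
    "scm": {"scm0": 1, "scm1": 1},
    "hdd": {"hdd0": 4, "hdd1": 4, "hdd2": 8},
}


def build_segments(devices):
    """Give each device a number of segments proportional to its weight."""
    segments = []
    for name in sorted(devices):
        segments.extend([name] * devices[name])
    return segments


def seeded_rng(key, bucket):
    """Return a deterministic generator derived from the key and bucket."""
    digest = hashlib.sha256(f"{bucket}:{key}".encode()).digest()
    return random.Random(int.from_bytes(digest[:8], "big"))


def place(key, bucket, replicas=1):
    """Map a key to up to `replicas` distinct devices in one bucket."""
    devices = BUCKETS[bucket]
    segments = build_segments(devices)
    rng = seeded_rng(key, bucket)
    limit = min(replicas, len(devices))
    chosen = []
    # Walk the key's pseudo-random sequence until enough distinct devices appear.
    while len(chosen) < limit:
        device = segments[rng.randrange(len(segments))]
        if device not in chosen:
            chosen.append(device)
    return chosen
```

Because the sequence depends only on the key and the bucket, any client can recompute a placement without a lookup table, and a device holding more segments receives a proportionally larger share of keys. For example, `place("object-17", "hdd", replicas=2)` returns the same two HDD names on every call.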
Pages: 10
Related papers
50 in total
  • [1] Data Distribution for Heterogeneous Storage Systems
    Zhou, Jiang
    Chen, Yong
    Zheng, Mai
    Wang, Weiping
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (06) : 1747 - 1762
  • [2] A Heterogeneous Cloud Storage Platform With Uniform Data Distribution by Software-Defined Storage Technologies
    Yang, Chao-Tung
    Chen, Shuo-Tsung
    Cheng, Wei-Hsun
    Chan, Yu-Wei
    Kristiani, Endah
    [J]. IEEE ACCESS, 2019, 7 : 147672 - 147682
  • [3] The Implementation of Supporting Uniform Data Distribution with Software-Defined Storage Service on Heterogeneous Cloud Storage
    Cheng, Wei-Hsun
    Chiang, Chun-I
    Yang, Chao-Tung
    Chen, Shuo-Tsung
    Liu, Jung-Chun
    [J]. 2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 610 - 615
  • [4] Data Migration in Heterogeneous Storage Systems
    Kari, Chadi
    Kim, Yoo-Ah
    Russell, Alexander
    [J]. 31ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2011), 2011, : 143 - 150
  • [5] An Effective Cache Algorithm for Heterogeneous Storage Systems
    Li, Yong
    Feng, Dan
    Shi, Zhan
    [J]. SCIENTIFIC WORLD JOURNAL, 2013,
  • [6] Hashing Based Data Distribution in Heterogeneous Storage
    Zhou, Jiang
    Su, Lin
    Wang, Weiping
    Chen, Yong
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 652 - 659
  • [7] A scalable task duplication based scheduling algorithm for heterogeneous systems
    Ranaweera, S
    Agrawal, DP
    [J]. 2000 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2000, : 383 - 390
  • [8] Scalable clustering and mapping algorithm for application distribution on heterogeneous and irregular FPGA clusters
    Kalms, Lester
    Goehringer, Diana
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 133 : 367 - 376
  • [9] A guideline for data placement in heterogeneous distributed storage systems
    Kaneko, Shun
    Nakamura, Takaki
    Kamei, Hitoshi
    Muraoka, Hiroaki
    [J]. PROCEEDINGS 2016 5TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS IIAI-AAI 2016, 2016, : 942 - 945
  • [10] Mass data storage and sharing algorithm in distributed heterogeneous environment
    Yan, Huaichao
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (02) : 317 - 326