Optimizing data robustness in large-scale storage systems

Authors
Gougeaud, Sebastien [1]
Zertal, Soraya [1]
Lafoucriere, Jacques-Charles [2]
Deniel, Philippe [2]
Affiliations
[1] Univ Versailles, Li PaRAD, F-78000 Versailles, France
[2] CEA, DAM, Ile De France, France
Keywords
robustness; mapping efficiency; declustering; data layout; large storage systems
DOI
10.1109/HPCS.2017.44
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
The storage capacity provided by data centers keeps increasing, currently reaching the exabyte scale with thousands of disks. The resiliency of such systems therefore becomes critical, both to avoid data loss and to reduce the impact of the reconstruction process on data access time. We propose SD2S, a method for creating a placement scheme for declustered RAID organizations, based on a shifting placement. It consists of computing degree matrices, which represent the distance between the source sets of each pair of physical disks and thus the number of data blocks that will be reconstructed in case of a double failure. The scheme is created by computing a score function for all possible shifting offsets and selecting the offset that ensures the reconstruction of the highest percentage of data. Results show the data reconstruction distribution against the number of double failure events. In addition, the overhead generated by calculating the shifting offsets is compared to greedy SD2S and to CRUSH without replicas, for systems reaching around a hundred disks. These results confirm that selecting the best offset can lead to complete data reconstruction with a small overhead, especially for large systems.
Pages: 236-243
Number of pages: 8
Related papers
  • [1] On Uncertainty and Robustness in Large-Scale Intelligent Data Fusion Systems
    Marlin, Benjamin M.
    Abdelzaher, Tarek
    Ciocarlie, Gabriela
    Cobb, Adam D.
    Dennison, Mark
    Jalaian, Brian
    Kaplan, Lance
    Raber, Tiffany
    Raglin, Adrienne
    Sharma, Piyush K.
    Srivastava, Mani
    Trout, Theron
    Vadera, Meet P.
    Wigness, Maggie
    [J]. 2020 IEEE SECOND INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2020), 2020, : 82 - 91
  • [2] Optimizing checkpoint data placement with guaranteed burst buffer endurance in large-scale hierarchical storage systems
    Wan, Lipeng
    Cao, Qing
    Wang, Feiyi
    Oral, Sarp
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 100 : 16 - 29
  • [3] Robustness of large-scale distributed computer systems
    Khoroshevsky, VG
    [J]. EUROSIM '96 - HPCN CHALLENGES IN TELECOMP AND TELECOM: PARALLEL SIMULATION OF COMPLEX SYSTEMS AND LARGE-SCALE APPLICATIONS, 1996, : 141 - 150
  • [4] A Data Storage Approach for Large-Scale Distributed Medical Systems
    de Macedo, Douglas D. J.
    von Wangenheim, Aldo
    Dantas, Mario A. R.
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS CISIS 2015, 2015, : 486 - 490
  • [5] Modeling and optimizing large-scale data flows
    Woehrer, Alexander
    Brezany, Peter
    Janciak, Ivan
    Mehofer, Eduard
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 31 : 12 - 27
  • [6] OPTIMIZING THE DEVELOPMENT OF LARGE-SCALE SYSTEMS STRUCTURES
    LUKIN, NV
    LUKOYANOVA, GN
    FILIPPOV, VA
    [J]. AUTOMATION AND REMOTE CONTROL, 1987, 48 (11) : 1551 - 1559
  • [7] Impact of Data Placement on Resilience in Large-Scale Object Storage Systems
    Carns, Philip
    Harms, Kevin
    Jenkins, John
    Mubarak, Misbah
    Ross, Robert
    Carothers, Christopher
    [J]. 2016 32ND SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2016,
  • [8] Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems
    Tessier, Francois
    Gressier, Paul
    Vishwanath, Venkatram
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 229 - 239
  • [9] Optimizing data stream processing for large-scale applications
    Cappellari, Paolo
    Roantree, Mark
    Chun, Soon Ae
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09) : 1607 - 1641
  • [10] Optimizing of metadata management in large-scale file systems
    Nae Young Song
    Hwajung Kim
    Hyuck Han
    Heon Young Yeom
    [J]. Cluster Computing, 2018, 21 : 1865 - 1879