Developing Cost-Effective Data Rescue Schemes to Tackle Disk Failures in Data Centers

被引:3
|
作者
Qiao, Zhi [1 ]
Hochstetler, Jacob [1 ]
Liang, Shuwen [1 ]
Fu, Song [1 ]
Chen, Hsing-Bung [2 ]
Settlemyer, Bradley [2 ]
机构
[1] Univ North Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
[2] Los Alamos Natl Lab, HPC Grp, Los Alamos, NM 87545 USA
来源
BIG DATA - BIGDATA 2018 | 2018年 / 10968卷
关键词
Storage reliability; Data protection; Failure prediction; RAID; ARRAYS;
D O I
10.1007/978-3-319-94301-5_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring the reliability of large-scale storage systems remains a challenge, especially when there are millions of disk drives deployed. Post-failure disk rebuild takes much longer time nowadays due to the ever-increasing disk capacity, which also increases the risk of service unavailability and even data loss. In this paper, we present a proactive data protection (PDP) framework in the ZFS file system to rescue data from disks before actual failure onset. By reducing the risk of data loss and mitigating the prolonged disk rebuilds caused by disk failures, PDP is designed to enhance the overall storage reliability. We extensively evaluate the recovery performance of ZFS with diverse configurations, and further explore disk failure prediction techniques to develop a proactive data protection mechanism in ZFS. We further compare the performance of different data protection strategies, including post-failure disk recovery, proactive disk cloning, and proactive data recovery. We propose an analytic model that uses storage utilization and contextual system information to select the best data protection strategy to achieve cost-effective and enhanced storage reliability.
引用
收藏
页码:194 / 208
页数:15
相关论文
共 50 条
  • [31] Cost-Effective Data Transfer for Mobile Health Care
    Khazbak, Youssef
    Izz, Mostafa
    ElBatt, Tamer
    Fahim, Abdulrahman
    Guirguis, Arsany
    Youssef, Moustafa
    [J]. IEEE SYSTEMS JOURNAL, 2017, 11 (04): : 2663 - 2674
  • [32] Learning a Cost-Effective Strategy on Incomplete Medical Data
    Zhu, Mengxiao
    Zhu, Haogang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 175 - 191
  • [33] DATA ACQUISITION - COST-EFFECTIVE METHODS FOR OBTAINING DATA ON WATER-QUALITY
    MAR, BW
    HORNER, RR
    RICHEY, JS
    PALMER, RN
    LETTENMAIER, DP
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1986, 20 (06) : 545 - 551
  • [34] COST-EFFECTIVE DIALYSIS FOR THE DEVELOPING WORLD
    Gregory, Martin C.
    [J]. ETHNICITY & DISEASE, 2009, 19 (01) : 65 - 67
  • [35] DEVELOPING A COST-EFFECTIVE CARD CAGE
    FLYNN, RJ
    [J]. ELECTRONIC PRODUCTS MAGAZINE, 1981, 24 (05): : 91 - 92
  • [36] Predicting Hard Disk Failures in Data Centers Using Temporal Convolutional Neural Networks
    Burrello, Alessio
    Pagliari, Daniele Jahier
    Bartolini, Andrea
    Benini, Luca
    Macii, Enrico
    Poncino, Massimo
    [J]. EURO-PAR 2020: PARALLEL PROCESSING WORKSHOPS, 2021, 12480 : 277 - 289
  • [37] On developing a fast, cost-effective and non-invasive method to derive data center thermal maps
    Jonas, Michael
    Varsamopoulos, Georgios
    Gupta, Sandeep K. S.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, 2007, : 474 - 475
  • [38] Leveraging technology and data to facilitate development of cost-effective pathways
    Tom, B. S.
    Aksamit, I.
    Del Buono, C. B.
    Tuscher, L.
    Verrilli, D. K.
    Wang, Z.
    Whitlock, K. B.
    Leasure, N. C.
    Bergstrom, K. A.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2011, 29 (15)
  • [39] A fuzzy method for discovering cost-effective actions from data
    Kalanat, Nasrin
    Shamsinejadbabaki, Pirooz
    Saraee, Mohamad
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (02) : 757 - 765
  • [40] Cost-Effective Data Feeds to Blockchains via Workload-Adaptive Data Replication
    Li, Kai
    Tang, Yuzhe
    Chen, Jiaqi
    Yuan, Zhehu
    Xu, Cheng
    Xu, Jianliang
    [J]. PROCEEDINGS OF THE 2020 21ST INTERNATIONAL MIDDLEWARE CONFERENCE (MIDDLEWARE '20), 2020, : 371 - 385