Developing Cost-Effective Data Rescue Schemes to Tackle Disk Failures in Data Centers

被引:3
|
作者
Qiao, Zhi [1 ]
Hochstetler, Jacob [1 ]
Liang, Shuwen [1 ]
Fu, Song [1 ]
Chen, Hsing-Bung [2 ]
Settlemyer, Bradley [2 ]
机构
[1] Univ North Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
[2] Los Alamos Natl Lab, HPC Grp, Los Alamos, NM 87545 USA
来源
BIG DATA - BIGDATA 2018 | 2018年 / 10968卷
关键词
Storage reliability; Data protection; Failure prediction; RAID; ARRAYS;
D O I
10.1007/978-3-319-94301-5_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring the reliability of large-scale storage systems remains a challenge, especially when there are millions of disk drives deployed. Post-failure disk rebuild takes much longer time nowadays due to the ever-increasing disk capacity, which also increases the risk of service unavailability and even data loss. In this paper, we present a proactive data protection (PDP) framework in the ZFS file system to rescue data from disks before actual failure onset. By reducing the risk of data loss and mitigating the prolonged disk rebuilds caused by disk failures, PDP is designed to enhance the overall storage reliability. We extensively evaluate the recovery performance of ZFS with diverse configurations, and further explore disk failure prediction techniques to develop a proactive data protection mechanism in ZFS. We further compare the performance of different data protection strategies, including post-failure disk recovery, proactive disk cloning, and proactive data recovery. We propose an analytic model that uses storage utilization and contextual system information to select the best data protection strategy to achieve cost-effective and enhanced storage reliability.
引用
收藏
页码:194 / 208
页数:15
相关论文
共 50 条
  • [21] CoShare: A Cost-effective Data Sharing System for Data Center Networks
    Zhuang, Hao
    Filali, Imen
    Rahman, Rameez
    Aberer, Karl
    [J]. 2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015, : 11 - 18
  • [22] Expandable and Cost-Effective Network Structures for Data Centers Using Dual-Port Servers
    Guo, Deke
    Chen, Tao
    Li, Dan
    Li, Mo
    Liu, Yunhao
    Chen, Guihai
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (07) : 1303 - 1317
  • [23] Are poison control centers cost-effective?
    Williams, RM
    [J]. ANNALS OF EMERGENCY MEDICINE, 1997, 29 (02) : 246 - 247
  • [24] Are Certified Breast Centers Cost-Effective?
    Beckmann, Matthias W.
    Bani, Mayada R.
    Loehberg, Christian R.
    Hildebrandt, Thomas
    Schrauder, Michael G.
    Wagner, Stefanie
    Fasching, Peter A.
    Lux, Michael Patrick
    [J]. BREAST CARE, 2009, 4 (04) : 245 - 250
  • [25] Cost-effective Data Upkeep in Decentralized Storage Systems
    Nygaard, Racin
    Meling, Hein
    Olsen, John Ingve
    [J]. 38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 165 - 173
  • [26] Cost-Effective Data Aggregation Method for Smart Grid
    Hsu, Hsi-Chou
    Zhuang, Shi-Ren
    Huang, Yung-Fa
    [J]. ELECTRONICS, 2021, 10 (23)
  • [27] Cost-Effective App Data Distribution in Edge Computing
    Xia, Xiaoyu
    Chen, Feifei
    He, Qiang
    Grundy, John C.
    Abdelrazek, Mohamed
    Jin, Hai
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (01) : 31 - 44
  • [28] EXCEPTIONALLY COST-EFFECTIVE ERROR CONTROL FOR DATA COMMUNICATIONS
    GIBSON, ED
    [J]. PROCEEDINGS OF THE INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, 1966, 54 (06): : 908 - &
  • [29] Enabling Cost-effective Data Processing with Smart SSD
    Kang, Yangwook
    Kee, Yang-suk
    Miller, Ethan L.
    Park, Chanik
    [J]. 2013 IEEE 29TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2013,
  • [30] DATA SHOWS MEDICAL NUTRITION THERAPY IS COST-EFFECTIVE
    FOLKMAN, JW
    [J]. JOURNAL OF THE AMERICAN DIETETIC ASSOCIATION, 1994, 94 (09) : 966 - 968