Proactive Data Migration for Improved Storage Availability in Large-Scale Data Centers

被引:24
|
作者
Wu, Suzhen [1 ,2 ]
Jiang, Hong [3 ]
Mao, Bo [4 ]
机构
[1] Xiamen Univ, Dept Comp Sci, Xiamen 361005, Fujian, Peoples R China
[2] State Key Lab High End Server & Storage Technol, Jinan, Shandong, Peoples R China
[3] Univ Nebraska, Dept Comp Sci & Engn, Lincoln, NE 68588 USA
[4] Xiamen Univ, Software Sch, Xiamen 361005, Fujian, Peoples R China
关键词
Low-priority background tasks; availability; proactive; temporal and spatial locality; RAID reconstruction; DISK FAILURE; RECOVERY;
D O I
10.1109/TC.2014.2366734
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In face of high partial and complete disk failure rates and untimely system crashes, the executions of low-priority background tasks become increasingly frequent in large-scale data centers. However, the existing algorithms are all reactive optimizations and only exploit the temporal locality of workloads to reduce the user I/O requests during the low-priority background tasks. To address the problem, this paper proposes Intelligent Data Outsourcing (IDO), a zone-based and proactive data migration optimization, to significantly improve the efficiency of the low-priority background tasks. The main idea of IDO is to proactively identify the hot data zones of RAID-structured storage systems in the normal operational state. By leveraging the prediction tools to identify the upcoming events, IDO proactively migrates the data blocks belonging to the hot data zones on the degraded device to a surrogate RAID set in the large-scale data centers. Upon a disk failure or crash reboot, most user I/O requests addressed to the degraded RAID set can be serviced directly by the surrogate RAID set rather than the much slower degraded RAID set. Consequently, the performance of the background tasks and user I/O performance during the background tasks are improved simultaneously. Our lightweight prototype implementation of IDO and extensive trace-driven experiments on two case studies demonstrate that, compared with the existing state-of-the-art approaches, IDO effectively improves the performance of the low-priority background tasks. Moreover, IDO is portable and can be easily incorporated into any existing algorithms for RAID-structured storage systems.
引用
收藏
页码:2637 / 2651
页数:15
相关论文
共 50 条
  • [31] Data Migration in Large Scale Heterogeneous Storage Systems with Nodes to Spare
    Kari, Chadi
    Chen, Sixia
    Amir-Mohammadian, Sepehr
    Pallipuram, Vivek
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 854 - 858
  • [32] Large-scale data visualization
    Ma, KL
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (04) : 22 - 23
  • [33] Accelerate Large-Scale Seismic Data Kirchhoff Time Migration in Spark
    Tian, Yang
    Liu, Chao
    Yan, Haihua
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM2018), 2018, : 41 - 45
  • [34] Marbor: A Novel Large-Scale Graph Data Storage and Processing Framework
    Zhou, Wei
    Gao, Yun
    Han, Jizhong
    Xu, Zhiyong
    [J]. 2014 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2014,
  • [35] Large-scale electrophysiology: Acquisition, compression, encryption, and storage of big data
    Brinkmann, Benjamin H.
    Bower, Mark R.
    Stengel, Keith A.
    Worrell, Gregory A.
    Stead, Matt
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2009, 180 (01) : 185 - 192
  • [36] MetHoS: a platform for large-scale processing, storage and analysis of metabolomics data
    Konstantinos Tzanakis
    Tim W. Nattkemper
    Karsten Niehaus
    Stefan P. Albaum
    [J]. BMC Bioinformatics, 23
  • [37] Efficient data reconstruction: The bottleneck of large-scale application of DNA storage
    Cao, Ben
    Zheng, Yanfen
    Shao, Qi
    Liu, Zhenlu
    Xie, Lei
    Zhao, Yunzhu
    Wang, Bin
    Zhang, Qiang
    Wei, Xiaopeng
    [J]. CELL REPORTS, 2024, 43 (04):
  • [38] Impact of Data Placement on Resilience in Large-Scale Object Storage Systems
    Carns, Philip
    Harms, Kevin
    Jenkins, John
    Mubarak, Misbah
    Ross, Robert
    Carothers, Christopher
    [J]. 2016 32ND SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2016,
  • [39] MetHoS: a platform for large-scale processing, storage and analysis of metabolomics data
    Tzanakis, Konstantinos
    Nattkemper, Tim W.
    Niehaus, Karsten
    Albaum, Stefan P.
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [40] MatrixDCN: a high performance network architecture for large-scale cloud data centers
    Sun, Yantao
    Chen, Min
    Peng, Limei
    Hassan, Mohammad Mehedi
    Alelaiwi, Abdulhameed
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2016, 16 (08): : 942 - 959