Proactive Data Migration for Improved Storage Availability in Large-Scale Data Centers

被引:24
|
作者
Wu, Suzhen [1 ,2 ]
Jiang, Hong [3 ]
Mao, Bo [4 ]
机构
[1] Xiamen Univ, Dept Comp Sci, Xiamen 361005, Fujian, Peoples R China
[2] State Key Lab High End Server & Storage Technol, Jinan, Shandong, Peoples R China
[3] Univ Nebraska, Dept Comp Sci & Engn, Lincoln, NE 68588 USA
[4] Xiamen Univ, Software Sch, Xiamen 361005, Fujian, Peoples R China
关键词
Low-priority background tasks; availability; proactive; temporal and spatial locality; RAID reconstruction; DISK FAILURE; RECOVERY;
D O I
10.1109/TC.2014.2366734
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In face of high partial and complete disk failure rates and untimely system crashes, the executions of low-priority background tasks become increasingly frequent in large-scale data centers. However, the existing algorithms are all reactive optimizations and only exploit the temporal locality of workloads to reduce the user I/O requests during the low-priority background tasks. To address the problem, this paper proposes Intelligent Data Outsourcing (IDO), a zone-based and proactive data migration optimization, to significantly improve the efficiency of the low-priority background tasks. The main idea of IDO is to proactively identify the hot data zones of RAID-structured storage systems in the normal operational state. By leveraging the prediction tools to identify the upcoming events, IDO proactively migrates the data blocks belonging to the hot data zones on the degraded device to a surrogate RAID set in the large-scale data centers. Upon a disk failure or crash reboot, most user I/O requests addressed to the degraded RAID set can be serviced directly by the surrogate RAID set rather than the much slower degraded RAID set. Consequently, the performance of the background tasks and user I/O performance during the background tasks are improved simultaneously. Our lightweight prototype implementation of IDO and extensive trace-driven experiments on two case studies demonstrate that, compared with the existing state-of-the-art approaches, IDO effectively improves the performance of the low-priority background tasks. Moreover, IDO is portable and can be easily incorporated into any existing algorithms for RAID-structured storage systems.
引用
收藏
页码:2637 / 2651
页数:15
相关论文
共 50 条
  • [1] An overview of a large-scale data migration
    Lübeck, M
    Geppert, D
    Nienartowicz, K
    Nowak, M
    Valassi, A
    [J]. 20TH IEEE/11TH NASA GODDARD CONFERENCE ON MASS STORAGE AND TECHNOLOGIES (MSST 2003), PROCEEDINGS, 2003, : 49 - 55
  • [2] Network Virtualization for Large-Scale Data Centers
    Ando, Tatsuhiro
    Shimokuni, Osamu
    Asano, Katsuhito
    [J]. FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2013, 49 (03): : 292 - 299
  • [3] Improved Phasing and Imputation for Large-Scale Data
    Browning, Brian L.
    Browning, Sharon R.
    Tian, Xiaowen
    [J]. GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 673 - 673
  • [4] DRAM Failure Prediction in Large-Scale Data Centers
    Yu, Fengyuan
    Xu, Hongzuo
    Jian, Songlei
    Huang, Chenlin
    Wang, Yijie
    Wu, Zhiyue
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2021) / 2021 9TH IEEE INTERNATIONAL CONFERENCE ON MOBILE CLOUD COMPUTING, SERVICES, AND ENGINEERING (MOBILECLOUD 2021), 2021, : 1 - 8
  • [5] Metadata Exploitation in Large-scale Data Migration Projects
    Narayanan, Ram
    Oberhofer, Martin
    Pandit, Sushain
    [J]. AMCIS 2012 PROCEEDINGS, 2012,
  • [6] Optimizing data robustness in large-scale storage systems
    Gougeaud, Sebastien
    Zertal, Soraya
    Lafoucriere, Jacques-Charles
    Deniel, Philippe
    [J]. 2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 236 - 243
  • [7] Secure large-scale genome data storage and query
    Chen, Luyao
    Aziz, Md Momin
    Mohammed, Noman
    Jiang, Xiaoqian
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 165 : 129 - 137
  • [8] An Improved LSTM-Based Prediction Approach for Resources and Workload in Large-Scale Data Centers
    Yuan, Haitao
    Bi, Jing
    Li, Shuang
    Zhang, Jia
    Zhou, MengChu
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 22816 - 22829
  • [9] HHS: an efficient network topology for large-scale data centers
    Sadoon Azizi
    Naser Hashemi
    Ahmad Khonsari
    [J]. The Journal of Supercomputing, 2016, 72 : 874 - 899
  • [10] OpenPOWER: Reengineering a server ecosystem for large-scale data centers
    Gschwind, Michael
    [J]. 2014 IEEE HOT CHIPS 26 SYMPOSIUM (HCS), 2014,