Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems

被引:0
|
作者
Wang, Zhufan [1 ]
Zhang, Guangyan [1 ]
Wang, Yang [2 ]
Yang, Qinglin [1 ]
Zhu, Jiaji [3 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Ohio State Univ, Columbus, OH 43210 USA
[3] Alibaba Cloud, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper tries to accelerate data recovery in a large-scale storage system with minimal interference to foreground traffic. By investigating I/O and failure traces from a real-world large-scale storage system, we find that because of the scale of the system and the imbalanced and dynamic foreground traffic, no existing recovery protocols can generate a high-quality recovery strategy in a short time. To address this problem, this paper proposes Dayu, a timeslot-based recovery protocol, which only schedules a subset of tasks which are expected to finish in one timeslot: this approach reduces the computation overhead and naturally can cope with the dynamic foreground traffic. In each timeslot, Dayu incorporates four key algorithms, which enhance existing solutions with heuristics motivated by our trace analysis. Our evaluations in a 1,000-node real cluster and in a 25,000-node simulation both confirm that Dayu can outperform existing recovery protocols, achieving high speed and high quality.
引用
收藏
页码:993 / 1007
页数:15
相关论文
共 50 条
  • [21] Data Prefetching for Large Tiered Storage Systems
    Cherubini, Giovanni
    Kim, Yusik
    Lantz, Mark
    Venkatesan, Vinodh
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 823 - 828
  • [22] Lossless compression of very large volume data with fast dynamic access
    Zhao, RK
    Tao, T
    Gabriel, M
    Belford, GG
    ELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY III, 2002, 4925 : 179 - 190
  • [23] Fast SVM training algorithm with decomposition on very large data sets
    Dong, JX
    Krzyzak, A
    Suen, CY
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (04) : 603 - 618
  • [24] Adaptive Low-level Storage of Very Large Knowledge Graphs
    Urbani, Jacopo
    Jacobs, Ceriel
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1761 - 1772
  • [25] Magnetic Disk Storage Subsystems for Very Large Quantity of Data World
    Kaneko, S.
    Denshi Joho Tsushin Gakkai Shi/Journal of the Institute of Electronics, Information and Communications Engineers, 79 (11):
  • [26] A HYBRID STRUCTURE FOR THE STORAGE AND MANIPULATION OF VERY LARGE SPATIAL DATA SETS
    PEUQUET, DJ
    COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1983, 24 (01): : 14 - 27
  • [27] Evaluation of distributed recovery in large-scale storage systems
    Xin, Q
    Miller, EL
    Schwarz, TJE
    13TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2004, : 172 - 181
  • [28] ESetStore: An Erasure-Coded Storage System With Fast Data Recovery
    Liu, Chengjian
    Wang, Qiang
    Chu, Xiaowen
    Leung, Yiu-Wing
    Liu, Hai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (09) : 2001 - 2016
  • [29] Clock and data recovery circuits with fast acquisition and low jitter
    Zhang, RY
    La Rue, GS
    2004 IEEE WORKSHOP ON MICROELECTRONIC AND ELECTRON DEVICES, 2004, : 48 - 51
  • [30] Fast acquisition clock and data recovery circuit with low jitter
    Zhang, RY
    La Rue, GS
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2006, 41 (05) : 1016 - 1024