Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems

被引:0
|
作者
Wang, Zhufan [1 ]
Zhang, Guangyan [1 ]
Wang, Yang [2 ]
Yang, Qinglin [1 ]
Zhu, Jiaji [3 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Ohio State Univ, Columbus, OH 43210 USA
[3] Alibaba Cloud, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper tries to accelerate data recovery in a large-scale storage system with minimal interference to foreground traffic. By investigating I/O and failure traces from a real-world large-scale storage system, we find that because of the scale of the system and the imbalanced and dynamic foreground traffic, no existing recovery protocols can generate a high-quality recovery strategy in a short time. To address this problem, this paper proposes Dayu, a timeslot-based recovery protocol, which only schedules a subset of tasks which are expected to finish in one timeslot: this approach reduces the computation overhead and naturally can cope with the dynamic foreground traffic. In each timeslot, Dayu incorporates four key algorithms, which enhance existing solutions with heuristics motivated by our trace analysis. Our evaluations in a 1,000-node real cluster and in a 25,000-node simulation both confirm that Dayu can outperform existing recovery protocols, achieving high speed and high quality.
引用
收藏
页码:993 / 1007
页数:15
相关论文
共 50 条
  • [31] Verification of parity data in large scale storage systems
    Schwarz, T
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 508 - 514
  • [32] EDIndex: Enabling Fast Data Queries in Edge Storage Systems
    He, Qiang
    Tan, Siyu
    Chen, Feifei
    Xu, Xiaolong
    Qi, Lianyong
    Hei, Xinhong
    Jin, Hai
    Yang, Yun
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 675 - 685
  • [33] Core vector machines: Fast SVM training on very large data sets
    Tsang, IW
    Kwok, JT
    Cheung, PM
    JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 363 - 392
  • [34] A fast and accurate bundle adjustment method for very large-scale data
    Zheng, Maoteng
    Zhang, Fayong
    Zhu, Junfeng
    Zuo, Zejun
    COMPUTERS & GEOSCIENCES, 2020, 142
  • [35] ESet: Placing Data towards Efficient Recovery for Large-scale Erasure-Coded Storage Systems
    Liu, Chengjian
    Chu, Xiaowen
    Liu, Hai
    Leung, Yiu-Wing
    2016 25TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2016,
  • [36] Very Fast Interactive Visualization of Large Sets of High-dimensional Data
    Dzwinel, Witold
    Wcislo, Rafal
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 572 - 581
  • [37] Fast SVM training using data reconstruction for classification of very large datasets
    Liang, Peileng
    Li, Weite
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (03) : 372 - 381
  • [38] STRUCTURAL AND DYNAMIC ANALYSIS OF VERY LARGE SYSTEMS USING A FAST PARALLEL ALGORITHM
    ELMER, KH
    APPLICATIONS OF SUPERCOMPUTERS IN ENGINEERING : FLUID FLOW AND STRESS ANALYSIS APPLICATIONS, 1989, : 229 - 238
  • [39] TrajS']jStore: An Adaptive Storage System for Very Large Trajectory Data Sets
    Cudre-Mauroux, Philippe
    Wu, Eugene
    Madden, Samuel
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 109 - 120
  • [40] vFFR: A Very Fast Failure Recovery Strategy Implemented in Devices With Programmable Data Plane
    Franco, David
    Higuero, Marivi
    Sanz, Ane
    Unzilla, Juanjo
    Huarte, Maider
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 7121 - 7146