Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems

被引：0

作者：

Wang, Zhufan ^{[1
]}

Zhang, Guangyan ^{[1
]}

Wang, Yang ^{[2
]}

Yang, Qinglin ^{[1
]}

Zhu, Jiaji ^{[3
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Ohio State Univ, Columbus, OH 43210 USA

[3] Alibaba Cloud, Hangzhou, Zhejiang, Peoples R China

来源：

PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE | 2019年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper tries to accelerate data recovery in a large-scale storage system with minimal interference to foreground traffic. By investigating I/O and failure traces from a real-world large-scale storage system, we find that because of the scale of the system and the imbalanced and dynamic foreground traffic, no existing recovery protocols can generate a high-quality recovery strategy in a short time. To address this problem, this paper proposes Dayu, a timeslot-based recovery protocol, which only schedules a subset of tasks which are expected to finish in one timeslot: this approach reduces the computation overhead and naturally can cope with the dynamic foreground traffic. In each timeslot, Dayu incorporates four key algorithms, which enhance existing solutions with heuristics motivated by our trace analysis. Our evaluations in a 1,000-node real cluster and in a 25,000-node simulation both confirm that Dayu can outperform existing recovery protocols, achieving high speed and high quality.

引用

页码：993 / 1007

页数：15

共 50 条

[31] Verification of parity data in large scale storage systems
Schwarz, T
PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 508 - 514
[32] EDIndex: Enabling Fast Data Queries in Edge Storage Systems
He, Qiang
Tan, Siyu
Chen, Feifei
Xu, Xiaolong
Qi, Lianyong
Hei, Xinhong
Jin, Hai
Yang, Yun
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 675 - 685
[33] Core vector machines: Fast SVM training on very large data sets
Tsang, IW
Kwok, JT
Cheung, PM
JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 363 - 392
[34] A fast and accurate bundle adjustment method for very large-scale data
Zheng, Maoteng
Zhang, Fayong
Zhu, Junfeng
Zuo, Zejun
COMPUTERS & GEOSCIENCES, 2020, 142
[35] ESet: Placing Data towards Efficient Recovery for Large-scale Erasure-Coded Storage Systems
Liu, Chengjian
Chu, Xiaowen
Liu, Hai
Leung, Yiu-Wing
2016 25TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2016,
[36] Very Fast Interactive Visualization of Large Sets of High-dimensional Data
Dzwinel, Witold
Wcislo, Rafal
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 572 - 581
[37] Fast SVM training using data reconstruction for classification of very large datasets
Liang, Peileng
Li, Weite
Hu, Jinglu
IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (03) : 372 - 381
[38] STRUCTURAL AND DYNAMIC ANALYSIS OF VERY LARGE SYSTEMS USING A FAST PARALLEL ALGORITHM
ELMER, KH
APPLICATIONS OF SUPERCOMPUTERS IN ENGINEERING : FLUID FLOW AND STRESS ANALYSIS APPLICATIONS, 1989, : 229 - 238
[39] TrajS']jStore: An Adaptive Storage System for Very Large Trajectory Data Sets
Cudre-Mauroux, Philippe
Wu, Eugene
Madden, Samuel
26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 109 - 120
[40] vFFR: A Very Fast Failure Recovery Strategy Implemented in Devices With Programmable Data Plane
Franco, David
Higuero, Marivi
Sanz, Ane
Unzilla, Juanjo
Huarte, Maider
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 7121 - 7146

← 1 2 3 4 5 →