Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems

被引:0
|
作者
Wang, Zhufan [1 ]
Zhang, Guangyan [1 ]
Wang, Yang [2 ]
Yang, Qinglin [1 ]
Zhu, Jiaji [3 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Ohio State Univ, Columbus, OH 43210 USA
[3] Alibaba Cloud, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper tries to accelerate data recovery in a large-scale storage system with minimal interference to foreground traffic. By investigating I/O and failure traces from a real-world large-scale storage system, we find that because of the scale of the system and the imbalanced and dynamic foreground traffic, no existing recovery protocols can generate a high-quality recovery strategy in a short time. To address this problem, this paper proposes Dayu, a timeslot-based recovery protocol, which only schedules a subset of tasks which are expected to finish in one timeslot: this approach reduces the computation overhead and naturally can cope with the dynamic foreground traffic. In each timeslot, Dayu incorporates four key algorithms, which enhance existing solutions with heuristics motivated by our trace analysis. Our evaluations in a 1,000-node real cluster and in a 25,000-node simulation both confirm that Dayu can outperform existing recovery protocols, achieving high speed and high quality.
引用
收藏
页码:993 / 1007
页数:15
相关论文
共 50 条
  • [41] Properties and uses of storage for enhancing the grid penetration of very large photovoltaic systems
    Solomon, A. A.
    Faiman, D.
    Meron, G.
    ENERGY POLICY, 2010, 38 (09) : 5208 - 5222
  • [42] Fast Reconstruction for Degraded Reads and Recovery Process in Primary Array Storage Systems
    Sung, Baegjae
    Park, Chanik
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (02): : 294 - 303
  • [43] Highly efficient large ammonia systems with very low refrigerant charge
    Vestergaard, Niels P.
    Skovrup, Morten Juel
    Kristofersson, Johannes
    15TH IIR-GUSTAV LORENTZEN CONFERENCE ON NATURAL REFRIGERANTS, 2022, : 872 - 879
  • [44] Simulating Data Flows of Very Large Scale Intelligent Transportation Systems
    Tangirala, Nagacharan Teja
    Sommer, Christoph
    Knoll, Alois
    PROCEEDINGS OF THE 38TH ACM SIGSIM INTERNATIONAL CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION, ACM SIGSIM-PADS 2024, 2024, : 98 - 107
  • [45] Optimizing data robustness in large-scale storage systems
    Gougeaud, Sebastien
    Zertal, Soraya
    Lafoucriere, Jacques-Charles
    Deniel, Philippe
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 236 - 243
  • [46] RAPID: A Fast Data Update Protocol in Erasure Coded Storage Systems for Big Data
    Akash, G. J.
    Lee, Ojus Thomas
    Kumar, S. D. Madhu
    Chandran, Priya
    Cuzzocrea, Alfredo
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 890 - 897
  • [47] Comments on the Core vector machines: Fast SVM training on very large data sets
    LITIS, INSA de Rouen, Avenue de l'Université, 76801 Saint-Etienne du Rouvray, France
    J. Mach. Learn. Res., 2007, (291-301):
  • [48] AUTOMATED MICROFICHE TERMINAL - FAST RETRIEVAL FROM VERY LARGE DATA-BASES
    DUERDEN, F
    GEC-JOURNAL OF SCIENCE & TECHNOLOGY, 1976, 43 (02): : 51 - 60
  • [49] Comments on the "Core Vector Machines: Fast SVM training on very large data sets"
    Loosli, Gaelle
    Canu, Stephane
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 291 - 301
  • [50] Vys: A Protocol for Commensal Fast Transient Searches and Data Processing at the Very Large Array
    Pokorny, Martin
    Law, Casey J.
    Bower, Geoffrey C.
    Burke-Spolaor, Sarah
    Butler, Bryan J.
    Demorest, Paul
    Khudikyan, S. E.
    Lazio, T. Joseph W.
    Robnett, James
    Rupen, Michael
    JOURNAL OF ASTRONOMICAL INSTRUMENTATION, 2018, 7 (2-3)