An Empirical Investigation of Missing Data Handling in Cloud Node Failure Prediction

被引:11
|
作者
Ma, Minghua [1 ]
Liu, Yudong [1 ]
Tong, Yuang [1 ]
Li, Haozhe [1 ]
Zhao, Pu [1 ]
Xu, Yong [1 ]
Zhang, Hongyu [2 ]
He, Shilin [1 ]
Wang, Lu [1 ]
Dang, Yingnong [3 ]
Rajmohan, Saravanakumar [4 ]
Lin, Qingwei [1 ]
机构
[1] Microsoft Res, Beijing, Peoples R China
[2] Univ Newcastle, Callaghan, NSW, Australia
[3] Microsoft Azure, Redmond, WA USA
[4] Microsoft 365, Redmond, WA USA
基金
澳大利亚研究理事会;
关键词
Node failure prediction; Missing data; Cloud systems; INTERPOLATION;
D O I
10.1145/3540250.3558946
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Cloud computing systems have become increasingly popular in recent years. A typical cloud system utilizes millions of computing nodes as the basic infrastructure. Node failure has been identified as one of the most prevalent causes of cloud system downtime. To improve the reliability of cloud systems, many previous studies collected monitoring metrics from nodes and built models to predict node failures before the failures happen. However, based on our experience with large-scale real-world cloud systems in Microsoft, we find that the task of predicting node failure is severely hampered by missing data. There is a large amount of missing data, and the online latest data utilized for prediction is even worse. As a result, the real-time performance of the node prediction model is limited. In this paper, we first characterize the missing data problem for node failure prediction. Then, we evaluate several existing data interpolation approaches, and find that node dimension interpolation approaches outperform time dimension ones and deep learning based interpolation is the best for early prediction. Our findings can help academics and engineers address the missing data problem in cloud node failure prediction and other data-driven software engineering scenarios.
引用
收藏
页码:1453 / 1464
页数:12
相关论文
共 50 条
  • [1] Product failure prediction with missing data
    Kang, Seokho
    Kim, Eunji
    Shim, Jaewoong
    Chang, Wonsang
    Cho, Sungzoon
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2018, 56 (14) : 4849 - 4859
  • [2] AN EMPIRICAL-STUDY COMPARING SEVERAL METHODS OF HANDLING MISSING DATA
    CROMER, FE
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1981, 6 (01): : 41 - 47
  • [3] Handling of Missing Data
    Budhiraja, Pooja
    Kaplan, Bruce
    Mustafa, Reem A.
    [J]. TRANSPLANTATION, 2020, 104 (01) : 24 - 26
  • [4] HANDLING OF MISSING DATA
    Torres, F.
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2011, 109 : 17 - 17
  • [5] Handling missing data
    不详
    [J]. CURRENT PROBLEMS IN CANCER, 2005, 29 (06) : 317 - 325
  • [6] A cautionary note on the use of the missing indicator method for handling missing data in prediction research
    van Smeden, Maarten
    Groenwold, Rolf H. H.
    Moons, Karel GM.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2020, 125 : 188 - 190
  • [7] An Investigation of the Effects of Missing Data Handling Using 'R'-Packages
    Sarkar, Sobhan
    Pramanik, Anima
    Khatedi, Nikhil
    Maiti, J.
    [J]. DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 275 - 284
  • [8] Cloud Failure Prediction with Hierarchical Temporal Memory: An Empirical Assessment
    Riganelli, Oliviero
    Saltarel, Paolo
    Tundo, Alessandro
    Mobilio, Marco
    Mariani, Leonardo
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 785 - 790
  • [9] The prevention and handling of the missing data
    Kang, Hyun
    [J]. KOREAN JOURNAL OF ANESTHESIOLOGY, 2013, 64 (05) : 402 - 406
  • [10] Conservative handling of missing data
    Berger, Vance W.
    [J]. CONTEMPORARY CLINICAL TRIALS, 2012, 33 (03) : 460 - 460