Cloud failure prediction based on traditional machine learning and deep learning

Cited by: 9
Authors
Asmawi, Tengku Nazmi Tengku [1]
Ismail, Azlan [1, 2]
Shen, Jun [3]
Affiliations
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci FSKM, Shah Alam 40450, Selangor, Malaysia
[2] Univ Teknol MARA UiTM, Kompleks Al Khawarizmi, Inst Big Data Analyt & Artificial Intelligence IB, Shah Alam 40450, Selangor, Malaysia
[3] Univ Wollongong, Fac Engn & Informat Sci, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
Keywords
Cloud computing; Job and task failure; Failure prediction; Deep learning; Machine learning; ARCHITECTURE;
DOI
10.1186/s13677-022-00327-0
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Cloud failure is a critical issue: it can cost cloud service providers millions of dollars, in addition to the productivity lost by industrial users. Fault tolerance management is the key approach to addressing this issue, and failure prediction is one technique for preventing failures from occurring. A main challenge in failure prediction is producing a highly accurate predictive model. Although some failure prediction models have been proposed, a comprehensive evaluation of models based on different types of machine learning algorithms is still lacking. Therefore, in this paper, we present a comprehensive comparison and evaluation of predictive models for job and task failure. These models are built and trained using five traditional machine learning algorithms and three variants of deep learning algorithms. We use a benchmark dataset, Google Cloud Traces, to train and test the models. We evaluate the models' performance using multiple metrics, determine their important features, and measure their scalability. Our analysis yields the following findings. First, for job failure prediction, Extreme Gradient Boosting produces the best model, with the disk space request and CPU request as the most important features influencing the prediction. Second, for task failure prediction, Decision Tree and Random Forest produce the best models, with task priority as the most important feature for both. Finally, our scalability analysis shows that the Logistic Regression model is the most scalable of the models compared.
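The abstract mentions evaluating models with multiple metrics but does not name them here. As a rough illustration only, the sketch below assumes accuracy, precision, recall, and F1 on a binary "failed / not failed" label and compares two hypothetical models; the labels, predictions, and model names are invented for the example and are not taken from the paper or from Google Cloud Traces.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary failure label."""
    counts = Counter()
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    return counts


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall, and F1 (assumed metrics, see lead-in)."""
    c = confusion_counts(y_true, y_pred, positive)
    tp, fp, fn, tn = c["tp"], c["fp"], c["fn"], c["tn"]
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical ground truth: 1 = job failed, 0 = job finished normally.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]

# Hypothetical predictions from two placeholder models.
predictions = {
    "model_a": [1, 0, 1, 0, 0, 0, 1, 0],
    "model_b": [1, 1, 1, 1, 0, 1, 1, 0],
}

scores = {name: evaluate(y_true, y_pred) for name, y_pred in predictions.items()}
best = max(scores, key=lambda name: scores[name]["f1"])

for name, metrics in scores.items():
    print(name, {k: round(v, 3) for k, v in metrics.items()})
print("best by F1:", best)
```

Ranking by F1 is one possible choice among several; with imbalanced failure data, a single metric such as accuracy can be misleading, which is one reason to report more than one.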
Pages: 19
Related papers
50 in total
  • [31] Prediction of Potential Targets of Traditional Chinese Medicine Based on Machine Learning
    Cong, Chunyu
    Zhang, Xu
    Li, Lijing
    INTERNATIONAL SYMPOSIUM ON THE FRONTIERS OF BIOTECHNOLOGY AND BIOENGINEERING (FBB 2019), 2019, 2110
  • [32] NOx Prediction Method Based on Deep Extreme Learning Machine
    Li, Ying
    Li, Fanjun
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2018, : 97 - 101
  • [33] Life Prediction Model of Machine Tool based on Deep Learning
    HE Jiawei
    ZHAO Chendi
    GAO Ruiyu
    LIU Xuehui
    WANG Xue
    International Journal of Plant Engineering and Management, 2021, 26 (01) : 1 - 15
  • [34] The application of traditional machine learning and deep learning techniques in mammography: a review
    Gao, Ying'e
    Lin, Jingjing
    Zhou, Yuzhuo
    Lin, Rongjin
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [35] Deep Learning Approach Versus Traditional Machine Learning for ADHD Classification
    Cicek, Gulay
    Akan, Aydin
    TIP TEKNOLOJILERI KONGRESI (TIPTEKNO'21), 2021
  • [36] Traditional Machine Learning and Deep Learning Modeling for Legume Species Recognition
    Rimi I.F.
    Habib M.T.
    Supriya S.
    Khan M.A.A.
    Hossain S.A.
    SN Computer Science, 3 (6)
  • [37] Fruit Classification Using Traditional Machine Learning and Deep Learning Approach
    Saranya, N.
    Srinivasan, K.
    Kumar, S. K. Pravin
    Rukkumani, V
    Ramya, R.
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 79 - 89
  • [38] A Comprehensive Review on Crop Disease Prediction Based on Machine Learning and Deep Learning Techniques
    Patil, Manoj A.
    Manohar, M.
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 481 - 503
  • [39] Software Defect Prediction Based on Machine Learning and Deep Learning Techniques: An Empirical Approach
    Albattah, Waleed
    Alzahrani, Musaad
    AI, 2024, 5 (04) : 1743 - 1758
  • [40] Machine Learning and Deep Learning Based Traffic Classification and Prediction in Software Defined Networking
    Mohammed, Ayse Rumeysa
    Mohammed, Shady A.
    Shirmohammadi, Shervin
    2019 IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENTS & NETWORKING (M&N 2019), 2019