Cloud failure prediction based on traditional machine learning and deep learning

被引:9
|
作者
Asmawi, Tengku Nazmi Tengku [1 ]
Ismail, Azlan [1 ,2 ]
Shen, Jun [3 ]
机构
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci FSKM, Shah Alam 40450, Selangor, Malaysia
[2] Univ Teknol MARA UiTM, Kompleks Al Khawarizmi, Inst Big Data Analyt & Artificial Intelligence IB, Shah Alam 40450, Selangor, Malaysia
[3] Univ Wollongong, Fac Engn & Informat Sci, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
关键词
Cloud computing; Job and task failure; Failure prediction; Deep learning; Machine learning; ARCHITECTURE;
D O I
10.1186/s13677-022-00327-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud failure is one of the critical issues since it can cost millions of dollars to cloud service providers, in addition to the loss of productivity suffered by industrial users. Fault tolerance management is the key approach to address this issue, and failure prediction is one of the techniques to prevent the occurrence of a failure. One of the main challenges in performing failure prediction is to produce a highly accurate predictive model. Although some work on failure prediction models has been proposed, there is still a lack of a comprehensive evaluation of models based on different types of machine learning algorithms. Therefore, in this paper, we propose a comprehensive comparison and model evaluation for predictive models for job and task failure. These models are built and trained using five traditional machine learning algorithms and three variants of deep learning algorithms. We use a benchmark dataset, called Google Cloud Traces, for training and testing the models. We evaluated the performance of models using multiple metrics and determined their important features, as well as measured their scalability. Our analysis resulted in the following findings. Firstly, in the case of job failure prediction, we found that Extreme Gradient Boosting produces the best model where the disk space request and CPU request are the most important features that influence the prediction. Second, for task failure prediction, we found that Decision Tree and Random Forest produce the best models where the priority of the task is the most important feature for both models. Our scalability analysis has determined that the Logistic Regression model is the most scalable as compared to others.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] An efficient plant disease prediction model based on machine learning and deep learning classifiers
    Shinde, Nirmala
    Ambhaikar, Asha
    EVOLUTIONARY INTELLIGENCE, 2025, 18 (01)
  • [42] Modelling on Car-Sharing Serial Prediction Based on Machine Learning and Deep Learning
    Brahimi, Nihad
    Zhang, Huaping
    Dai, Lin
    Zhang, Jianzi
    COMPLEXITY, 2022, 2022
  • [43] CLOUD-BASED MACHINE LEARNING FOR BUS ARRIVAL TIME PREDICTION
    Olczyk, Adrian
    Galuszka, Adam
    CARPATHIAN LOGISTICS CONGRESS (CLC' 2016), 2017, : 173 - 177
  • [44] Cloud-Based Parallel Machine Learning for Tool Wear Prediction
    Wu, Dazhong
    Jennings, Connor
    Terpenny, Janis
    Kumara, Soundar
    Gao, Robert X.
    JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2018, 140 (04):
  • [45] Machine Learning-Based Precipitation Prediction Using Cloud Properties
    Yakubu, Abdulaziz Tunde
    Abayomi, Abdultaofeek
    Chetty, Naven
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 243 - 252
  • [46] Research trends in deep learning and machine learning for cloud computing security
    Alzoubi, Yehia Ibrahim
    Mishra, Alok
    Topcu, Ahmet Ercan
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (05)
  • [47] Machine Learning Does Not Outperform Traditional Statistical Modelling for Kidney Allograft Failure Prediction
    Truchot, A.
    Raynaud, M.
    Kamar, N.
    TRANSPLANTATION, 2023, 107 (07) : 1412 - 1412
  • [48] Machine learning does not outperform traditional statistical modelling for kidney allograft failure prediction
    Truchot, Agathe
    Raynaud, Marc
    Kamar, Nassim
    Naesens, Maarten
    Legendre, Christophe
    Delahousse, Michel
    Thaunat, Olivier
    Buchler, Matthias
    Crespo, Marta
    Linhares, Kamilla
    Orandi, Babak J.
    Akalin, Enver
    Pujol, Gervacio Soler
    Silva, Helio Tedesco
    Gupta, Gaurav
    Segev, Dorry L.
    Jouven, Xavier
    Bentall, Andrew J.
    Stegall, Mark D.
    Lefaucheur, Carmen
    Aubert, Olivier
    Loupy, Alexandre
    KIDNEY INTERNATIONAL, 2023, 103 (05) : 936 - 948
  • [49] Review of machine learning and deep learning models for toxicity prediction
    Guo, Wenjing
    Liu, Jie
    Dong, Fan
    Song, Meng
    Li, Zoe
    Khan, Md Kamrul Hasan
    Patterson, Tucker A.
    Hong, Huixiao
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (21) : 1952 - 1973
  • [50] Dropout prediction in Moocs using deep learning and machine learning
    Basnet, Ram B.
    Johnson, Clayton
    Doleck, Tenzin
    EDUCATION AND INFORMATION TECHNOLOGIES, 2022, 27 (08) : 11499 - 11513