Fault Tolerance of Cloud Infrastructure with Machine Learning

被引:2
|
作者
Kalaskar, Chetankumar [1 ]
Thangam, S. [1 ]
机构
[1] Amrita Vishwavidyapeetam, Amrita Sch Comp, Dept Comp Sci & Engn, Bangalore 560035, Karnataka, India
关键词
Cloud computing; Fault tolerance; Machine learning; Reliability of cloud;
D O I
10.2478/cait-2023-0034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enhancing the fault tolerance of cloud systems and accurately forecasting cloud performance are pivotal concerns in cloud computing research. This research addresses critical concerns in cloud computing by enhancing fault tolerance and forecasting cloud performance using machine learning models. Leveraging the Google trace dataset with 10000 cloud environment records encompassing diverse metrics, we systematically have employed machine learning algorithms, including linear regression, decision trees, and gradient boosting, to construct predictive models. These models have outperformed baseline methods, with C5.0 and XGBoost showing exceptional accuracy, precision, and reliability in forecasting cloud behavior. Feature importance analysis has identified the ten most influential factors affecting cloud system performance. This work significantly advances cloud optimization and reliability, enabling proactive monitoring, early performance issue detection, and improved fault tolerance. Future research can further refine these predictive models, enhancing cloud resource management and ultimately improving service delivery in cloud computing.
引用
收藏
页码:26 / 50
页数:25
相关论文
共 50 条
  • [41] Fault-tolerance in a Boltzmann machine
    Price, CC
    Hanks, JB
    Stephens, JN
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1326 - 1331
  • [42] Multi Level Fault Tolerance in Cloud Environment
    Devi, K.
    Paulraj, D.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 824 - 828
  • [43] Fault-Tolerance in the Scope of Cloud Computing
    Rehman, A. U.
    Aguiar, Rui L.
    Barraca, Joao Paulo
    IEEE ACCESS, 2022, 10 : 63422 - 63441
  • [44] FAULT TOLERANCE CAPABILITY OF CLOUD DATA CENTER
    Emesowum, Humphrey
    Paraskelidis, Athanasios
    Adda, Mo
    2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2017, : 495 - 502
  • [45] An infrastructure for adaptive fault tolerance on FT-CORBA
    Lung, Lau Cheuk
    Favarim, Fabio
    Santos, Giuhana Teixeira
    Correia, Miguel
    NINTH IEEE INTERNATIONAL SYMPOSIUM ON OBJECT AND COMPONENT-ORIENTED REAL-TIME DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 504 - 511
  • [46] Fault Tolerance Techniques for Scientific Applications In Cloud
    Talwani, Suruchi
    Chana, Inderveer
    2017 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND NETWORKS (TEL-NET), 2017, : 465 - 469
  • [47] Replication Based Fault Tolerance Approach for Cloud
    Agarwal, Kamal K.
    Kotakula, Haribabu
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2022, 2022, 13145 : 163 - 169
  • [48] A Study on Fault Tolerance methods in Cloud Computing
    Ganesh, Amal
    Sandhya, M.
    Shankar, Sharmila
    SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 844 - 849
  • [49] An adaptive fault tolerance strategy for cloud storage
    Yan Xiai
    Zhang Dafang
    Yang Jinmin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (11): : 5290 - 5304
  • [50] A survey of fault tolerance architecture in cloud computing
    Cheraghlou, Mehdi Nazari
    Khadem-Zadeh, Ahmad
    Haghparast, Majid
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 61 : 81 - 92