Fault Detection for Cloud Computing Systems with Correlation Analysis

Cited by: 0
Authors
Wang, Tao [1]
Zhang, Wenbo [1]
Wei, Jun [1]
Zhong, Hua [1]
Affiliations
[1] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
Keywords
Software Monitoring; Performance Anomaly; Fault Detection; Cloud Computing
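DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812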
Abstract
The large-scale, dynamic cloud computing environment raises great challenges for fault diagnosis in Web applications. First, fluctuating workloads cause traditional application models to change over time. Moreover, modeling the behavior of complex applications always requires domain knowledge, which is difficult to obtain. Finally, managing large-scale applications manually is impractical for operators. This paper addresses these issues and proposes an automatic fault diagnosis method for Web applications in cloud computing. We propose an online incremental clustering method to recognize access behavior patterns, and use canonical correlation analysis (CCA) to model the correlation between workloads and the metrics of application performance and resource utilization within a specific access behavior pattern. Our method detects anomalies by discovering abrupt changes in the correlation coefficients with an EWMA control chart, and then locates suspicious metrics using a feature selection method that combines ReliefF and SVM-RFE. We validate our method by injecting typical faults into TPC-W, an industry-standard benchmark, and the experimental results demonstrate that it can effectively detect typical faults.
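The detection step in the abstract, flagging an abrupt change in a correlation coefficient with an EWMA control chart, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `ewma_alarms`, the smoothing weight `lam`, the limit width `width`, and the sample correlation values are all assumptions made for illustration, since the abstract gives no parameters. The control limits use the standard time-varying EWMA formula, with the baseline mean and standard deviation estimated from a fault-free window.

```python
import math
from statistics import mean, stdev


def ewma_alarms(baseline, stream, lam=0.2, width=3.0):
    """Return indices of `stream` where the EWMA statistic leaves its control limits.

    baseline: fault-free correlation coefficients used to estimate mean and sigma.
    stream:   newly observed correlation coefficients to monitor.
    lam:      EWMA smoothing weight in (0, 1] (assumed value, not from the paper).
    width:    control-limit width in multiples of sigma (assumed value).
    """
    mu = mean(baseline)
    sigma = stdev(baseline)
    z = mu  # the EWMA statistic starts at the baseline mean
    alarms = []
    for t, x in enumerate(stream, start=1):
        # Exponentially weighted update of the monitored statistic.
        z = lam * x + (1.0 - lam) * z
        # Time-varying half-width of the control band around mu.
        variance_factor = lam / (2.0 - lam) * (1.0 - (1.0 - lam) ** (2 * t))
        half_width = width * sigma * math.sqrt(variance_factor)
        if abs(z - mu) > half_width:
            alarms.append(t - 1)
    return alarms


# Made-up correlation coefficients: stable at first, then a sudden drop.
baseline = [0.90, 0.92, 0.91, 0.89, 0.93, 0.90, 0.91, 0.92]
stream = [0.91, 0.90, 0.92, 0.60, 0.55, 0.50]
print(ewma_alarms(baseline, stream))
```

In this sketch the alarm indices mark the observations at which the smoothed coefficient drifts outside the band. A smaller `lam` reacts more slowly but is less sensitive to single noisy samples.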
Pages: 652-658
Number of pages: 7