Parallelization Framework of Root Cause Analysis Based on Global Cache and Global Lock

Authors
Lu, Ming [1]
Wang, Youyan [2]
Zhang, Zhonghongyu [2]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Lenovo, BT IT, Infrastruct & Cloud Serv, Beijing, Peoples R China
Keywords
cloud computing; root cause analysis; orchestration; health check; enterprise application;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
The end-to-end resolution time of a system failure can be split into the time spent on root cause analysis (RCA) and the time spent on problem resolution. In complex enterprise applications and large-scale cluster fault diagnosis, RCA takes a long time because of the scale and complexity of the business, which directly causes missed SLAs. To make problem discovery more efficient, this paper presents an asynchronous parallel framework that establishes concurrent detection or orchestration jobs based on the dependency graph of the various types of resources, and parallel analysis jobs based on the parallel access capability of a specific resource bearer. Since most anomalies have only one or two root causes, the framework's message notification mechanism is designed both as a pattern that reports a finding immediately after discovery and as a full-path coverage pattern, so that root causes are reported early and humans can intervene early. As a result, the duration of root cause analysis is reduced and the end-to-end resolution time of a system failure is shortened.
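The abstract gives no implementation details, so the following is only a minimal sketch of the general idea: health checks scheduled concurrently along a dependency graph, with a finding reported as soon as it is discovered. The resource names, the `DEPENDENCIES` and `FAULTY` data, the `check` and `analyze` functions, and the rule for flagging a root-cause candidate are all hypothetical and are not taken from the paper. It does not model the global cache or global lock named in the title.

```python
import asyncio

# Hypothetical dependency graph: each resource maps to the resources it depends on.
DEPENDENCIES = {
    "network": [],
    "storage": [],
    "database": ["network", "storage"],
    "app": ["database"],
}

# Hypothetical set of failing resources, used only to simulate health checks.
FAULTY = {"storage", "database"}


async def check(resource):
    """Simulated health check; returns True when the resource is healthy."""
    await asyncio.sleep(0.01)  # stands in for real probe latency
    return resource not in FAULTY


async def analyze(dependencies, report):
    """Check every resource, starting each check once its dependencies finish."""
    done = {name: asyncio.Event() for name in dependencies}
    healthy = {}

    async def run(name):
        # Wait for all upstream checks; independent resources run concurrently.
        for dep in dependencies[name]:
            await done[dep].wait()
        healthy[name] = await check(name)
        # Assumed rule: a failing resource whose dependencies are all healthy
        # is a root-cause candidate and is reported immediately.
        if not healthy[name] and all(healthy[d] for d in dependencies[name]):
            report(name)
        done[name].set()

    await asyncio.gather(*(run(name) for name in dependencies))
    return healthy


findings = []
result = asyncio.run(analyze(DEPENDENCIES, findings.append))
```

In this sketch `report` is called while other checks are still pending, which corresponds to the immediate-report pattern. Returning the complete `healthy` map after every job has finished loosely corresponds to full-path coverage.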
Pages: 174-178
Number of pages: 5