Parallelization Framework of Root Cause Analysis Based on Global Cache and Global Lock

Cited by: 0
Authors
Lu, Ming [1]
Wang, Youyan [2]
Zhang, Zhonghongyu [2]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Lenovo, BT IT, Infrastruct & Cloud Serv, Beijing, Peoples R China
Keywords
cloud computing; root cause analysis; orchestration; health check; enterprise application
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The end-to-end time to resolve a system failure can be split into the time spent on root cause analysis (RCA) and the time spent on problem resolution. In fault diagnosis for complex enterprise applications and large-scale clusters, RCA takes a long time because of the scale and complexity of the business, which directly causes missed SLAs. To make problem discovery more efficient, this paper presents an asynchronous parallel framework that establishes concurrent detection or orchestration jobs based on the dependency graph of various types of resources, and parallel analysis jobs based on the parallel access capability of a specific resource bearer. Since most anomalies have only one or two root causes, the framework's message notification mechanism is designed to support two patterns: one that reports immediately after discovery, and a full-path coverage pattern. Together these allow root causes to be reported early and humans to intervene early. Thus, the RCA duration is reduced and the end-to-end resolution time of system failure is shortened.
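The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' framework. It assumes a small hypothetical dependency graph (`DEPENDS_ON`), a simulated health check (`probe`), and an in-process dictionary plus `asyncio.Lock` standing in for the global cache and global lock named in the title. Dependencies are checked concurrently, each resource is probed once even when several jobs need it, and a root cause candidate is reported through a callback as soon as it is found, while the full traversal still completes.

```python
import asyncio

# Hypothetical dependency graph: resource -> resources it depends on.
DEPENDS_ON = {
    "web": ["db", "queue"],
    "db": ["disk"],
    "queue": ["disk"],
    "disk": [],
}

# Hypothetical set of resources whose own health check fails.
UNHEALTHY = {"disk"}

# Records every probe so it is visible that shared resources are probed once.
probed = []


async def probe(resource):
    """Stand-in for a real health check; True means the resource itself is fine."""
    probed.append(resource)
    await asyncio.sleep(0.01)  # simulate I/O latency
    return resource not in UNHEALTHY


async def analyze(root, on_root_cause):
    """Check `root` and everything it depends on; return {resource: healthy}."""
    cache = {}             # shared cache: resource -> task performing its check
    lock = asyncio.Lock()  # shared lock guarding lookups and inserts in the cache

    async def check(resource):
        # Check all dependencies concurrently before judging this resource.
        dep_ok = await asyncio.gather(*(get(d) for d in DEPENDS_ON[resource]))
        own_ok = await probe(resource)
        # A failing resource with healthy dependencies is a root cause candidate:
        # report it immediately instead of waiting for the whole traversal.
        if not own_ok and all(dep_ok):
            on_root_cause(resource)
        return own_ok and all(dep_ok)

    async def get(resource):
        # Hold the lock only while consulting the cache, never while awaiting
        # the check itself, so concurrent jobs cannot deadlock on each other.
        async with lock:
            task = cache.get(resource)
            if task is None:
                task = asyncio.create_task(check(resource))
                cache[resource] = task
        return await task

    await get(root)
    # Full-path result: the health verdict for every resource that was visited.
    return {name: task.result() for name, task in cache.items()}


if __name__ == "__main__":
    asyncio.run(analyze("web", lambda r: print("root cause candidate:", r)))
```

In this sketch the callback plays the role of the immediate-report pattern and the returned dictionary plays the role of the full-path coverage pattern. In a distributed deployment the dictionary and lock would presumably be replaced by a shared store and a distributed lock; that substitution is an assumption, not something the abstract states.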
Pages: 174-178
Page count: 5