Root-Cause Metric Location for Microservice Systems via Log Anomaly Detection

Cited by: 36
Authors
Wang, Lingzhi [1, 2]
Zhao, Nengwen [2]
Chen, Junjie [1]
Li, Pinnong [2]
Zhang, Wenchi [3]
Sui, Kaixin [3]
Affiliations
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] BizSeer, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DIAGNOSIS;
DOI
10.1109/ICWS49710.2020.00026
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Microservice systems are typically fragile, and failures are inevitable given their complexity and large scale. However, localizing the root-cause metric is challenging because of the complicated dependencies in such systems and the huge number of metrics. Existing methods rely on either the correlation between metrics or the correlation between metrics and failures, and all of them ignore a key data source in microservice systems: logs. In this paper, we propose a novel root-cause metric localization approach that incorporates log anomaly detection. Our approach is based on a key observation: the value of the root-cause metric should change along with the log anomaly score of the system when a failure occurs. Specifically, our approach has two components: collecting anomaly scores with a log anomaly detection algorithm, and identifying the root-cause metric through robust correlation analysis with data augmentation. Experiments on an open-source benchmark microservice system demonstrate that our approach identifies root-cause metrics more accurately than existing methods and requires only a short localization time. Our approach can therefore save engineers much effort in diagnosing and mitigating failures as quickly as possible.
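The key observation in the abstract (the root-cause metric moves together with the system's log anomaly score) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the log anomaly detector, the "robust correlation analysis", or the data augmentation step, so the sketch assumes the anomaly scores are already available and substitutes plain Spearman rank correlation. The function names (`ranks`, `pearson`, `spearman`, `rank_metrics`) and the sample metric names and values are hypothetical.

```python
from math import sqrt


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; returns 0.0 when either series is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def rank_metrics(anomaly_scores, metrics):
    """Sort metric names by |Spearman correlation| with the anomaly score."""
    scored = [
        (name, abs(spearman(anomaly_scores, series)))
        for name, series in metrics.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical data: one log anomaly score per time window, and three
# metric series sampled over the same windows.
anomaly_scores = [0.1, 0.1, 0.2, 0.9, 0.8, 0.7, 0.2, 0.1]
metrics = {
    "cpu_usage": [20, 22, 25, 90, 85, 80, 30, 21],
    "disk_io": [5, 5, 5, 5, 5, 5, 5, 5],
    "net_in": [10, 12, 9, 11, 10, 12, 9, 11],
}

ranking = rank_metrics(anomaly_scores, metrics)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Rank correlation is used here only because it is less sensitive to outliers and scale than Pearson correlation on raw values; whether it matches the paper's robust correlation analysis is not stated in this record.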
Pages: 142 - 150
Number of pages: 9