Eadro: An End-to-End Troubleshooting Framework for Microservices on Multi-source Data

被引:9
|
作者
Lee, Cheryl [1 ]
Yang, Tianyi [1 ]
Chen, Zhuangbin [1 ]
Su, Yuxin [2 ]
Lyu, Michael R. [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Sun Yat Sen Univ, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservices; Root Cause Localization; Anomaly Detection; Traces;
D O I
10.1109/ICSE48619.2023.00150
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The complexity and dynamism of microservices pose significant challenges to system reliability, and thereby, automated troubleshooting is crucial. Effective root cause localization after anomaly detection is crucial for ensuring the reliability of microservice systems. However, two significant issues rest in existing approaches: (1) Microservices generate traces, system logs, and key performance indicators (KPIs), but existing approaches usually consider traces only, failing to understand the system fully as traces cannot depict all anomalies; (2) Troubleshooting microservices generally contains two main phases, i.e., anomaly detection and root cause localization. Existing studies regard these two phases as independent, ignoring their close correlation. Even worse, inaccurate detection results can deeply affect localization effectiveness. To overcome these limitations, we propose Eadro, the first end-to-end framework to integrate anomaly detection and root cause localization based on multi-source data for troubleshooting large-scale microservices. The key insights of Eadro are the anomaly manifestations on different data sources and the close connection between detection and localization. Thus, Eadro models intra-service behaviors and inter-service dependencies from traces, logs, and KPIs, all the while leveraging the shared knowledge of the two phases via multi-task learning. Experiments on two widely-used benchmark microservices demonstrate that Eadro outperforms state-of-the-art approaches by a large margin. The results also show the usefulness of integrating multi-source data. We also release our code and data to facilitate future research.
引用
收藏
页码:1750 / 1762
页数:13
相关论文
共 50 条
  • [1] End-to-End Rotation Averaging with Multi-Source Propagation
    Yang, Luwei
    Li, Heng
    Rahim, Jamal Ahmed
    Cui, Zhaopeng
    Tan, Ping
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11769 - 11778
  • [2] End-to-End Outage Probability Analysis for Multi-Source Multi-Relay Systems
    He, Jiguang
    Hussain, Iqbal
    Juntti, Markku
    Matsumoto, Tad
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [3] An End-to-End Convolutional Recurrent Neural Network with Multi-Source Data Fusion for Sleep Stage Classification
    Toma, Tabassum Islam
    Choi, Sunwoong
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 564 - 569
  • [4] Benchmarks for End-to-End Microservices Testing
    Smith, Sheldon
    Robinson, Ethan
    Frederiksen, Timmy
    Stevens, Trae
    Cerny, Tomas
    Bures, Miroslav
    Taibi, Davide
    2023 IEEE INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED SYSTEM ENGINEERING, SOSE, 2023, : 60 - 66
  • [5] CoSSD - An end-to-end framework for multi-instance source separation and detection
    Baligar, Shrishail
    Newsam, Shawn
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 150 - 154
  • [6] An open-source framework for end-to-end analysis of electronic health record data
    Heumos, Lukas
    Ehmele, Philipp
    Treis, Tim
    Upmeier zu Belzen, Julius
    Roellin, Eljas
    May, Lilly
    Namsaraeva, Altana
    Horlava, Nastassya
    Shitov, Vladimir A.
    Zhang, Xinyue
    Zappia, Luke
    Knoll, Rainer
    Lang, Niklas J.
    Hetzel, Leon
    Virshup, Isaac
    Sikkema, Lisa
    Curion, Fabiola
    Eils, Roland
    Schiller, Herbert B.
    Hilgendorff, Anne
    Theis, Fabian J.
    NATURE MEDICINE, 2024, : 3369 - 3380
  • [7] A New End-to-End Multi-Dimensional CNN Framework for Land Cover/Land Use Change Detection in Multi-Source Remote Sensing Datasets
    Seydi, Seyd Teymoor
    Hasanlou, Mahdi
    Amani, Meisam
    REMOTE SENSING, 2020, 12 (12)
  • [8] An End-to-End Framework for Repairing Potentially Vulnerable Source Code
    Jasz, Judit
    Hegedus, Peter
    Milankovich, Akos
    Ferenc, Rudolf
    2022 IEEE 22ND INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2022), 2022, : 242 - 247
  • [9] An End-To-End Framework for Repairing Potentially Vulnerable Source Code
    Jasz, Judit
    Hegedus, Peter
    Milankovich, Akos
    Ferenc, Rudolf
    Proceedings - 2022 IEEE 22nd International Working Conference on Source Code Analysis and Manipulation, SCAM 2022, 2022, : 242 - 247
  • [10] Intelligent Parking Control Method Based on Multi-Source Sensory Information Fusion and End-to-End Deep Learning
    Ma, Zhenpeng
    Jiang, Haobin
    Ma, Shidian
    Li, Yue
    APPLIED SCIENCES-BASEL, 2023, 13 (08):