Semi-supervised Log-based Anomaly Detection via Probabilistic Label Estimation

被引:73
|
作者
Yang, Lin [1 ]
Chen, Junjie [1 ]
Wang, Zan [1 ]
Wang, Weijing [1 ]
Jiang, Jiajun [1 ]
Dong, Xuyuan [2 ]
Zhang, Wenbin [2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tianjin Univ, Informat & Network Ctr, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Log Analysis; Anomaly Detection; Deep Learning; Probabilistic Estimation; Label;
D O I
10.1109/ICSE43902.2021.00130
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the growth of software systems, logs have become an important data to aid system maintenance. Log-based anomaly detection is one of the most important methods for such purpose, which aims to automatically detect system anomalies via log analysis. However, existing log-based anomaly detection approaches still suffer from practical issues due to either depending on a large amount of manually labeled training data (supervised approaches) or unsatisfactory performance without learning the knowledge on historical anomalies (unsupervised and semi-supervised approaches). In this paper, we propose a novel practical log-based anomaly detection approach, PLELog, which is semi-supervised to get rid of time-consuming manual labeling and incorporates the knowledge on historical anomalies via probabilistic label estimation to bring supervised approaches' superiority into play. In addition, PLELog is able to stay immune to unstable log data via semantic embedding and detect anomalies efficiently and effectively by designing an attention-based (MU neural network. We evaluated PLELog on two most widely-used public datasets, and the results demonstrate the effectiveness of PLELog, significantly outperforming the compared approaches with an average of 181.6% improvement in terms of F1-score. In particular, PLELog has been applied to two real-world systems from our university and a large corporation, further demonstrating its practicability.
引用
下载
收藏
页码:1448 / 1460
页数:13
相关论文
共 50 条
  • [1] PLELog: Semi-supervised Log-based Anomaly Detection via Probabilistic Label Estimation
    Yang, Lin
    Chen, Junjie
    Wang, Zan
    Wang, Weijing
    Jiang, Jiajun
    Dong, Xuyuan
    Zhang, Wenbin
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 230 - 231
  • [2] LogOnline: A Semi-supervised Log-based Anomaly Detector Aided with Online Learning Mechanism
    Wang, Xuheng
    Song, Jiaxing
    Zhang, Xu
    Tang, Junshu
    Gao, Weihe
    Lin, Qingwei
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 141 - 152
  • [3] Semi-supervised log anomaly detection based on bidirectional temporal convolution network
    Yin, Zhichao
    Kong, Xian
    Yin, Chunyong
    COMPUTERS & SECURITY, 2024, 140
  • [4] Inductive Multi-View Semi-Supervised Anomaly Detection via Probabilistic Modeling
    Wang, Zhen
    Fan, Maohong
    Muknahallipatna, Suresh
    Lan, Chao
    2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 257 - 264
  • [5] Semi-Supervised Anomaly Detection Via Neural Process
    Zhou, Fan
    Wang, Guanyu
    Zhang, Kunpeng
    Liu, Siyuan
    Zhong, Ting
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10423 - 10435
  • [6] Incremental Clustering for Semi-Supervised Anomaly Detection applied on Log Data
    Wurzenberger, Markus
    Skopik, Florian
    Landauer, Max
    Greitbauer, Philipp
    Fiedler, Roman
    Kastner, Wolfgang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY (ARES 2017), 2017,
  • [7] SSDLog: a semi-supervised dual branch model for log anomaly detection
    Lu, Siyang
    Han, Ningning
    Wang, Mingquan
    Wei, Xiang
    Lin, Zaichao
    Wang, Dongdong
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 3137 - 3153
  • [8] SSDLog: a semi-supervised dual branch model for log anomaly detection
    Siyang Lu
    Ningning Han
    Mingquan Wang
    Xiang Wei
    Zaichao Lin
    Dongdong Wang
    World Wide Web, 2023, 26 : 3137 - 3153
  • [9] Log-based Anomaly Detection Without Log Parsing
    Van-Hoang Le
    Zhang, Hongyu
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 492 - 504
  • [10] Leveraging Log Instructions in Log-based Anomaly Detection
    Bogatinovski, Jasmin
    Madjarov, Gjorgji
    Nedelkoski, Sasho
    Cardoso, Jorge
    Kao, Odej
    2022 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2022), 2022, : 321 - 326