Natural Language Processing-based Model for Log Anomaly Detection

Cited by: 3
Authors
Li, Zezhou [1 ]
Zhang, Jing [1 ]
Zhang, Xianbo [1 ]
Lin, Feng [1 ]
Wang, Chao [1 ]
Cai, Xingye [1 ]
Affiliation
[1] JD Tech, Beijing, Peoples R China
Keywords
log anomaly detection; natural language processing; deep neural networks;
DOI
10.1109/SEAI55746.2022.9832400
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Logs are widely used in the IT industry, and log anomaly detection is essential for identifying the running status of systems. Conventional methods for this problem require sophisticated hand-written rules and intensive manual labor. In this paper, we propose a new model based on natural language processing techniques. To improve feature extraction and the vector quality of log templates, our model employs Part-of-Speech (PoS) tagging and Named Entity Recognition (NER), which reduces reliance on hand-written rules and lets a weight vector derived from NER modify the template vector. The PoS property of each token in the template is analyzed first, which also reduces manual labor and supports better weight allocation. Token weights are then used to modify the template vector, and the final detection on the modified template vectors is performed by deep neural networks (DNNs). The effectiveness of our model is tested on three datasets and compared with two state-of-the-art models. The evaluation results show that our model achieves better log anomaly detection.
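The idea of modifying a template vector with per-token weights can be illustrated with a minimal sketch. This is not the authors' implementation: the toy embeddings, the weight values, and the function name `weighted_template_vector` are all assumptions made for illustration, and a weighted average is only one plausible way to combine token vectors.

```python
from typing import Dict, List, Sequence

# Hypothetical toy embeddings; the paper's actual token vectors are not given in this record.
TOKEN_VECTORS: Dict[str, List[float]] = {
    "connection": [1.0, 0.0],
    "to": [0.0, 1.0],
    "failed": [0.0, 2.0],
}

# Assumed importance weights, standing in for weights a tagger might assign.
TOKEN_WEIGHTS: Dict[str, float] = {"connection": 1.0, "to": 0.0, "failed": 3.0}


def weighted_template_vector(
    tokens: Sequence[str],
    vectors: Dict[str, List[float]],
    weights: Dict[str, float],
) -> List[float]:
    """Return the weight-normalized average of the token vectors of a template."""
    dim = len(next(iter(vectors.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for token in tokens:
        if token not in vectors:
            continue  # skip tokens without an embedding
        w = weights.get(token, 1.0)  # unweighted tokens default to 1.0
        weight_sum += w
        for i, x in enumerate(vectors[token]):
            total[i] += w * x
    if weight_sum == 0.0:
        return total  # all-zero vector when nothing contributed
    return [x / weight_sum for x in total]


template_vector = weighted_template_vector(
    ["connection", "to", "failed"], TOKEN_VECTORS, TOKEN_WEIGHTS
)
print(template_vector)
```

In this sketch the low-weight token "to" contributes nothing, so the resulting vector is dominated by the tokens assumed to carry meaning; a vector of this kind would then be passed to a downstream classifier.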
Pages: 129 - 134
Number of pages: 6
Related papers
50 in total
  • [1] Anomaly Detection in Log Files Using Selected Natural Language Processing Methods
    Ryciak, Piotr
    Wasielewska, Katarzyna
    Janicki, Artur
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [2] A MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING-BASED SMISHING DETECTION MODEL FOR MOBILE MONEY TRANSACTIONS
    Zimba, Aaron
    Phiri, Katongo O.
    Kashale, Chimanga
    Phiri, Mwiza Norina
    [J]. INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2024, 16 (03) : 69 - 80
  • [3] Experience Report: Log Mining using Natural Language Processing and Application to Anomaly Detection
    Bertero, Christophe
    Roy, Matthieu
    Sauvanaud, Carla
    Tredan, Gilles
    [J]. 2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2017, : 351 - 360
  • [4] Anomaly Detection of Software System Logs based on Natural Language Processing
    Wang, Mengying
    Xu, Lele
    Guo, Lili
    [J]. 2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [5] Log Layering Based on Natural Language Processing
    Shen, Hanji
    Long, Chun
    Wan, Wei
    Li, Jun
    Qin, Yakui
    Fu, Yuhao
    Song, Xiaofan
    [J]. 2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 660 - 663
  • [6] A Natural Language Processing-based Model to Automate MRI Brain Protocol Selection and Prioritization
    Brown, Andrew D.
    Marotta, Thomas R.
    [J]. ACADEMIC RADIOLOGY, 2017, 24 (02) : 160 - 166
  • [7] LAnoBERT: System log anomaly detection based on BERT masked language model
    Lee, Yukyung
    Kim, Jina
    Kang, Pilsung
    [J]. APPLIED SOFT COMPUTING, 2023, 146
  • [8] A Natural Language Processing-Based Approach for Clustering Construction Projects
    Le, Chau
    Ko, Taewoo
    Jeong, H. David
    [J]. CONSTRUCTION RESEARCH CONGRESS 2022: COMPUTER APPLICATIONS, AUTOMATION, AND DATA ANALYTICS, 2022, : 354 - 360
  • [9] Anomaly Detection of System Logs Based on Natural Language Processing and Deep Learning
    Wang, Mengying
    Xu, Lele
    Guo, Lili
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP 2018), 2018, : 140 - 144
  • [10] Automated Natural Language Processing-Based Supplier Discovery for Financial Services
    Papa, Mauro
    Chatzigiannakis, Ioannis
    Anagnostopoulos, Aris
    [J]. BIG DATA, 2024, 12 (01) : 30 - 48