Incremental Analysis of Large-Scale System Logs for Anomaly Detection

被引:0
|
作者
Astekin, Merve [1 ]
Ozcan, Selim [1 ]
Sozer, Hasan [2 ]
机构
[1] TUBITAK BILGEM, Inst Informat Technol, Kocaeli, Turkey
[2] Ozyegin Univ, Dept Comp Sci, Istanbul, Turkey
关键词
log analysis; distributed systems; parallel processing; anomaly detection; big data; machine learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalies during system execution can be detected by automated analysis of logs generated by the system. However, large scale systems can generate tens of millions of lines of logs within days. Centralized implementations of traditional machine learning algorithms are not scalable for such data. Therefore, we recently introduced a distributed log analysis framework for anomaly detection. In this paper, we introduce an extension of this framework, which can detect anomalies earlier via incremental analysis instead of the existing offline analysis approach. In the extended version, we periodically process the log data that is accumulated so far. We conducted controlled experiments based on a benchmark dataset to evaluate the effectiveness of this approach. We repeated our experiments with various periods that determine the frequency of analysis as well as the size of the data processed each time. Results showed that our online analysis can improve anomaly detection time significantly while keeping the accuracy level same as that is obtained with the offline approach. The only exceptional case, where the accuracy is compromised, rarely occurs when the analysis is triggered before all the log data associated with a particular session of events are collected.
引用
收藏
页码:2119 / 2127
页数:9
相关论文
共 50 条
  • [31] Anomaly Detection in Large-Scale Networks With Latent Space Models
    Lee, Wesley
    McCormick, Tyler H.
    Neil, Joshua
    Sodja, Cole
    Cui, Yanran
    TECHNOMETRICS, 2022, 64 (02) : 241 - 252
  • [32] Connecting the dots: anomaly and discontinuity detection in large-scale systems
    Malik, Haroon
    Davis, Ian J.
    Godfrey, Michael W.
    Neuse, Douglas
    Manskovskii, Serge
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2016, 7 (04) : 509 - 522
  • [33] DongTing: A large-scale dataset for anomaly detection of the Linux kernel
    Duan, Guoyun
    Fu, Yuanzhi
    Cai, Minjie
    Chen, Hao
    Sun, Jianhua
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 203
  • [34] Connecting the dots: anomaly and discontinuity detection in large-scale systems
    Haroon Malik
    Ian J. Davis
    Michael W. Godfrey
    Douglas Neuse
    Serge Manskovskii
    Journal of Ambient Intelligence and Humanized Computing, 2016, 7 : 509 - 522
  • [35] Proactive Failure Detection Learning Generation Patterns of Large-Scale Network Logs
    Kimura, Tatsuaki
    Watanabe, Akio
    Toyono, Tsuyoshi
    Ishibashi, Keisuke
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2019, E102B (02) : 306 - 316
  • [36] A Survey of Deep Anomaly Detection for System Logs
    Zhao, Xiaoqing
    Jiang, Zhongyuan
    Ma, Jianfeng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [37] Privacy Analysis of User Association Logs in a Large-scale Wireless LAN
    Tan, Keren
    Yan, Guanhua
    Yeo, Jihwang
    Kotz, David
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 31 - 35
  • [38] A Large-Scale Study on Map Search Logs
    Xiao, Xiangye
    Luo, Qiong
    Li, Zhisheng
    Xie, Xing
    Ma, Wei-Ying
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (03) : 1 - 33
  • [39] System anomaly detection: Mining firewall logs
    Winding, Robert
    Wright, Timothy
    Chapple, Michael
    2006 SECURECOMM AND WORKSHOPS, 2006, : 389 - +
  • [40] Proactive Failure Detection Learning Generation Patterns of Large-scale Network Logs
    Kimura, Tatsuaki
    Watanabe, Akio
    Toyono, Tsuyoshi
    Ishibashi, Keisuke
    2015 11TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2015, : 8 - 14