Mining Causality of Network Events in Log Data

被引:39
|
作者
Kobayashi, Satoru [1 ]
Otomo, Kazuki [1 ]
Fukuda, Kensuke [2 ]
Esaki, Hiroshi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138654, Japan
[2] Natl Inst Informat & Sokendai, Tokyo 1018430, Japan
关键词
Causal inference; log data; network management; PC algorithm; root cause analysis;
D O I
10.1109/TNSM.2017.2778096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network log messages (e.g., syslog) are expected to be valuable and useful information to detect unexpected or anomalous behavior in large scale networks. However, because of the huge amount of system log data collected in daily operation, it is not easy to extract pinpoint system failures or to identify their causes. In this paper, we propose a method for extracting the pinpoint failures and identifying their causes from network syslog data. The methodology proposed in this paper relies on causal inference that reconstructs causality of network events from a set of time series of events. Causal inference can filter out accidentally correlated events, thus it outputs more plausible causal events than traditional cross-correlation-based approaches can. We apply our method to 15 months' worth of network syslog data obtained from a nationwide academic network in Japan. The proposed method significantly reduces the number of pseudo correlated events compared with the traditional methods. Also, through three case studies and comparison with trouble ticket data, we demonstrate the effectiveness of the proposed method for practical network operation.
引用
收藏
页码:53 / 67
页数:15
相关论文
共 50 条
  • [41] Data mining network traffic
    Lee, Ian W. C.
    Fapojuwo, Abraham O.
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 170 - +
  • [42] Log summarizing agent for web access data using data mining techniques
    Kato, H
    Hiraishi, H
    Mizoguchi, F
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2642 - 2647
  • [43] A Quantitative Causal Analysis for Network Log Data
    Jarry, Richard
    Kobayashi, Satoru
    Fukuda, Kensuke
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1437 - 1442
  • [44] Mining Up-to-date Knowledge from Log Data
    Hong, Tzung-Pei
    Wu, Yi-Ying
    Wang, Shyue-Liang
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 286 - +
  • [45] Research on data mining to system log audit information in IDS
    Jiang, Yichuan
    Tian, Shengfeng
    Jisuanji Gongcheng/Computer Engineering, 2002, 28 (01):
  • [46] A study on the mining access patterns from web log data
    Ahn, Jeong Yong
    IEICE Transactions on Information and Systems, 2002, E85-D (04) : 782 - 785
  • [47] Theme-based Query Expansion by Mining Log Data
    Chang, Peng
    Ma, Hui
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11562 - +
  • [48] A Kind of Improved Data Clustering Algorithm in Web Log Mining
    Guo, Jin
    Zhang, Shengbing
    Qiu, Zheng
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS RESEARCH AND MECHATRONICS ENGINEERING, 2015, 121 : 2115 - 2119
  • [49] Improved Log Data-Merging Method for Process Mining
    Xu Y.
    Lin Q.
    Li D.
    Li, Dong (cslidong@scut.edu.cn), 1600, South China University of Technology (45): : 112 - 117
  • [50] Efficient mining indirect associations from web log data
    Yin, Ying
    Zhao, Yuhai
    Zhang, Bin
    Ning, Bo
    Journal of Computational Information Systems, 2007, 3 (03): : 1285 - 1292