Mining Causality of Network Events in Log Data

被引:39
|
作者
Kobayashi, Satoru [1 ]
Otomo, Kazuki [1 ]
Fukuda, Kensuke [2 ]
Esaki, Hiroshi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138654, Japan
[2] Natl Inst Informat & Sokendai, Tokyo 1018430, Japan
关键词
Causal inference; log data; network management; PC algorithm; root cause analysis;
D O I
10.1109/TNSM.2017.2778096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network log messages (e.g., syslog) are expected to be valuable and useful information to detect unexpected or anomalous behavior in large scale networks. However, because of the huge amount of system log data collected in daily operation, it is not easy to extract pinpoint system failures or to identify their causes. In this paper, we propose a method for extracting the pinpoint failures and identifying their causes from network syslog data. The methodology proposed in this paper relies on causal inference that reconstructs causality of network events from a set of time series of events. Causal inference can filter out accidentally correlated events, thus it outputs more plausible causal events than traditional cross-correlation-based approaches can. We apply our method to 15 months' worth of network syslog data obtained from a nationwide academic network in Japan. The proposed method significantly reduces the number of pseudo correlated events compared with the traditional methods. Also, through three case studies and comparison with trouble ticket data, we demonstrate the effectiveness of the proposed method for practical network operation.
引用
收藏
页码:53 / 67
页数:15
相关论文
共 50 条
  • [21] Mining Query Subtopics from Search Log Data
    Hu, Yunhua
    Qian, Yanan
    Li, Hang
    Jiang, Daxin
    Pei, Jian
    Zheng, Qinghua
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 305 - 314
  • [22] Big Data: Mining of Log File through Hadoop
    Kotiyal, Bina
    Kumar, Ankit
    Pant, Bhaskar
    Goudar, R. H.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [23] Preprocessing and mining web log data for web personalization
    Baglioni, M
    Ferrara, U
    Romei, A
    Ruggieri, S
    Turini, F
    AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 237 - 249
  • [24] Data Mining in the SIMBAD Database Web Log Files
    Wenger, Marc
    Oberto, Anais
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIX, 2010, 434 : 453 - 456
  • [25] Study on data preprocessing algorithm in web log mining
    Yuan, F
    Wang, LJ
    Yu, G
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 28 - 32
  • [26] Mining web log data based on key path
    Song, AB
    Liang, ZP
    Zhao, MX
    Dong, YS
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 150 - 155
  • [27] Mining Web Log Data for Personalized Recommendation System
    Rosyidah, Asma
    Surjandari, Isti
    Zulkarnain
    2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2018, : 441 - 446
  • [28] Data Mining-based DNS Log Analysis
    Cui H.
    Yang J.
    Liu Y.
    Zheng Z.
    Wu K.
    Annals of Data Science, 2014, 1 (3-4) : 311 - 323
  • [29] Applications of Data Mining in CRM Based on Web Log
    Dang, Jianning
    Zhang, Aiqin
    Jing, Wei
    TRENDS IN CIVIL ENGINEERING, PTS 1-4, 2012, 446-449 : 3762 - 3765
  • [30] Mining CMS Log Data for Students' Feedback Analysis
    Verma, Ashok
    Rathore, Sumangla
    Vishwakarma, Santosh K.
    Goswami, Shubham
    THIRD INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, 797 : 417 - 425