Online Event Correlations Analysis in System Logs of Large-Scale Cluster Systems

被引:0
|
作者
Zhou, Wei [1 ]
Zhan, Jianfeng [1 ]
Meng, Dan [1 ]
Zhang, Zhihong [2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100864, Peoples R China
[2] Res Inst China Mobile, Beijing, Peoples R China
来源
关键词
System logs; online log analysis; event correlations; online event prediction; FAILURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
It has been long recognized that failure events are correlated, not independent. Previous research efforts have shown the correlation analysis of system logs is helpful to resource allocation, job scheduling and proactive management. However, previous log analysis methods analyze the history logs offline. They fail to capture the dynamic change of system errors and failures. In this paper, we purpose an online log analysis approach to mine event correlations in system logs of large-scale cluster systems. Our contributions are three-fold: first, we analyze the event correlations of system logs of a 260-nodes production Hadoop cluster system, and the result shows that the correlation rules of logs change dramatically in different periods; Second, we present a online log analysis algorithm Apriori-SO; third, based on the online event correlations mining, we present an online event prediction method that can predict diversities of failure events with the great detail. The experiment result of a 260-nodes production Hadoop cluster system shows that our online log analysis algorithm can analyze the log streams to obtain event correlation rules in soft real time, and our online event prediction method can achieve higher precision rate and recall rate than the offline log analysis approach.
引用
收藏
页码:262 / +
页数:3
相关论文
共 50 条
  • [1] LogMaster: Mining Event Correlations in Logs of Large-scale Cluster Systems
    Fu, Xiaoyu
    Ren, Rui
    Zhan, Jianfeng
    Zhou, Wei
    Jia, Zhen
    Lu, Gang
    [J]. 2012 31ST INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2012), 2012, : 71 - 80
  • [2] Anomaly States Monitoring of Large-Scale Systems with Intellectual Analysis of System Logs
    Sheluhin, Oleg
    Osin, Andrey
    [J]. PROCEEDINGS OF THE 24TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 395 - 401
  • [3] Incremental Analysis of Large-Scale System Logs for Anomaly Detection
    Astekin, Merve
    Ozcan, Selim
    Sozer, Hasan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2119 - 2127
  • [4] An Online Cluster Analysis Method for Large-scale Protein Sequences
    Tang, DongMing
    Zhu, QingXin
    Zhang, YueFei
    Zhang, Jiang
    [J]. 2009 INTERNATIONAL CONFERENCE ON FUTURE BIOMEDICAL INFORMATION ENGINEERING (FBIE 2009), 2009, : 478 - +
  • [5] Multiple Event Analysis for Large-scale Power Systems through Cluster-based Sparse Coding
    Song, Yang
    Wang, Wei
    Mang, Zhifei
    Qi, Hairong
    Liu, Yilu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2015, : 301 - 306
  • [6] DILAF: A framework for distributed analysis of large-scale system logs for anomaly detection
    Astekin, Merve
    Zengin, Harun
    Sozer, Hasan
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (02): : 153 - 170
  • [7] Intelligent analysis of digital evidences in large-scale logs in power systems attributed to the attacks
    Iqbal, Asif
    Shalaginov, Andrii
    Mahmood, Farhan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3088 - 3093
  • [8] Detecting Large-Scale System Problems by Mining Console Logs
    Xu, Wei
    Huang, Ling
    Fox, Armando
    Patterson, David
    Jordan, Michael I.
    [J]. SOSP'09: PROCEEDINGS OF THE TWENTY-SECOND ACM SIGOPS SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2009, : 117 - 131
  • [9] LongLine: Visual Analytics System for Large-scale Audit Logs
    Yoo, Seunghoon
    Jo, Jaemin
    Kim, Bohyoung
    Seo, Jinwook
    [J]. VISUAL INFORMATICS, 2018, 2 (01): : 82 - 97
  • [10] DIGRAPH ANALYSIS OF LARGE-SCALE SYSTEMS - SYSTEM PRIMITIVE
    EVANS, FJ
    SCHIZAS, C
    [J]. ELECTRONICS LETTERS, 1979, 15 (20) : 613 - 614