Statistical Log Differencing

被引:10
|
作者
Bao, Lingfeng [1 ]
Busany, Nimrod [2 ]
Lo, David [3 ]
Maoz, Shahar [2 ]
机构
[1] Zhejiang Univ City Coll, Hangzhou, Peoples R China
[2] Tel Aviv Univ, Tel Aviv, Israel
[3] Singapore Management Univ, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
FINITE-STATE MACHINES;
D O I
10.1109/ASE.2019.00084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent works have considered the problem of log differencing: given two or more system's execution logs, output a model of their differences. Log differencing has potential applications in software evolution, testing, and security. In this paper we present statistical log differencing, which accounts for frequencies of behaviors found in the logs. We present two algorithms, s2KDiff for differencing two logs, and snKDiff, for differencing of many logs at once, both presenting their results over a single inferred model. A unique aspect of our algorithms is their use of statistical hypothesis testing: we let the engineer control the sensitivity of the analysis by setting the target distance between probabilities and the statistical significance value, and report only (and all) the statistically significant differences. Our evaluation shows the effectiveness of our work in terms of soundness, completeness, and performance. It also demonstrates its effectiveness compared to previous work via a user-study and its potential applications via a case study using real-world logs.
引用
收藏
页码:851 / 862
页数:12
相关论文
共 50 条
  • [1] A Scalable Log Differencing Visualisation Applied to COBOL Refactoring
    Deknop, Celine
    Mens, Kim
    Bergel, Alexandre
    Fabry, Johan
    Zaytsev, Vadim
    [J]. 2021 WORKING CONFERENCE ON SOFTWARE VISUALIZATION (VISSOFT 2021), 2021, : 1 - 11
  • [2] Using Finite-State Models for Log Differencing
    Amar, Hen
    Bao, Lingfeng
    Busany, Nimrod
    Lo, David
    Maoz, Shahar
    [J]. ESEC/FSE'18: PROCEEDINGS OF THE 2018 26TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2018, : 49 - 59
  • [3] Experience Report: Log-based Behavioral Differencing
    Goldstein, Maayan
    Raz, Danny
    Segall, Itai
    [J]. 2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2017, : 282 - 293
  • [4] Log Delta Analysis: Interpretable Differencing of Business Process Event Logs
    van Beest, Nick R. T. P.
    Dumas, Marlon
    Garcia-Banuelos, Luciano
    La Rosa, Marcello
    [J]. BUSINESS PROCESS MANAGEMENT, BPM 2015, 2015, 9253 : 386 - 405
  • [5] Behavioral Log Analysis with Statistical Guarantees
    Busany, Nimrod
    Maoz, Shahar
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 877 - 887
  • [6] Website Log Statistical Testing And Analysis
    Zhao Yongjian
    [J]. 2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 591 - 594
  • [7] Statistical segmentation of geophysical log data
    Velis, Danilo R.
    [J]. MATHEMATICAL GEOLOGY, 2007, 39 (04): : 409 - 417
  • [8] Statistical Segmentation of Geophysical Log Data
    Danilo R. Velis
    [J]. Mathematical Geology, 2007, 39 : 409 - 417
  • [9] Behavioral Log Analysis with Statistical Guarantees
    Busany, Nimrod
    Maoz, Shahar
    [J]. 2015 10TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE 2015) PROCEEDINGS, 2015, : 898 - 901
  • [10] Statistical Question The log rank test
    Sedgwick, Philip
    [J]. BMJ-BRITISH MEDICAL JOURNAL, 2010, 341