Mining software repositories for comprehensible software fault prediction models

被引:76
|
作者
Vandecruys, Olivier [1 ]
Martens, David [1 ]
Baesens, Bart [1 ,2 ]
Mues, Christophe [2 ]
De Backer, Manu [1 ]
Haesen, Raf [1 ]
机构
[1] Dept Decis Sci & Informat Management, B-3000 Louvain, Belgium
[2] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England
关键词
classification; software mining; fault prediction; comprehensibility; Ant Colony Optimization;
D O I
10.1016/j.jss.2007.07.034
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software managers are routinely confronted with software projects that contain errors or inconsistencies and exceed budget and time limits. By mining software repositories with comprehensible data mining techniques, predictive models can be induced that offer software managers the insights they need to tackle these quality and budgeting problems in an efficient way. This paper deals with the role that the Ant Colony Optimization (ACO)-based classification technique AntMiner+ can play as a comprehensible data mining technique to predict erroneous software modules. In an empirical comparison on three real-world public datasets, the rule-based models produced by AntMiner+ are shown to achieve a predictive accuracy that is competitive to that of the models induced by several other included classification techniques, such as C4.5, logistic regression and support vector machines. In addition, we will argue that the intuitiveness and comprehensibility of the AntMiner+ models can be considered superior to the latter models. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:823 / 839
页数:17
相关论文
共 50 条
  • [1] Comprehensible software fault and effort prediction: A data mining approach
    Moeyersoms, Julie
    de Fortuny, Enric Junque
    Dejaeger, Karel
    Baesens, Bart
    Martens, David
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 100 : 80 - 90
  • [2] Toward Comprehensible Software Fault Prediction Models Using Bayesian Network Classifiers
    Dejaeger, Karel
    Verbraken, Thomas
    Baesens, Bart
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (02) : 237 - 257
  • [3] Software mining and fault prediction
    Catal, Cagatay
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (05) : 420 - 426
  • [4] Mining Software Repositories Using Topic Models
    Thomas, Stephen W.
    [J]. 2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2011, : 1138 - 1139
  • [5] Mining software repositories
    [J]. 1600, Japan Society for Software Science and Technology (30):
  • [6] Mining Open Software Repositories
    Alonso Abad, Jesus
    Lopez Nozal, Carlos
    Maudes Raedo, Jesus M.
    [J]. ERCIM NEWS, 2014, (99): : 23 - 24
  • [7] Ethics in the mining of software repositories
    Nicolas E. Gold
    Jens Krinke
    [J]. Empirical Software Engineering, 2022, 27
  • [8] A Survey on Mining Software Repositories
    Jung, Woosung
    Lee, Eunjoo
    Wu, Chisu
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1384 - 1406
  • [9] Ethics in the mining of software repositories
    Gold, Nicolas E.
    Krinke, Jens
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (01)
  • [10] Tools in Mining Software Repositories
    Chaturvedi, K. K.
    Singh, V. B.
    Singh, Prashast
    [J]. PROCEEDINGS OF THE 2013 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ITS APPLICATIONS (ICCSA 2013), 2013, : 89 - 98