Mining software repositories for comprehensible software fault prediction models

被引:76
|
作者
Vandecruys, Olivier [1 ]
Martens, David [1 ]
Baesens, Bart [1 ,2 ]
Mues, Christophe [2 ]
De Backer, Manu [1 ]
Haesen, Raf [1 ]
机构
[1] Dept Decis Sci & Informat Management, B-3000 Louvain, Belgium
[2] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England
关键词
classification; software mining; fault prediction; comprehensibility; Ant Colony Optimization;
D O I
10.1016/j.jss.2007.07.034
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software managers are routinely confronted with software projects that contain errors or inconsistencies and exceed budget and time limits. By mining software repositories with comprehensible data mining techniques, predictive models can be induced that offer software managers the insights they need to tackle these quality and budgeting problems in an efficient way. This paper deals with the role that the Ant Colony Optimization (ACO)-based classification technique AntMiner+ can play as a comprehensible data mining technique to predict erroneous software modules. In an empirical comparison on three real-world public datasets, the rule-based models produced by AntMiner+ are shown to achieve a predictive accuracy that is competitive to that of the models induced by several other included classification techniques, such as C4.5, logistic regression and support vector machines. In addition, we will argue that the intuitiveness and comprehensibility of the AntMiner+ models can be considered superior to the latter models. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:823 / 839
页数:17
相关论文
共 50 条
  • [31] Prediction models for software fault correction effort
    Evanco, WM
    [J]. FIFTH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING, PROCEEDINGS, 2001, : 114 - 120
  • [32] A survey and taxonomy of approaches for mining software repositories in the context of software evolution
    Kagdi, Huzefa
    Collard, Michael L.
    Maletic, Jonathan I.
    [J]. JOURNAL OF SOFTWARE MAINTENANCE AND EVOLUTION-RESEARCH AND PRACTICE, 2007, 19 (02): : 77 - 131
  • [33] On Software Fault Prediction by Mining Software Complexity Data with Dynamically Filtered Training Sets
    Podgorelec, Vili
    [J]. PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON SIMULATION, MODELLING AND OPTIMIZATION, 2009, : 332 - +
  • [34] Introduction to the special issue on mining software repositories
    Tao Xie
    Thomas Zimmermann
    Arie van Deursen
    [J]. Empirical Software Engineering, 2013, 18 : 1043 - 1046
  • [35] Mining expertise of developers from software repositories
    Hammad, Maen
    Hijazi, Haneen
    Hammad, Mustafa
    Otoom, Ahmed Fawzi
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 62 (03) : 227 - 239
  • [36] Introduction to the special issue on mining software repositories
    Xie, Tao
    Zimmermann, Thomas
    van Deursen, Arie
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2013, 18 (06) : 1043 - 1046
  • [37] Guest editorial: Mining software repositories 2018
    Kamei, Yasutaka
    Zaidman, Andy
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (03) : 2055 - 2057
  • [38] Guest editorial: Mining software repositories 2018
    Yasutaka Kamei
    Andy Zaidman
    [J]. Empirical Software Engineering, 2020, 25 : 2055 - 2057
  • [39] Manas: Mining Software Repositories to Assist AutoML
    Nguyen, Giang
    Islam, Md Johirul
    Pan, Rangeet
    Rajan, Hridesh
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 1368 - 1380
  • [40] Research on mining software repositories to facilitate refactoring
    Nyamawe, Ally S.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 13 (05)