Defect Prediction on a Legacy Industrial Software: A Case Study on Software with Few Defects

被引:0
|
作者
Koroglu, Yavuz [1 ]
Sen, Alper [1 ]
Kutluay, Doruk [2 ]
Bayraktar, Akin [2 ]
Tosun, Yalcin [2 ]
Cinar, Murat [2 ]
Kaya, Hasan [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[2] Netas Telecommun, Istanbul, Turkey
关键词
Defect Prediction; Experience Report; Process Metrics; Feature Selection; Random Forest; FAULT;
D O I
10.1145/2896839.2896843
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Context: Building defect prediction models for software projects is helpful for reducing the effort in locating defects. In this paper, we share our experiences in building a defect prediction model for a large industrial software project. We extract product and process metrics to build models and show that we can build an accurate defect prediction model even when 4% of the software is defective. Objective: Our goal in this project is to integrate a defect predictor into the continuous integration (CI) cycle of a large software project and decrease the effort in testing. Method: We present our approach in the form of an experience report. Specifically, we collected data from seven older versions of the software project and used additional features to predict defects of current versions. We compared several classification techniques including Naive Bayes, Decision Trees, and Random Forest and resampled our training data to present the company with the most accurate defect predictor. Results: Our results indicate that we can focus testing efforts by guiding the test team to only 8% of the software where 53% of actual defects can be found. Our model has 90% accuracy. Conclusion: We produce a defect prediction model with high accuracy for a software with defect rate of 4%. Our model uses Random Forest, that which we show has more predictive power than Naive Bayes, Logistic Regression and Decision Trees in our case.
引用
收藏
页码:14 / 20
页数:7
相关论文
共 50 条
  • [1] Costs and Benefits of Machine Learning Software Defect Prediction: Industrial Case Study
    Stradowski, Szymon
    Madeyski, Lech
    [J]. COMPANION PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, FSE COMPANION 2024, 2024, : 92 - 103
  • [2] Pair Programming and Software Defects - An Industrial Case Study
    Phaphoom, Nattakarn
    Sillitti, Alberto
    Succi, Giancarlo
    [J]. AGILE PROCESSES IN SOFTWARE ENGINEERING AND EXTREME PROGRAMMING, 2011, 77 : 208 - 222
  • [3] An industrial case study of classifier ensembles for locating software defects
    Misirli, Ayse Tosun
    Bener, Ayse Basar
    Turhan, Burak
    [J]. SOFTWARE QUALITY JOURNAL, 2011, 19 (03) : 515 - 536
  • [4] An industrial case study of classifier ensembles for locating software defects
    Ayşe Tosun Mısırlı
    Ayşe Başar Bener
    Burak Turhan
    [J]. Software Quality Journal, 2011, 19 : 515 - 536
  • [5] Cost Effectiveness of Software Defect Prediction in an Industrial Project
    Hryszko, Jaroslaw
    Madeyski, Lech
    [J]. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2018, 43 (01) : 7 - 35
  • [6] BOTTLENECKS IN SOFTWARE DEFECT PREDICTION IMPLEMENTATION IN INDUSTRIAL PROJECTS
    Hryszko, Jaroslaw
    Madeyski, Lech
    [J]. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2015, 40 (01) : 17 - 33
  • [7] Data Collection for Software Defect Prediction an Exploratory Case Study of Open Source Software Projects
    Mausa, Goran
    Grbac, Tihana Galinac
    Basic, Bojana Dalbelo
    [J]. 2015 8TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2015, : 463 - 469
  • [8] Pair Programming and Software Defects-A Large, Industrial Case Study
    di Bella, Enrico
    Fronza, Ilenia
    Phaphoom, Nattakarn
    Sillitti, Alberto
    Succi, Giancarlo
    Vlasenko, Jelena
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (07) : 930 - 953
  • [9] Assessment of the Software Defect Prediction Cost Effectiveness in an Industrial Project
    Hryszko, Jaroslaw
    Madeyski, Lech
    [J]. SOFTWARE ENGINEERING: CHALLENGES AND SOLUTIONS, 2017, 504
  • [10] Adapting God Class thresholds for software defect prediction: A case study
    Gradisnik, Mitja
    Beranic, Tina
    Karakatic, Saso
    Mausa, Goran
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1537 - 1542