Comparative analysis of regression and machine learning methods for predicting fault proneness models

被引:51
|
作者
Singh, Yogesh [1 ]
Kaur, Arvinder [1 ]
Malhotra, Ruchika [1 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, Univ Sch Informat Technol, Kashmere Gate, Delhi 110403, India
关键词
software quality; metrics; logistic regression; receiver operating characteristics curve; decision tree; support vector machine;
D O I
10.1504/IJCAT.2009.026595
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Demand for quality software has undergone rapid growth during the last few years. This is leading to increase in development of machine learning techniques for exploring datasets which can be used in constructing models for predicting quality attributes such as Decision Tree (DT), Support Vector Machine (SVM) and Artificial Neural Network (ANN). This paper examines and compares Logistic Regression (LR), ANN (model predicted in an analogous study using the same dataset), SVM and DT methods. These two methods are explored empirically to find the effect of object-oriented metrics given by Chidamber and Kemerer on the fault proneness of object-oriented system classes. Data collected from Java applications is used in the study. The performance of the methods was compared by Receiver Operating Characteristic (ROC) analysis. DT modelling showed 84.7% of correct classifications of faulty classes and is a better model than the model predicted using LR, SVM and ANN method. The area under the ROC curve of LR, ANN, SVM and DT model is 0.826, 0.85, 0.85 and 0.87, respectively. The paper shows that machine learning methods are useful in constructing software quality models.
引用
收藏
页码:183 / 193
页数:11
相关论文
共 50 条
  • [1] A comparative study of machine learning regression models for predicting construction duration
    Zhang, Shen
    Li, Xuechun
    [J]. JOURNAL OF ASIAN ARCHITECTURE AND BUILDING ENGINEERING, 2023, 23 (06) : 1980 - 1996
  • [2] Comparative Analysis of Machine Learning Regression Models for Unknown Dynamics
    Ordonez, Jaime Campos
    Ferguson, Philip
    [J]. IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2023, 7 : 441 - 450
  • [3] A comparative study of models for predicting fault proneness in object-oriented systems
    Singh, Yogesh
    Kaur, Arvinder
    Malhotra, Ruchika
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2014, 49 (01) : 22 - 41
  • [4] Predicting soot formation in fossil fuels: A comparative study of regression and machine learning models
    Lawal, Ridhwan
    Farooq, Wasif
    Abdulraheem, Abdulazeez
    Jameel, Abdul Gani Abdul
    [J]. DIGITAL CHEMICAL ENGINEERING, 2024, 12
  • [5] Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems
    Sekeroglu, Boran
    Ever, Yoney Kirsal
    Dimililer, Kamil
    Al-Turjman, Fadi
    [J]. DATA INTELLIGENCE, 2022, 4 (03) : 620 - 652
  • [6] Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems
    Boran Sekeroglu
    Yoney Kirsal Ever
    Kamil Dimililer
    Fadi Al-Turjman
    [J]. Data Intelligence, 2022, 4 (03) : 620 - 652
  • [7] Comparative analysis of statistical and machine learning methods for predicting faulty modules
    Malhotra, Ruchika
    [J]. APPLIED SOFT COMPUTING, 2014, 21 : 286 - 297
  • [8] A Comparative Analysis of Machine Learning Models for Predicting Loess Collapse Potential
    Motameni, Sahand
    Rostami, Fateme
    Farzai, Sara
    Soroush, Abbas
    [J]. GEOTECHNICAL AND GEOLOGICAL ENGINEERING, 2024, 42 (02) : 881 - 894
  • [9] Predicting Car Rental Prices: A Comparative Analysis of Machine Learning Models
    Yang, Jiseok
    Kim, Jinseok
    Ryu, Hanwoong
    Lee, Jiwoon
    Park, Cheolsoo
    [J]. ELECTRONICS, 2024, 13 (12)
  • [10] A Comparative Analysis of Machine Learning Models for Predicting Loess Collapse Potential
    Sahand Motameni
    Fateme Rostami
    Sara Farzai
    Abbas Soroush
    [J]. Geotechnical and Geological Engineering, 2024, 42 : 881 - 894