Software Defect Prediction using Propositionalization based Data Preprocessing: An Empirical Study

被引:0
|
作者
Pak, CholMyong [1 ,2 ]
Wang, Tian Tian [3 ]
Su, Xiao Hong [3 ]
机构
[1] Harbin Inst Technol, Harbin, Heilongjiang, Peoples R China
[2] Kim Il Sung Univ, Pyongyang, North Korea
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
software defect prediction; data preprocessing; propostionalization; classifier; METRICS;
D O I
10.1109/ICDSBA.2018.00021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data preprocessing can be used to improve classifier performance in classification problems. Software defect prediction is also one of classification problems, so it is needed to use data preprocessing for improving the performance of model. In this paper, we study about the software defect prediction using propositonalization based data preprocessing method. We proposed propositionalization using decision tree as data preprocessing method and made experiments by using common classifiers over 17 datasets from the PROMISE repository. We also used paired t-test to compare propositionalization using decision tree with attribute subset selection and principal component analysis. Results showed that Propostionalization using decision tree improved the performance of software defect prediction significantly and it was more effective than attribute subset selection and principal component analysis. There were no statistically significant differences between top 5 classifiers.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 50 条
  • [1] The Use of Ensemble-Based Data Preprocessing Techniques for Software Defect Prediction
    Gao, Kehan
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2014, 24 (09) : 1229 - 1253
  • [2] An Empirical Study on Software Defect Prediction Using CodeBERT Model
    Pan, Cong
    Lu, Minyan
    Xu, Biao
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [3] An Empirical Study for Enhanced Software Defect Prediction Using a Learning-Based Framework
    Bashir, Kamal
    Li, Tianrui
    Yohannese, Chubato Wondaferaw
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 282 - 298
  • [4] An Empirical Study for Enhanced Software Defect Prediction Using a Learning-Based Framework
    Kamal Bashir
    Tianrui Li
    Chubato Wondaferaw Yohannese
    [J]. International Journal of Computational Intelligence Systems, 2018, 12 : 282 - 298
  • [5] An empirical study to investigate oversampling methods for improving software defect prediction using imbalanced data
    Malhotra, Ruchika
    Kamal, Shine
    [J]. NEUROCOMPUTING, 2019, 343 : 120 - 140
  • [6] An Empirical Study on Software Defect Prediction using Function Point Analysis
    Zhao, Xinghan
    Tian, Cong
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 167 - 176
  • [7] A New Framework Consisted of Data Preprocessing and Classifier Modelling for Software Defect Prediction
    Ji, Haijin
    Huang, Song
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [8] A study on software metrics based software defect prediction using data mining and machine learning techniques
    Prasad, Manjula C.M.
    Florence, Lilly
    Arya, Arti
    [J]. International Journal of Database Theory and Application, 2015, 8 (03): : 179 - 190
  • [9] An empirical study for software change prediction using imbalanced data
    Ruchika Malhotra
    Megha Khanna
    [J]. Empirical Software Engineering, 2017, 22 : 2806 - 2851
  • [10] An empirical study for software change prediction using imbalanced data
    Malhotra, Ruchika
    Khanna, Megha
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (06) : 2806 - 2851