Imbalanced Data Processing Model for Software Defect Prediction

被引:0
|
作者
Lijuan Zhou
Ran Li
Shudong Zhang
Hua Wang
机构
[1] Capital Normal University,College of Information Engineering
来源
关键词
Software defect prediction; Class imbalance; Attribute selection; Sampling; Ensemble algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
In the field of software engineering, software defect prediction is the hotspot of the researches which can effectively guarantee the quality during software development. However, the problem of class imbalanced datasets will affect the accuracy of overall classification of software defect prediction, which is the key issue to be solved urgently today. In order to better solve this problem, this paper proposes a model named ASRA which combines attribute selection, sampling technologies and ensemble algorithm. The model adopts the Chi square test of attribute selection and then utilizes the combined sampling technique which includes SMOTE over-sampling and under-sampling to remove the redundant attributes and make the datasets balance. Afterwards, the model ASRA is eventually established by ensemble algorithm named Adaboost with basic classifier J48 decision tree. The data used in the experiments comes from UCI datasets. It can draw the conclusion that the effect of software defect prediction classification which using this model is improved and better than before by comparing the precision P, F-measure and AUC values from the results of the experiments.
引用
收藏
页码:937 / 950
页数:13
相关论文
共 50 条
  • [31] Software Defect Prediction Based on Stability Test Data
    Okumoto, Kazu
    2011 INTERNATIONAL CONFERENCE ON QUALITY, RELIABILITY, RISK, MAINTENANCE, AND SAFETY ENGINEERING (ICQR2MSE), 2011, : 385 - 387
  • [32] Data quality evaluation method in software defect prediction
    Li N.
    Guo Y.
    Wang X.
    Zhang L.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (11): : 24 - 29
  • [33] A Systematic Data Collection Procedure for Software Defect Prediction
    Mausa, Goran
    Grbac, Tihana Galinac
    Basic, Bojana Dalbelo
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 13 (01) : 173 - 197
  • [34] Research on Software Defect Prediction Based on Data Mining
    Chen, Yuan
    Shen, Xiang-heng
    Du, Peng
    Ge, Bing
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 563 - 567
  • [35] Software Defect Prediction Model Research for Network and Cloud Software Development
    Yang, Yejuan
    PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING (ICMMCCE 2017), 2017, 141 : 717 - 723
  • [36] Seml: A Semantic LSTM Model for Software Defect Prediction
    Liang, Hongliang
    Yu, Yue
    Jiang, Lin
    Xie, Zhuosi
    IEEE ACCESS, 2019, 7 : 83812 - 83824
  • [37] An Improved Confounding Effect Model for Software Defect Prediction
    Yuan, Yuyu
    Li, Chenlong
    Yang, Jincui
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [38] Hybrid model with optimization tactics for software defect prediction
    Gollagi, Shantappa G.
    Balasubramaniam, S.
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2023, 14 (02)
  • [39] Software defect prediction model based on LASSO–SVM
    Kechao Wang
    Lin Liu
    Chengjun Yuan
    Zhifei Wang
    Neural Computing and Applications, 2021, 33 : 8249 - 8259
  • [40] An Empirical Evaluation of the Tobit Model on Software Defect Prediction
    Murakami, Yukasa
    Tsunoda, Masateru
    Toda, Koji
    2016 4TH INTL CONF ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY/3RD INTL CONF ON COMPUTATIONAL SCIENCE/INTELLIGENCE AND APPLIED INFORMATICS/1ST INTL CONF ON BIG DATA, CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (ACIT-CSII-BCD), 2016, : 196 - 201