An Integrated Semi-supervised Software Defect Prediction Model

被引:0
|
作者
Meng, Fanqi [1 ,2 ]
Cheng, Wenying [1 ]
Wang, Jingdong [1 ]
机构
[1] Northeast Elect Power Univ, Sch Comp Sci, Jilin, Peoples R China
[2] Guangdong Atv Acad Performing Arts, Sch Comp Sci, Dongguan, Peoples R China
来源
JOURNAL OF INTERNET TECHNOLOGY | 2023年 / 24卷 / 06期
关键词
learning; Feature selection; Unbalanced classification; Oversampling techniques; FEATURE-SELECTION;
D O I
10.53106/160792642023112406013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel semi-supervised software defect prediction model FFeSSTri (Filtered Feature Selecting, Sample and Tri-training) is proposed to address the problem that class imbalance and too many irrelevant or redundant features in labelled samples lower the accuracy of semi-supervised software defect prediction. Its innovation lies in that the construction of FFeSSTri integrates an oversampling technique, a new feature selection method, and a Tri-training algorithm, thus it can effectively improve the accuracy. Firstly, the oversampling technique is applied to expand the class of inadequate samples, thus it solves the unbalanced classification of the labelled samples. Secondly, a new filtered feature selection method based on relevance and redundancy is proposed, which can exclude those irrelevant or redundant features from labelled samples. Finally, the Tri-training algorithm is used to learn the labelled training samples to build the defect prediction model FFeSSTri. The experiments conducted on the NASA software defect prediction dataset show that FFeSSTri outperforms the existing four supervised learning methods and one semi-supervised learning method in terms of F-Measure values and AUC values.
引用
收藏
页码:1307 / 1317
页数:11
相关论文
共 50 条
  • [1] A Semi-Supervised Approach to Software Defect Prediction
    Lu, Huihua
    Cukic, Bojan
    Culp, Mark
    [J]. 2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 416 - 425
  • [2] Semi-supervised Software Defect Prediction Model Based on Tri-training
    Meng, Fanqi
    Cheng, Wenying
    Wang, Jingdong
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (11): : 4028 - 4042
  • [3] Semi-supervised Software Defect Prediction Model Based on Tri-training
    Meng, Fanqi
    Cheng, Wenying
    Wang, Jingdong
    [J]. KSII Transactions on Internet and Information Systems, 2021, 15 (11) : 4028 - 4042
  • [4] An improved semi-supervised learning method for software defect prediction
    Ma, Ying
    Pan, Weiwei
    Zhu, Shunzhi
    Yin, Huayi
    Luo, Jian
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 27 (05) : 2473 - 2480
  • [5] A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction
    Catal, Cagatay
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2014, 23 (01) : 75 - 82
  • [6] Low-rank representation for semi-supervised software defect prediction
    Zhang, Zhi-Wu
    Jing, Xiao-Yuan
    Wu, Fei
    [J]. IET SOFTWARE, 2018, 12 (06) : 527 - 535
  • [7] Label propagation based semi-supervised learning for software defect prediction
    Zhang, Zhi-Wu
    Jing, Xiao-Yuan
    Wang, Tie-Jian
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2017, 24 (01) : 47 - 69
  • [8] Software Defect Prediction Using Semi-supervised Learning with Dimension Reduction
    Lu, Huihua
    Cukic, Bojan
    Culp, Mark
    [J]. 2012 PROCEEDINGS OF THE 27TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2012, : 314 - 317
  • [9] Label propagation based semi-supervised learning for software defect prediction
    Zhi-Wu Zhang
    Xiao-Yuan Jing
    Tie-Jian Wang
    [J]. Automated Software Engineering, 2017, 24 : 47 - 69
  • [10] Sample-based software defect prediction with active and semi-supervised learning
    Li, Ming
    Zhang, Hongyu
    Wu, Rongxin
    Zhou, Zhi-Hua
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2012, 19 (02) : 201 - 230