Cross-Project and Within-Project Semisupervised Software Defect Prediction: A Unified Approach

被引:108
|
作者
Wu, Fei [1 ]
Jing, Xiao-Yuan [1 ,2 ]
Sun, Ying [1 ]
Sun, Jing [1 ]
Huang, Lin [1 ]
Cui, Fangyi [1 ]
Sun, Yanfei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Jiangsu, Peoples R China
[2] Wuhan Univ, Sch Comp, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Cost-sensitive kernelized semisupervised dictionary learning (CKSDL); cross-project semisupervised defect prediction (CSDP); within-project semisupervised defect prediction (WSDP); NETWORKS; MACHINE;
D O I
10.1109/TR.2018.2804922
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
When there exist not enough historical defect data for building an accurate prediction model, semisupervised defect prediction (SSDP) and cross-project defect prediction (CPDP) are two feasible solutions. Existing CPDP methods assume that the available source data are well labeled. However, due to expensive human efforts for labeling a large amount of defect data, usually, we can only utilize the suitable unlabeled source data. We call CPDP in this scenario as cross-project semisupervised defect prediction (CSDP). Although some within-project semisupervised defect prediction (WSDP) methods have been developed in recent years, there still exists much room for improvement on prediction performance. In this paper, we aim to provide a unified and effective solution for both CSDP and WSDP problems. We introduce the semisupervised dictionary learning technique and propose a cost-sensitive kernelized semisupervised dictionary learning (CKSDL) approach. CKSDL can make full use of the limited labeled defect data and a large amount of unlabeled data in the kernel space. In addition, CKSDL considers the misclassification costs in the dictionary learning process. Extensive experiments on 16 projects indicate that CKSDL outperforms state-of-the-art WSDP methods, using unlabeled cross-project defect data can help improve the WSDP performance, and CKSDL generally obtains significantly better prediction performance than related SSDP methods in the CSDP scenario.
引用
收藏
页码:581 / 597
页数:17
相关论文
共 50 条
  • [1] Cross-project and Within-project Semi-supervised Software Defect Prediction Problems Study Using a Unified Solution
    Wu, Fei
    Jing, Xiao-Yuan
    Dong, Xiwei
    Cao, Jicheng
    Xu, Mingwei
    Zhang, Hongyu
    Ying, Shi
    Xu, Baowen
    PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 195 - 197
  • [2] Within-project and cross-project defect prediction based on model averaging
    Li, Tong
    Wang, Zhong
    Shi, Peibei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [3] An empirical evaluation of defect prediction approaches in within-project and cross-project context
    Bhat, Nayeem Ahmad
    Farooq, Sheikh Umar
    SOFTWARE QUALITY JOURNAL, 2023, 31 (03) : 917 - 946
  • [4] An empirical evaluation of defect prediction approaches in within-project and cross-project context
    Nayeem Ahmad Bhat
    Sheikh Umar Farooq
    Software Quality Journal, 2023, 31 : 917 - 946
  • [5] Within-Project and Cross-Project Software Defect Prediction Based on Improved Transfer Naive Bayes Algorithm
    Zhu, Kun
    Zhang, Nana
    Ying, Shi
    Wang, Xu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (02): : 891 - 910
  • [6] A Novel Class-Imbalance Learning Approach for Both Within-Project and Cross-Project Defect Prediction
    Gong, Lina
    Jiang, Shujuan
    Bo, Lili
    Jiang, Li
    Qian, Junyan
    IEEE TRANSACTIONS ON RELIABILITY, 2020, 69 (01) : 40 - 54
  • [7] Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering
    Tang, Shiqi
    Huang, Song
    Liu, ErHu
    Yao, YongMing
    Wu, KaiShun
    Ji, Haijin
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [8] Domain Adaptation Approach for Cross-project Software Defect Prediction
    Chen S.
    Ye J.-M.
    Liu T.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (02): : 266 - 281
  • [9] An Improved SDA Based Defect Prediction Framework for Both Within-Project and Cross-Project Class-Imbalance Problems
    Jing, Xiao-Yuan
    Wu, Fei
    Dong, Xiwei
    Xu, Baowen
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2017, 43 (04) : 321 - 339
  • [10] An Improved CNN Model for Within-Project Software Defect Prediction
    Pan, Cong
    Lu, Minyan
    Xu, Biao
    Gao, Houleng
    APPLIED SCIENCES-BASEL, 2019, 9 (10):