Cross-Project Defect Prediction with Metrics Selection and Balancing Approach

被引:1
|
作者
Nevendra, Meetesh [1 ]
Singh, Pradeep [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Raipur, India
关键词
AdaBoost; ensemble; Random Forest; SMOTE; STATISTICAL COMPARISONS; SOFTWARE; ALGORITHMS;
D O I
10.2478/acss-2022-0015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
- In software development, defects influence the quality and cost in an undesirable way. Software defect prediction (SDP) is one of the techniques which improves the software quality and testing efficiency by early identification of defects(bug/fault/error). Thus, several experiments have been suggested for defect prediction (DP) techniques. Mainly DP method utilises historical project data for constructing prediction models. SDP performs well within projects until there is an adequate amount of data accessible to train the models. However, if the data are inadequate or limited for the same project, the researchers mainly use Cross-Project Defect Prediction (CPDP). CPDP is a possible alternative option that refers to anticipating defects using prediction models built on historical data from other projects. CPDP is challenging due to its data distribution and domain difference problem. The proposed framework is an effective two-stage approach for CPDP, i.e., model generation and prediction process. In model generation phase, the conglomeration of different pre-processing, including feature selection and class reweights technique, is used to improve the initial data quality. Finally, a fine-tuned efficient bagging and boosting based hybrid ensemble model is developed, which avoids model overfitting/under-fitting and helps enhance the prediction performance. In the prediction process phase, the generated model predicts the historical data from other projects, which has defects or clean. The framework is evaluated using25 software projects obtained from public repositories. The result analysis shows that the proposed model has achieved a 0.71 +/- 0.03 f1-score, which significantly improves the state-of-the-art approaches by 23 % to 60 %.
引用
收藏
页码:137 / 148
页数:12
相关论文
共 50 条
  • [31] A Hybrid Multiple Models Transfer Approach for Cross-Project Software Defect Prediction
    Zhang, Shenggang
    Jiang, Shujuan
    Yan, Yue
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (03) : 343 - 374
  • [32] Adversarial domain adaptation for cross-project defect prediction
    Song, Hengjie
    Wu, Guobin
    Ma, Le
    Pan, Yufei
    Huang, Qingan
    Jiang, Siyu
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (05)
  • [33] Manifold Learning for Cross-project Software Defect Prediction
    Sun, Jing
    Jing, Xiaoyuan
    Dong, Xiwei
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 567 - 571
  • [34] A transfer cost-sensitive boosting approach for cross-project defect prediction
    Ryu, Duksan
    Jang, Jong-In
    Baik, Jongmoon
    [J]. SOFTWARE QUALITY JOURNAL, 2017, 25 (01) : 235 - 272
  • [35] A Survey on Transfer Learning for Cross-Project Defect Prediction
    Sotto-Mayor, Bruno
    Kalech, Meir
    [J]. IEEE ACCESS, 2024, 12 : 93398 - 93425
  • [36] A Hybrid Instance Selection Using Nearest-Neighbor for Cross-Project Defect Prediction
    Duksan Ryu
    Jong-In Jang
    Jongmoon Baik
    [J]. Journal of Computer Science and Technology, 2015, 30 : 969 - 980
  • [37] CFPS: Collaborative filtering based source projects selection for cross-project defect prediction
    Sun, Zhongbin
    Li, Junqi
    Sun, Heli
    He, Liang
    [J]. APPLIED SOFT COMPUTING, 2021, 99
  • [38] Cross-project smell-based defect prediction
    Bruno Sotto-Mayor
    Meir Kalech
    [J]. Soft Computing, 2021, 25 : 14171 - 14181
  • [39] A transfer cost-sensitive boosting approach for cross-project defect prediction
    Duksan Ryu
    Jong-In Jang
    Jongmoon Baik
    [J]. Software Quality Journal, 2017, 25 : 235 - 272
  • [40] Instance Selection and Class Balancing Techniques for Cross Project Defect Prediction
    Bispo, Alysson
    Prudencio, Ricardo
    Veras, Douglas
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 552 - 557