Cross-Project Defect Prediction with Metrics Selection and Balancing Approach

被引:1
|
作者
Nevendra, Meetesh [1 ]
Singh, Pradeep [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Raipur, India
关键词
AdaBoost; ensemble; Random Forest; SMOTE; STATISTICAL COMPARISONS; SOFTWARE; ALGORITHMS;
D O I
10.2478/acss-2022-0015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
- In software development, defects influence the quality and cost in an undesirable way. Software defect prediction (SDP) is one of the techniques which improves the software quality and testing efficiency by early identification of defects(bug/fault/error). Thus, several experiments have been suggested for defect prediction (DP) techniques. Mainly DP method utilises historical project data for constructing prediction models. SDP performs well within projects until there is an adequate amount of data accessible to train the models. However, if the data are inadequate or limited for the same project, the researchers mainly use Cross-Project Defect Prediction (CPDP). CPDP is a possible alternative option that refers to anticipating defects using prediction models built on historical data from other projects. CPDP is challenging due to its data distribution and domain difference problem. The proposed framework is an effective two-stage approach for CPDP, i.e., model generation and prediction process. In model generation phase, the conglomeration of different pre-processing, including feature selection and class reweights technique, is used to improve the initial data quality. Finally, a fine-tuned efficient bagging and boosting based hybrid ensemble model is developed, which avoids model overfitting/under-fitting and helps enhance the prediction performance. In the prediction process phase, the generated model predicts the historical data from other projects, which has defects or clean. The framework is evaluated using25 software projects obtained from public repositories. The result analysis shows that the proposed model has achieved a 0.71 +/- 0.03 f1-score, which significantly improves the state-of-the-art approaches by 23 % to 60 %.
引用
收藏
页码:137 / 148
页数:12
相关论文
共 50 条
  • [1] Using active learning selection approach for cross-project software defect prediction
    Mi, Wenbo
    Li, Yong
    Wen, Ming
    Chen, Youren
    [J]. CONNECTION SCIENCE, 2022, 34 (01) : 1482 - 1499
  • [2] Using Bandit Algorithms for Project Selection in Cross-Project Defect Prediction
    Asano, Takuya
    Tsunoda, Masateru
    Toda, Koji
    Tahir, Amjed
    Bennin, Kwabena Ebo
    Nakasai, Keitaro
    Monden, Akito
    Matsumoto, Kenichi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 649 - 653
  • [3] Local modeling approach for cross-project defect prediction
    Bhat, Nayeem Ahmad
    Farooq, Sheikh Umar
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04): : 623 - 637
  • [4] An Empirical Study of Software Metrics Diversity for Cross-Project Defect Prediction
    Zhong, Yiwen
    Song, Kun
    Lv, ShengKai
    He, Peng
    [J]. Mathematical Problems in Engineering, 2021, 2021
  • [5] Training data selection for imbalanced cross-project defect prediction
    Zheng, Shang
    Gai, Jinjing
    Yu, Hualong
    Zou, Haitao
    Gao, Shang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 94
  • [6] WIFLF: An approach independent of the target project for cross-project defect prediction
    Cui, Can
    Liu, Bin
    Wang, Shihai
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2022, 34 (12)
  • [7] Source selection and transfer defect learning based cross-project defect prediction
    Wen, Wanzhi
    Zhu, Ningbo
    Ye, Bingqing
    Li, Xikai
    Wang, Chuyue
    Chu, Jiawei
    Li, Yuehua
    [J]. INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2022, 16 (03) : 195 - 207
  • [8] Domain Adaptation Approach for Cross-project Software Defect Prediction
    Chen, Shu
    Ye, Jun-Min
    Liu, Tong
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (02): : 266 - 281
  • [9] An Improved Method for Training Data Selection for Cross-Project Defect Prediction
    Nayeem Ahmad Bhat
    Sheikh Umar Farooq
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 1939 - 1954
  • [10] Empirical validation of feature selection techniques for cross-project defect prediction
    Malhotra, Ruchika
    Meena, Shweta
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (05) : 1743 - 1755