Cross-Project Defect Prediction with Metrics Selection and Balancing Approach

被引:1
|
作者
Nevendra, Meetesh [1 ]
Singh, Pradeep [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Raipur, India
关键词
AdaBoost; ensemble; Random Forest; SMOTE; STATISTICAL COMPARISONS; SOFTWARE; ALGORITHMS;
D O I
10.2478/acss-2022-0015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
- In software development, defects influence the quality and cost in an undesirable way. Software defect prediction (SDP) is one of the techniques which improves the software quality and testing efficiency by early identification of defects(bug/fault/error). Thus, several experiments have been suggested for defect prediction (DP) techniques. Mainly DP method utilises historical project data for constructing prediction models. SDP performs well within projects until there is an adequate amount of data accessible to train the models. However, if the data are inadequate or limited for the same project, the researchers mainly use Cross-Project Defect Prediction (CPDP). CPDP is a possible alternative option that refers to anticipating defects using prediction models built on historical data from other projects. CPDP is challenging due to its data distribution and domain difference problem. The proposed framework is an effective two-stage approach for CPDP, i.e., model generation and prediction process. In model generation phase, the conglomeration of different pre-processing, including feature selection and class reweights technique, is used to improve the initial data quality. Finally, a fine-tuned efficient bagging and boosting based hybrid ensemble model is developed, which avoids model overfitting/under-fitting and helps enhance the prediction performance. In the prediction process phase, the generated model predicts the historical data from other projects, which has defects or clean. The framework is evaluated using25 software projects obtained from public repositories. The result analysis shows that the proposed model has achieved a 0.71 +/- 0.03 f1-score, which significantly improves the state-of-the-art approaches by 23 % to 60 %.
引用
收藏
页码:137 / 148
页数:12
相关论文
共 50 条
  • [21] Correlation Metric Selection based Correlation Alignment for Cross-project Defect Prediction
    Niu, Jingwen
    Li, Zhiqiang
    Qi, Chao
    [J]. 20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 490 - 495
  • [22] A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction
    Chao Ni
    Wang-Shu Liu
    Xiang Chen
    Qing Gu
    Dao-Xu Chen
    Qi-Guo Huang
    [J]. Journal of Computer Science and Technology, 2017, 32 : 1090 - 1107
  • [23] A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction
    Ni, Chao
    Liu, Wang-Shu
    Chen, Xiang
    Gu, Qing
    Chen, Dao-Xu
    Huang, Qi-Guo
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (06) : 1090 - 1107
  • [24] Cross-project defect prediction method based on genetic algorithm feature selection
    Hu, Zhixi
    Zhu, Yi
    [J]. ENGINEERING REPORTS, 2023, 5 (12)
  • [25] Cross-Project and Within-Project Semisupervised Software Defect Prediction: A Unified Approach
    Wu, Fei
    Jing, Xiao-Yuan
    Sun, Ying
    Sun, Jing
    Huang, Lin
    Cui, Fangyi
    Sun, Yanfei
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2018, 67 (02) : 581 - 597
  • [26] An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction
    Amasaki, Sousuke
    Aman, Hirohisa
    Yokogawa, Tomoyuki
    [J]. PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2022, 2022, 13709 : 433 - 448
  • [27] A Three-Stage Defect Prediction Model for Cross-Project Defect Prediction
    Huang, Song
    Wu, Yaning
    Ji, Haijin
    Bai, Chengzu
    [J]. 2017 FOURTH INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND THEIR APPLICATIONS (DSA 2017), 2017, : 169 - 169
  • [28] Adversarial domain adaptation for cross-project defect prediction
    Hengjie Song
    Guobin Wu
    Le Ma
    Yufei Pan
    Qingan Huang
    Siyu Jiang
    [J]. Empirical Software Engineering, 2023, 28
  • [29] Discriminant Subspace Alignment for Cross-project Defect Prediction
    Li, Zhiqiang
    Qi, Chao
    Zhang, Li
    Ren, Jie
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 1728 - 1733