An Approach for Cross Project Defect Prediction Using Identical Metrics Matching and Deep Neural Network

被引:0
|
作者
Bal, Pravas Ranjan [1 ,2 ]
Kumar, Sandeep [2 ]
机构
[1] Birla Inst Technol, Dept Comp Sci & Engn, Mesra 835215, India
[2] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee 247667, India
关键词
Training; Software; Measurement; Data models; Accuracy; Predictive models; Prediction algorithms; Correlation; cross project defect prediction (CPDP); deep learning; Kolmogorov-Smirnov (KS) test; matched metrics;
D O I
10.1109/TR.2024.3435709
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Advancements in software defect prediction (SDP) to handle the scenario of no or limited historical data have introduced the concept of cross-project defect prediction (CPDP). CPDP using machine learning (ML) algorithms has been the staple research area for all software practitioners in the SDP domain. An important assumption in ML algorithms is that both train and test data must follow similar data distribution for better accuracy. These assumptions may hold in the within-project defect prediction (WPDP) scenario where both train and test data belong to the same project. However, it is impossible in the CPDP scenario where the train and test data belong to different projects. So, in the CPDP scenario, researchers tried to use a matched metrics approach to handling this issue. However, in this case, there may be an issue if only a small-sized source (train) dataset matches the data distribution with the target (test) dataset, leading to an insufficient training dataset. Hence, we have proposed a cross-project data preprocessing method, namely knowledge transfer from target data to source data using correlation (KTTSC), to handle this issue and hence to improve the CPDP accuracy of ML models. The experimental results demonstrate that using the dropout regularization-based deep neural network, k nearest neighbor, decision tree, logistic regression, and Naive Bayes classifiers with the proposed KTTSC method show an improvement of 22%, 17%, 23.2%, 13.5%, and 9.5%, respectively, in terms of average AUC scores as compared to the traditional CPDP method and an improvement in the range of 6.6% to 11.1% as compared to existing works on CPDP.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Deep neural network based hybrid approach for software defect prediction using software metrics
    C. Manjula
    Lilly Florence
    [J]. Cluster Computing, 2019, 22 : 9847 - 9863
  • [2] Deep neural network based hybrid approach for software defect prediction using software metrics
    Manjula, C.
    Florence, Lilly
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9847 - S9863
  • [3] Cross-Project Defect Prediction with Metrics Selection and Balancing Approach
    Nevendra, Meetesh
    Singh, Pradeep
    [J]. APPLIED COMPUTER SYSTEMS, 2022, 27 (02) : 137 - 148
  • [4] DNNAttention: A deep neural network and attention based architecture for cross project defect number prediction
    Pandey, Sushant Kumar
    Tripathi, Anil Kumar
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 233
  • [5] Transfer Convolutional Neural Network for Cross-Project Defect Prediction
    Qiu, Shaojian
    Xu, Hao
    Deng, Jiehan
    Jiang, Siyu
    Lu, Lu
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [6] An Adversarial Discriminative Convolutional Neural Network for Cross-Project Defect Prediction
    Sheng, Lei
    Lu, Lu
    Lin, Junhao
    [J]. IEEE ACCESS, 2020, 8 : 55241 - 55253
  • [7] Software defect prediction techniques using metrics based on neural network classifier
    R. Jayanthi
    Lilly Florence
    [J]. Cluster Computing, 2019, 22 : 77 - 88
  • [8] Software defect prediction techniques using metrics based on neural network classifier
    Jayanthi, R.
    Florence, Lilly
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 77 - 88
  • [9] Defect Prediction in Android Binary Executables Using Deep Neural Network
    Feng Dong
    Junfeng Wang
    Qi Li
    Guoai Xu
    Shaodong Zhang
    [J]. Wireless Personal Communications, 2018, 102 : 2261 - 2285
  • [10] Defect Prediction in Android Binary Executables Using Deep Neural Network
    Dong, Feng
    Wang, Junfeng
    Li, Qi
    Xu, Guoai
    Zhang, Shaodong
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (03) : 2261 - 2285