MASTER: Multi-Source Transfer Weighted Ensemble Learning for Multiple Sources Cross-Project Defect Prediction

Citations: 0
Authors
Tong, Haonan [1]
Zhang, Dalin [1]
Liu, Jiqiang [1]
Xing, Weiwei [1]
Lu, Lingyun [1]
Lu, Wei [1]
Wu, Yumei [2]
Affiliations
[1] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
Keywords
Multiple source datasets; cross-project defect prediction; software defect proneness; feature weighting; transfer learning; MODEL;
DOI
10.1109/TSE.2024.3381235
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Multi-source cross-project defect prediction (MSCPDP) attempts to transfer defect knowledge learned from multiple source projects to the target project. MSCPDP has drawn increasing attention from the academic and industrial communities owing to its advantages over single-source cross-project defect prediction (SSCPDP). However, two main problems seriously restrict the performance of existing MSCPDP models: how to effectively extract the transferable knowledge from each source dataset, and how to measure the amount of knowledge transferred from each source dataset to the target dataset. In this paper, we propose a novel multi-source transfer weighted ensemble learning (MASTER) method for MSCPDP. MASTER measures the weight of each source dataset based on feature importance and distribution difference and then extracts the transferable knowledge using the proposed feature-weighted transfer learning algorithm. Experiments are performed on 30 software projects. We compare MASTER with the latest state-of-the-art MSCPDP methods, using statistical tests, in terms of four well-known effort-unaware measures (PD, PF, AUC, and MCC) and two widely used effort-aware measures (P-opt 20% and IFA). The experimental results show that: 1) MASTER substantially improves prediction performance compared with the baselines, e.g., an improvement of at least 49.1% in MCC and 48.1% in IFA; 2) MASTER significantly outperforms each baseline on most datasets in terms of AUC, MCC, P-opt 20%, and IFA; 3) the MSCPDP model performs significantly better than the mean case of the SSCPDP model on most datasets and even outperforms the best case of SSCPDP on some datasets. We conclude that 1) conducting MSCPDP is necessary, and 2) the proposed MASTER is a more promising alternative for MSCPDP.
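The abstract reports results in terms of PD, PF, and MCC. As a minimal sketch, assuming the standard confusion-matrix definitions of these measures (the paper's exact evaluation setup is not given in this record), they could be computed as follows. The function name and the example counts are hypothetical and are not taken from the paper. AUC, P-opt 20%, and IFA are omitted because they require ranked prediction scores and effort data.

```python
import math


def effort_unaware_measures(tp, fp, tn, fn):
    """Return (PD, PF, MCC) from confusion-matrix counts.

    Assumes the usual definitions:
      PD  = TP / (TP + FN)   probability of detection (recall)
      PF  = FP / (FP + TN)   probability of false alarm
      MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
    Any ratio with a zero denominator is reported as 0.0.
    """
    pd = tp / (tp + fn) if (tp + fn) else 0.0
    pf = fp / (fp + tn) if (fp + tn) else 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return pd, pf, mcc


# Hypothetical counts for illustration only (not results from the paper).
pd, pf, mcc = effort_unaware_measures(tp=30, fp=10, tn=50, fn=10)
print(f"PD={pd:.3f} PF={pf:.3f} MCC={mcc:.3f}")
```

Higher PD and MCC and lower PF indicate better prediction; MCC uses all four cells of the confusion matrix, which makes it less sensitive to class imbalance than PD or PF alone.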
Pages: 1281 - 1305
Page count: 25
Related papers
50 in total
  • [31] Improve cross-project just-in-time defect prediction with dynamic transfer learning
    Dai, Hongming
    Xi, Jianqing
    Dai, Hong-Liang
    [J]. Journal of Systems and Software, 2025, 219
  • [32] Effective multi-objective naive Bayes learning for cross-project defect prediction
    Ryu, Duksan
    Baik, Jongmoon
    [J]. APPLIED SOFT COMPUTING, 2016, 49 : 1062 - 1077
  • [33] Cross-project Defect Prediction Method Using Adversarial Learning
    Xing, Ying
    Qian, Xiao-Meng
    Guan, Yu
    Zhang, Shi-Hao
    Zhao, Meng-Ci
    Lin, Wan-Ting
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (06): 2097 - 2112
  • [34] DeepCPDP: Deep Learning Based Cross-Project Defect Prediction
    Chen, Deyu
    Chen, Xiang
    Li, Hao
    Xie, Junfeng
    Mu, Yanzhou
    [J]. IEEE ACCESS, 2019, 7 : 184832 - 184848
  • [35] ARRAY: Adaptive triple feature-weighted transfer Naive Bayes for cross-project defect prediction
    Tong, Haonan
    Lu, Wei
    Xing, Weiwei
    Wang, Shihai
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 202
  • [36] Tackling Imbalanced Class on Cross-Project Defect Prediction Using Ensemble SMOTE
    Saifudin, A.
    Hendric, S. W. H. L.
    Soewito, B.
    Gaol, F. L.
    Abdurachman, E.
    Heryadi, Y.
    [J]. 2ND INTERNATIONAL CONFERENCE ON INFORMATICS, ENGINEERING, SCIENCE, AND TECHNOLOGY (INCITEST 2019), 2019, 662
  • [37] Cross-project clone consistent-defect prediction via transfer-learning method
    Jiang, Wenchao
    Qiu, Shaojian
    Liang, Tiancai
    Zhang, Fanlong
    [J]. INFORMATION SCIENCES, 2023, 635 : 138 - 150
  • [38] Cross-project bug type prediction based on transfer learning
    Xiaoting Du
    Zenghui Zhou
    Beibei Yin
    Guanping Xiao
    [J]. Software Quality Journal, 2020, 28 : 39 - 57
  • [39] Understanding the Automated Parameter Optimization on Transfer Learning for Cross-Project Defect Prediction: An Empirical Study
    Li, Ke
    Xiang, Zilin
    Chen, Tao
    Wang, Shuo
    Tan, Kay Chen
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020: 566 - 577
  • [40] Cross-Project Defect Prediction Considering Multiple Data Distribution Simultaneously
    Zhao, Yu
    Zhu, Yi
    Yu, Qiao
    Chen, Xiaoying
    [J]. SYMMETRY-BASEL, 2022, 14 (02)