A three-stage transfer learning framework for multi-source cross-project software defect prediction

被引:0
|
作者
Bai, Jiaojiao [1 ]
Jia, Jingdong [1 ]
Capretz, Luiz Fernando [2 ]
机构
[1] Beihang Univ, Sch Software, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[2] Western Univ, Elect & Comp Engn, London, ON, Canada
关键词
Transfer learning; Cross-project defect prediction; Source selection; Multi-source utilization; 3SW-MSTL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Transfer learning techniques have been proved to be effective in the field of Cross-project defect prediction (CPDP). However, some questions still remain. First, the conditional distribution difference between source and target projects has not been considered. Second, facing multiple source projects, most studies only rarely consider the issues of source selection and multi-source data utilization; instead, they use all available projects and merge multi-source data together to obtain one final dataset. Objective: To address these issues, in this paper, we propose a three-stage weighting framework for multi-source transfer learning (3SW-MSTL) in CPDP. In stage 1, a source selection strategy is needed to select a suitable number of source projects from all available projects. In stage 2, a transfer technique is applied to minimize marginal differences. In stage 3, a multi-source data utilization scheme that uses conditional distribution information is needed to help guide researchers in the use of multi-source transferred data. Method: First, we have designed five source selection strategies and four multi-source utilization schemes and chosen the best one to be used in stage 1 and 3 in 3SW-MSTL by comparing their influences on prediction performance. Second, to validate the performance of 3SW-MSTL, we compared it with four multi-source and six single-source CPDP methods, a baseline within-project defect prediction (WPDP) method, and two unsupervised methods on the data from 30 widely used open-source projects. Results: Through experiments, bellwether and weighted vote are separately chosen as a source selection strategy and a multi-source utilization scheme used in 3SW-MSTL. And, our results indicate that 3SW-MSTL outperforms four multi-source, six single-source CPDP methods and two unsupervised methods. And, 3SW-MSTL is comparable to the WPDP method. Conclusion: The proposed 3SW-MSTL model is more effective for considering the two issues mentioned before.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] An Investigation of Cross-Project Learning in Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    Cabral, George G.
    Song, Liyan
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 554 - 565
  • [42] Cross-project software defect prediction based on the reduction and hybridization of software metrics
    Abdu, Ahmed
    Zhai, Zhengjun
    Abdo, Hakim A.
    Lee, Sungon
    Al-masni, Mohammed A.
    Gu, Yeong Hyeon
    Algabri, Redhwan
    [J]. Alexandria Engineering Journal, 2025, 112 : 161 - 176
  • [43] Mobile Application Online Cross-Project Just-in-Time Software Defect Prediction Framework
    Jiang, Siyu
    He, Zhenhang
    chen, Yuwen
    Zhang, Mingrong
    Ma, Le
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (06)
  • [44] Transfer Convolutional Neural Network for Cross-Project Defect Prediction
    Qiu, Shaojian
    Xu, Hao
    Deng, Jiehan
    Jiang, Siyu
    Lu, Lu
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [45] Improve cross-project just-in-time defect prediction with dynamic transfer learning
    Dai, Hongming
    Xi, Jianqing
    Dai, Hong-Liang
    [J]. Journal of Systems and Software, 2025, 219
  • [46] Impact of hyper parameter optimization for cross-project software defect prediction
    Qu, Yubin
    Chen, Xiang
    Zhao, Yingquan
    Ju, Xiaolin
    [J]. International Journal of Performability Engineering, 2018, 14 (06): : 1291 - 1299
  • [47] Effective multi-objective naive Bayes learning for cross-project defect prediction
    Ryu, Duksan
    Baik, Jongmoon
    [J]. APPLIED SOFT COMPUTING, 2016, 49 : 1062 - 1077
  • [48] Cross-project Defect Prediction Method Using Adversarial Learning
    Xing, Ying
    Qian, Xiao-Meng
    Guan, Yu
    Zhang, Shi-Hao
    Zhao, Meng-Ci
    Lin, Wan-Ting
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (06): : 2097 - 2112
  • [49] DeepCPDP: Deep Learning Based Cross-Project Defect Prediction
    Chen, Deyu
    Chen, Xiang
    Li, Hao
    Xie, Junfeng
    Mu, Yanzhou
    [J]. IEEE ACCESS, 2019, 7 : 184832 - 184848
  • [50] An Empirical Study of Software Metrics Diversity for Cross-Project Defect Prediction
    Zhong, Yiwen
    Song, Kun
    Lv, ShengKai
    He, Peng
    [J]. Mathematical Problems in Engineering, 2021, 2021