Improve cross-project just-in-time defect prediction with dynamic transfer learning

被引:0
|
作者
Dai, Hongming [1 ,2 ]
Xi, Jianqing [1 ]
Dai, Hong-Liang [3 ]
机构
[1] School of Software, South China University of Technology, Guangzhou,510006, China
[2] School of Information, Guangdong Polytechnic of Science and Trade, Guangzhou,510430, China
[3] School of Economics and Statistics, Guangzhou University, Guangzhou,510006, China
关键词
Prediction models;
D O I
10.1016/j.jss.2024.112214
中图分类号
学科分类号
摘要
Cross-project just-in-time software defect prediction (CP-JIT-SDP) is a prominent research topic in the field of software engineering. This approach is characterized by its immediacy, accuracy, real-time feedback, and traceability, enabling it to effectively address the challenges of defect prediction in new projects or projects with limited training data. However, CP-JIT-SDP faces significant challenges due to the differences in the feature distribution between the source and target projects. To address this issue, researchers have proposed methods for adjusting marginal or conditional probability distributions. This study introduces a transfer-learning approach that integrates dynamic distribution adaptation. The kernel variance matching (KVM) method is proposed to adjust the disparity in the marginal probability distribution by recalculating the variance of the source and target projects within the reproducing kernel Hilbert space (RKHS) to minimize the variance disparity. The categorical boosting (CatBoost) algorithm is used to construct models, while the improved CORrelation ALignment (CORAL) method is applied to develop the loss function to address the difference in the conditional probability distribution. This method is abbreviated as KCC, where the symbol K represents KVM, the symbol C represents CatBoost, and the next symbol C represents improved CORAL. The KCC method aims to optimize the joint probability distribution of the source project so that it closely agrees with that of the target project through iterative and dynamic integration. Six well-known open-source projects were used to evaluate the effectiveness of the proposed method. The empirical findings indicate that the KCC method exhibited significant improvements over the baseline methods. In particular, the KCC method demonstrated an average increase of 18% in the geometric mean (G-mean), 105.4% in the Matthews correlation coefficient (MCC), 25.6% in the F1-score, and 16.9% in the area under the receiver operating characteristic curve (AUC) when compared to the baseline methods. Furthermore, the KCC method demonstrated greater stability. © 2024 Elsevier Inc.
引用
收藏
相关论文
共 50 条
  • [1] An Investigation of Cross-Project Learning in Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    Cabral, George G.
    Song, Liyan
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 554 - 565
  • [2] Cross-Project Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (01) : 268 - 287
  • [3] Studying just-in-time defect prediction using cross-project models
    Yasutaka Kamei
    Takafumi Fukushima
    Shane McIntosh
    Kazuhiro Yamashita
    Naoyasu Ubayashi
    Ahmed E. Hassan
    [J]. Empirical Software Engineering, 2016, 21 : 2072 - 2106
  • [4] Studying just-in-time defect prediction using cross-project models
    Kamei, Yasutaka
    Fukushima, Takafumi
    McIntosh, Shane
    Yamashita, Kazuhiro
    Ubayashi, Naoyasu
    Hassan, Ahmed E.
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (05) : 2072 - 2106
  • [5] Mobile Application Online Cross-Project Just-in-Time Software Defect Prediction Framework
    Jiang, Siyu
    He, Zhenhang
    chen, Yuwen
    Zhang, Mingrong
    Ma, Le
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (06)
  • [6] A novel instance-based method for cross-project just-in-time defect prediction
    Zhu, Xiaoyan
    Qiu, Tian
    Wang, Jiayin
    Lai, Xin
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2024, 54 (06): : 1087 - 1117
  • [7] Online cross-project approach with project-level similarity for just-in-time software defect prediction
    Teng, Cong
    Song, Liyan
    Yao, Xin
    [J]. Empirical Software Engineering, 2024, 29 (06)
  • [8] Effort-aware cross-project just-in-time defect prediction framework for mobile apps
    Cheng, Tian
    Zhao, Kunsong
    Sun, Song
    Mateen, Muhammad
    Wen, Junhao
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (06)
  • [9] Effort-aware cross-project just-in-time defect prediction framework for mobile apps
    CHENG Tian
    ZHAO Kunsong
    SUN Song
    MATEEN Muhammad
    WEN Junhao
    [J]. Frontiers of Computer Science, 2022, 16 (06)
  • [10] Effort-aware cross-project just-in-time defect prediction framework for mobile apps
    Tian Cheng
    Kunsong Zhao
    Song Sun
    Muhammad Mateen
    Junhao Wen
    [J]. Frontiers of Computer Science, 2022, 16