Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction

被引:4
|
作者
Ren, Shengbing [1 ]
Zhang, Wanying [1 ]
Munir, Hafiz Shahbaz [2 ]
Xia, Lei [1 ]
机构
[1] Cent S Univ, Sch Software, Changsha 410075, Hunan, Peoples R China
[2] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
关键词
software quality; cross-project defect prediction; multi-source; dissimilarity space; arc-cosine kernel function;
D O I
10.3390/a12010013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction is an important means to guarantee software quality. Because there are no sufficient historical data within a project to train the classifier, cross-project defect prediction (CPDP) has been recognized as a fundamental approach. However, traditional defect prediction methods use feature attributes to represent samples, which cannot avoid negative transferring, may result in poor performance model in CPDP. This paper proposes a multi-source cross-project defect prediction method based on dissimilarity space (DM-CPDP). This method not only retains the original information, but also obtains the relationship with other objects. So it can enhances the discriminant ability of the sample attributes to the class label. This method firstly uses the density-based clustering method to construct the prototype set with the cluster center of samples in the target set. Then, the arc-cosine kernel is used to calculate the sample dissimilarities between the prototype set and the source domain or the target set to form the dissimilarity space. In this space, the training set is obtained with the earth mover's distance (EMD) method. For the unlabeled samples converted from the target set, the k-Nearest Neighbor (KNN) algorithm is used to label those samples. Finally, the model is learned from training data based on TrAdaBoost method and used to predict new potential defects. The experimental results show that this approach has better performance than other traditional CPDP methods.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Cross-project software defect prediction based on multi-source data sets
    Huang Junfu
    Wang Yawen
    Gong Yunzhan
    Jin Dahai
    [J]. The Journal of China Universities of Posts and Telecommunications, 2021, 28 (04) : 75 - 87
  • [2] Cross-project software defect prediction based on multi-source data sets
    Junfu, Huang
    Yawen, Wang
    Yunzhan, Gong
    Dahai, Jin
    [J]. Journal of China Universities of Posts and Telecommunications, 2021, 28 (04): : 75 - 87
  • [3] An Empirical Study on Multi-Source Cross-Project Defect Prediction Models
    Liu, Xuanying
    Li, Zonghao
    Zou, Jiaqi
    Tong, Haonan
    [J]. 2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 318 - 327
  • [4] MHCPDP: multi-source heterogeneous cross-project defect prediction via multi-source transfer learning and autoencoder
    Jie Wu
    Yingbo Wu 
    Nan Niu
    Min Zhou
    [J]. Software Quality Journal, 2021, 29 : 405 - 430
  • [5] MSCPDPLab: A MATLAB toolbox for transfer learning based multi-source cross-project defect prediction
    Zou, Jiaqi
    Li, Zonghao
    Liu, Xuanying
    Tong, Haonan
    [J]. SOFTWAREX, 2023, 21
  • [6] MSCPDPLab: A MATLAB toolbox for transfer learning based multi-source cross-project defect prediction
    Zou, Jiaqi
    Li, Zonghao
    Liu, Xuanying
    Tong, Haonan
    [J]. SOFTWAREX, 2023, 21
  • [7] MHCPDP: multi-source heterogeneous cross-project defect prediction via multi-source transfer learning and autoencoder
    Wu, Jie
    Wu, Yingbo
    Niu, Nan
    Zhou, Min
    [J]. SOFTWARE QUALITY JOURNAL, 2021, 29 (02) : 405 - 430
  • [8] A three-stage transfer learning framework for multi-source cross-project software defect prediction
    Bai, Jiaojiao
    Jia, Jingdong
    Capretz, Luiz Fernando
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 150
  • [9] A three-stage transfer learning framework for multi-source cross-project software defect prediction
    Bai, Jiaojiao
    Jia, Jingdong
    Capretz, Luiz Fernando
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 150
  • [10] MASTER: Multi-Source Transfer Weighted Ensemble Learning for Multiple Sources Cross-Project Defect Prediction
    Tong, Haonan
    Zhang, Dalin
    Liu, Jiqiang
    Xing, Weiwei
    Lu, Lingyun
    Lu, Wei
    Wu, Yumei
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (05) : 1281 - 1305