Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction

被引:4
|
作者
Ren, Shengbing [1 ]
Zhang, Wanying [1 ]
Munir, Hafiz Shahbaz [2 ]
Xia, Lei [1 ]
机构
[1] Cent S Univ, Sch Software, Changsha 410075, Hunan, Peoples R China
[2] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
关键词
software quality; cross-project defect prediction; multi-source; dissimilarity space; arc-cosine kernel function;
D O I
10.3390/a12010013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction is an important means to guarantee software quality. Because there are no sufficient historical data within a project to train the classifier, cross-project defect prediction (CPDP) has been recognized as a fundamental approach. However, traditional defect prediction methods use feature attributes to represent samples, which cannot avoid negative transferring, may result in poor performance model in CPDP. This paper proposes a multi-source cross-project defect prediction method based on dissimilarity space (DM-CPDP). This method not only retains the original information, but also obtains the relationship with other objects. So it can enhances the discriminant ability of the sample attributes to the class label. This method firstly uses the density-based clustering method to construct the prototype set with the cluster center of samples in the target set. Then, the arc-cosine kernel is used to calculate the sample dissimilarities between the prototype set and the source domain or the target set to form the dissimilarity space. In this space, the training set is obtained with the earth mover's distance (EMD) method. For the unlabeled samples converted from the target set, the k-Nearest Neighbor (KNN) algorithm is used to label those samples. Finally, the model is learned from training data based on TrAdaBoost method and used to predict new potential defects. The experimental results show that this approach has better performance than other traditional CPDP methods.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Data Transformation in Cross-project Defect Prediction
    Feng Zhang
    Iman Keivanloo
    Ying Zou
    [J]. Empirical Software Engineering, 2017, 22 : 3186 - 3218
  • [22] An investigation on the feasibility of cross-project defect prediction
    Zhimin He
    Fengdi Shu
    Ye Yang
    Mingshu Li
    Qing Wang
    [J]. Automated Software Engineering, 2012, 19 : 167 - 199
  • [23] Multi-source Cross Project Defect Prediction with Joint Wasserstein Distance and Ensemble Learning
    Zou, Quanyi
    Lu, Lu
    Yang, Zhanyu
    Xu, Hao
    [J]. 2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 57 - 68
  • [24] An investigation on the feasibility of cross-project defect prediction
    He, Zhimin
    Shu, Fengdi
    Yang, Ye
    Li, Mingshu
    Wang, Qing
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2012, 19 (02) : 167 - 199
  • [25] Data Transformation in Cross-project Defect Prediction
    Zhang, Feng
    Keivanloo, Iman
    Zou, Ying
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (06) : 3186 - 3218
  • [26] eCPDP : Early Cross-Project Defect Prediction
    Kwon, Sunjae
    Ryu, Duksan
    Baik, Jongmoon
    [J]. 2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 470 - 481
  • [27] Cross-Project Defect Prediction: A Literature Review
    Pal, Sourabh
    Sillitti, Alberto
    [J]. IEEE ACCESS, 2022, 10 : 118697 - 118717
  • [28] An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction
    Amasaki, Sousuke
    Aman, Hirohisa
    Yokogawa, Tomoyuki
    [J]. PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2022, 2022, 13709 : 433 - 448
  • [29] Cross-project defect prediction based on G-LSTM model
    Xing, Ying
    Qian, Xiaomeng
    Guan, Yu
    Yang, Bin
    Zhang, Yuwei
    [J]. PATTERN RECOGNITION LETTERS, 2022, 160 : 50 - 57
  • [30] Cross-Project Defect Prediction Based on Domain Adaptation and LSTM Optimization
    Javed, Khadija
    Ren, Shengbing
    Asim, Muhammad
    Wani, Mudasir Ahmad
    [J]. ALGORITHMS, 2024, 17 (05)