Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction

被引:12
|
作者
Sun, Ying [1 ]
Jing, Xiao-Yuan [1 ,2 ,3 ]
Wu, Fei [2 ]
Li, Juanjuan [2 ]
Xing, Danlei [1 ]
Chen, Haowen [3 ]
Sun, Yanfei [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Data models; Predictive models; Machine learning; Correlation; Sun; Gallium nitride; Prediction algorithms; Cross-project defect prediction; adversarial learning; semi-supervised learning; NEURAL-NETWORKS; FRAMEWORK; MODELS; CODE;
D O I
10.1109/ACCESS.2020.2974527
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-project defect prediction (CPDP) aims to build a prediction model on existing source projects and predict the labels of target project. The data distribution difference between different projects makes CPDP very challenging. Besides, most existing CPDP methods usually require sufficient and labeled data. However, acquiring lots of labeled data for a new project is difficult while obtaining the unlabeled data is relatively easy. A desirable approach is building a prediction model on unlabeled data and labeled data. CPDP in this scenario is called cross-project semi-supervised defect prediction (CSDP). Recently, generative adversarial networks have achieved impressive results with these strong ability of learning data distribution and discriminative representation. For effectively learning the discriminative features of data from different projects, we propose a Discriminative Adversarial Feature Learning (DAFL) approach for CSDP. DAFL consists of feature transformer and project discriminator, which compete with each other. A feature transformer tries to generate feature representation, which learns the discriminant information and preserves intrinsic structure inferred from both labeled and unlabeled data. A project discriminator tries to discriminate source and target instances on the generated representation. Experiments on 16 projects show that DAFL performs significantly better than baselines.
引用
收藏
页码:32674 / 32687
页数:14
相关论文
共 50 条
  • [21] A Framework for Homogeneous Cross-Project Defect Prediction
    Goel, Lipika
    Sharma, Mayank
    Khatri, Sunil Kumar
    Damodaran, D.
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2021, 9 (01) : 52 - 68
  • [22] Assessing the Effect of Imbalanced Learning on Cross-project Software Defect Prediction
    Sohan, Md Fahimuzzman
    Jabiullah, Md Ismail
    Rahman, Sheikh Shah Mohammad Motiur
    Mahmud, S. M. Hasan
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [23] An investigation on the feasibility of cross-project defect prediction
    Zhimin He
    Fengdi Shu
    Ye Yang
    Mingshu Li
    Qing Wang
    Automated Software Engineering, 2012, 19 : 167 - 199
  • [24] Data Transformation in Cross-project Defect Prediction
    Feng Zhang
    Iman Keivanloo
    Ying Zou
    Empirical Software Engineering, 2017, 22 : 3186 - 3218
  • [25] Research on Cross-Project Software Defect Prediction Based on Machine Learning
    Wang, Baoping
    Wang, Wennan
    Zhu, Linkai
    Liu, Wenjian
    ADVANCES IN WEB-BASED LEARNING - ICWL 2021, 2021, 13103 : 160 - 165
  • [26] An investigation on the feasibility of cross-project defect prediction
    He, Zhimin
    Shu, Fengdi
    Yang, Ye
    Li, Mingshu
    Wang, Qing
    AUTOMATED SOFTWARE ENGINEERING, 2012, 19 (02) : 167 - 199
  • [27] An Investigation of Imbalanced Ensemble Learning Methods for Cross-Project Defect Prediction
    Qiu, Shaojian
    Lu, Lu
    Jiang, Siyu
    Guo, Yang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (12)
  • [28] Data Transformation in Cross-project Defect Prediction
    Zhang, Feng
    Keivanloo, Iman
    Zou, Ying
    EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (06) : 3186 - 3218
  • [29] eCPDP : Early Cross-Project Defect Prediction
    Kwon, Sunjae
    Ryu, Duksan
    Baik, Jongmoon
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 470 - 481
  • [30] MarginGAN: Adversarial Training in Semi-Supervised Learning
    Dong, Jinhao
    Lin, Tong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32