Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction

Cited by: 12

Authors:

Sun, Ying [1]

Jing, Xiao-Yuan [1, 2, 3]

Wu, Fei [2]

Li, Juanjuan [2]

Xing, Danlei [1]

Chen, Haowen [3]

Sun, Yanfei [2]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China

[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

Source:

IEEE ACCESS | 2020 / Volume 8

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China;

Keywords:

Data models; Predictive models; Machine learning; Correlation; Sun; Gallium nitride; Prediction algorithms; Cross-project defect prediction; adversarial learning; semi-supervised learning; NEURAL-NETWORKS; FRAMEWORK; MODELS; CODE;

DOI:

10.1109/ACCESS.2020.2974527

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Cross-project defect prediction (CPDP) aims to build a prediction model on existing source projects and predict the labels of a target project. The difference in data distribution between projects makes CPDP very challenging. In addition, most existing CPDP methods require sufficient labeled data. However, acquiring a large amount of labeled data for a new project is difficult, while obtaining unlabeled data is relatively easy. A desirable approach is to build a prediction model on both unlabeled and labeled data. CPDP in this scenario is called cross-project semi-supervised defect prediction (CSDP). Recently, generative adversarial networks have achieved impressive results owing to their strong ability to learn data distributions and discriminative representations. To effectively learn the discriminative features of data from different projects, we propose a Discriminative Adversarial Feature Learning (DAFL) approach for CSDP. DAFL consists of a feature transformer and a project discriminator, which compete with each other. The feature transformer tries to generate a feature representation that captures discriminant information and preserves the intrinsic structure inferred from both labeled and unlabeled data. The project discriminator tries to distinguish source and target instances on the generated representation. Experiments on 16 projects show that DAFL performs significantly better than the baselines.
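The competition between a feature transformer and a project discriminator described in the abstract can be illustrated with a toy sketch. This is not the authors' DAFL implementation: the linear transformer, the logistic discriminator and classifier, the domain-confusion objective used for the transformer, the synthetic two-feature data, and every function and parameter name below are assumptions made for illustration only. The structure-preserving term mentioned in the abstract is omitted.

```python
import math
import random


def sigmoid(v):
    """Numerically stable logistic function."""
    if v >= 0:
        return 1.0 / (1.0 + math.exp(-v))
    e = math.exp(v)
    return e / (1.0 + e)


def make_project(n, shift, rng):
    """Hypothetical toy project: x[0] carries the defect signal,
    x[1] is a project-specific offset a discriminator can exploit."""
    rows = []
    for _ in range(n):
        label = rng.randint(0, 1)
        x0 = (1.0 if label else -1.0) + rng.gauss(0.0, 0.3)
        x1 = shift + rng.gauss(0.0, 0.3)
        rows.append(((x0, x1), label))
    return rows


def transform(w, x):
    """Feature transformer (assumed linear): z = w . x."""
    return w[0] * x[0] + w[1] * x[1]


def train_adversarial(source, target, epochs=100, lr=0.05, adv_weight=1.0):
    """Alternate discriminator and transformer updates.

    source: labeled rows ((x0, x1), label); target: rows whose labels
    are ignored, standing in for unlabeled target-project data.
    """
    w = [0.1, 0.1]        # feature transformer weights
    d_w, d_b = 0.0, 0.0   # project discriminator: P(source | z)
    c_w, c_b = 0.0, 0.0   # defect classifier on z
    domain_rows = [(x, 1.0) for x, _ in source] + [(x, 0.0) for x, _ in target]
    for _ in range(epochs):
        # Step 1: the discriminator learns to tell source from target.
        for x, domain in domain_rows:
            z = transform(w, x)
            err = sigmoid(d_w * z + d_b) - domain
            d_w -= lr * err * z
            d_b -= lr * err
        # Step 2: the transformer pushes the discriminator output toward
        # 0.5 on every instance (a domain-confusion objective).
        for x, _ in domain_rows:
            z = transform(w, x)
            g_z = adv_weight * (sigmoid(d_w * z + d_b) - 0.5) * d_w
            w[0] -= lr * g_z * x[0]
            w[1] -= lr * g_z * x[1]
        # Step 3: classifier and transformer fit the labeled source data,
        # which keeps the representation discriminative.
        for x, y in source:
            z = transform(w, x)
            err = sigmoid(c_w * z + c_b) - y
            g_z = err * c_w
            c_w -= lr * err * z
            c_b -= lr * err
            w[0] -= lr * g_z * x[0]
            w[1] -= lr * g_z * x[1]
    return w, (d_w, d_b), (c_w, c_b)


def defect_probability(w, clf, x):
    """Predicted defect probability for one instance."""
    return sigmoid(clf[0] * transform(w, x) + clf[1])


if __name__ == "__main__":
    rng = random.Random(0)
    src = make_project(30, 0.0, rng)
    tgt = make_project(30, 1.0, rng)
    w, disc, clf = train_adversarial(src, tgt)
    print("transformer weights:", w)
    print("target example:", defect_probability(w, clf, tgt[0][0]))
```

The three steps in each epoch mirror the roles named in the abstract: the discriminator is trained on the current representation, the transformer is then updated against it, and the labeled data anchors the representation to the defect labels. Whether this toy converges to a project-invariant representation depends on the chosen learning rate and weights, which are arbitrary here.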


Pages: 32674-32687

Number of pages: 14
