Unsupervised methods for Software Defect Prediction

被引:4
|
作者
Ha, Duy-An [1 ]
Chen, Ting-Hsuan [2 ]
Yuan, Shyan-Ming [3 ]
机构
[1] Natl Chiao Tung Univ, EECS Int Grad Program, Hsinchu, Taiwan
[2] Natl Chiao Tung Univ, Inst Comp Sci & Engn, Hsinchu, Taiwan
[3] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
Software Defect Prediction; Unsupervised Learning; Community Structure Detection; Software Engineering; Machine Learning; NONNEGATIVE MATRIX; ALGORITHMS;
D O I
10.1145/3368926.3369711
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Software Defect Prediction (SDP) aims to assess software quality by using machine learning techniques. Recently, by proposing the connectivity-based unsupervised learning method, Zhang et al. have been proven that unsupervised classification has great potential to apply to this problem. Inspiring by this idea, in our work we try to replicate the results of Zhang et al.'s experiment and attempt to improve the performance by examining different techniques at each step of the approach using unsupervised learning methods to solve the SDP problem. Specifically, we try to follow the steps of the experiment described in their work strictly and examine three other clustering methods with four other ways for feature selection besides using all. To the best of our knowledge, these methods are first applied in SDP to evaluate their predictive power. For replicating the results, generally results in our experiments are not as good as the previous work. It may be due to we do not know which features are used in their experiment exactly. Fluid clustering and spectral clustering give better results than Newman clustering and CNM clustering in our experiments. Additionally, the experiments also show that using Kernel Principal Component Analysis (KPCA) or Non-Negative Matrix Factorization (NMF) for feature selection step gives better performance than using all features in the case of unlabeled data. Lastly, to make replicating our work easy, a lightweight framework is created and released on Github.
引用
收藏
页码:49 / 55
页数:7
相关论文
共 50 条
  • [1] Software defect number prediction: Unsupervised vs supervised methods
    Chen, Xiang
    Zhang, Dun
    Zhao, Yingquan
    Cui, Zhanqi
    Ni, Chao
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 106 : 161 - 181
  • [2] A systematic review of unsupervised learning techniques for software defect prediction
    Li, Ning
    Shepperd, Martin
    Guo, Yuchen
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2020, 122
  • [3] UDA-DP: Unsupervised Domain Adaptation for Software Defect Prediction
    Huang, Xiaosong
    Wu, Yifan
    Liu, Hongyi
    Li, Ying
    Yu, Hao
    Guo, Dadi
    Wu, Zhonghai
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, : 308 - 318
  • [4] MPT-embedding: An unsupervised representation learning of code for software defect prediction
    Shi, Ke
    Lu, Yang
    Liu, Guangliang
    Wei, Zhenchun
    Chang, Jingfei
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2021, 33 (04)
  • [5] Analysis of Feature Selection Methods in Software Defect Prediction Models
    Ali, Misbah
    Mazhar, Tehseen
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Mohsin, Syed Muhammad
    Akber, Syed Muhammad Abrar
    Ali, Mohammed
    [J]. IEEE ACCESS, 2023, 11 : 145954 - 145974
  • [6] Unsupervised software defect prediction using signed Laplacian-based spectral classifier
    Aris Marjuni
    Teguh Bharata Adji
    Ridi Ferdiana
    [J]. Soft Computing, 2019, 23 : 13679 - 13690
  • [7] Unsupervised software defect prediction using signed Laplacian-based spectral classifier
    Marjuni, Aris
    Adji, Teguh Bharata
    Ferdiana, Ridi
    [J]. SOFT COMPUTING, 2019, 23 (24) : 13679 - 13690
  • [8] Revisiting Unsupervised Learning for Defect Prediction
    Fu, Wei
    Menzies, Tim
    [J]. ESEC/FSE 2017: PROCEEDINGS OF THE 2017 11TH JOINT MEETING ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2017, : 72 - 83
  • [9] SMOTE-Based Homogeneous Ensemble Methods for Software Defect Prediction
    Balogun, Abdullateef O.
    Lafenwa-Balogun, Fatimah B.
    Mojeed, Hammed A.
    Adeyemo, Victor E.
    Akande, Oluwatobi N.
    Akintola, Abimbola G.
    Bajeh, Amos O.
    Usman-Hamza, Fatimah E.
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT VI, 2020, 12254 : 615 - 631
  • [10] Collaborative filtering based recommendation of sampling methods for software defect prediction
    Sun, Zhongbin
    Zhang, Jingqi
    Sun, Heli
    Zhu, Xiaoyan
    [J]. APPLIED SOFT COMPUTING, 2020, 90