Unsupervised methods for Software Defect Prediction

被引:4
|
作者
Ha, Duy-An [1 ]
Chen, Ting-Hsuan [2 ]
Yuan, Shyan-Ming [3 ]
机构
[1] Natl Chiao Tung Univ, EECS Int Grad Program, Hsinchu, Taiwan
[2] Natl Chiao Tung Univ, Inst Comp Sci & Engn, Hsinchu, Taiwan
[3] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
Software Defect Prediction; Unsupervised Learning; Community Structure Detection; Software Engineering; Machine Learning; NONNEGATIVE MATRIX; ALGORITHMS;
D O I
10.1145/3368926.3369711
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Software Defect Prediction (SDP) aims to assess software quality by using machine learning techniques. Recently, by proposing the connectivity-based unsupervised learning method, Zhang et al. have been proven that unsupervised classification has great potential to apply to this problem. Inspiring by this idea, in our work we try to replicate the results of Zhang et al.'s experiment and attempt to improve the performance by examining different techniques at each step of the approach using unsupervised learning methods to solve the SDP problem. Specifically, we try to follow the steps of the experiment described in their work strictly and examine three other clustering methods with four other ways for feature selection besides using all. To the best of our knowledge, these methods are first applied in SDP to evaluate their predictive power. For replicating the results, generally results in our experiments are not as good as the previous work. It may be due to we do not know which features are used in their experiment exactly. Fluid clustering and spectral clustering give better results than Newman clustering and CNM clustering in our experiments. Additionally, the experiments also show that using Kernel Principal Component Analysis (KPCA) or Non-Negative Matrix Factorization (NMF) for feature selection step gives better performance than using all features in the case of unlabeled data. Lastly, to make replicating our work easy, a lightweight framework is created and released on Github.
引用
收藏
页码:49 / 55
页数:7
相关论文
共 50 条
  • [31] Classifier Evaluation for Software Defect Prediction
    Kou, Gang
    Peng, Yi
    Shi, Yong
    Wu, Wenshuai
    [J]. STUDIES IN INFORMATICS AND CONTROL, 2012, 21 (02): : 117 - 126
  • [32] Open Issues in Software Defect Prediction
    Arora, Ishani
    Tetarwal, Vivek
    Saha, Anju
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 906 - 912
  • [33] A defect prediction method for software versioning
    Yomi Kastro
    Ayşe Basar Bener
    [J]. Software Quality Journal, 2008, 16 : 543 - 562
  • [34] A Systematic Review on Software Defect Prediction
    Singh, Pradeep Kumar
    Agarwal, Dishti
    Gupta, Aakriti
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1793 - 1797
  • [35] Progress on approaches to software defect prediction
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    [J]. IET SOFTWARE, 2018, 12 (03) : 161 - 175
  • [36] On the Costs and Profit of Software Defect Prediction
    Herbold, Steffen
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (11) : 2617 - 2631
  • [37] Software defect prediction via LSTM
    Deng, Jiehan
    Lu, Lu
    Qiu, Shaojian
    [J]. IET SOFTWARE, 2020, 14 (04) : 443 - 450
  • [38] Software Defect Prediction for LSI Designs
    Parizy, Matthieu
    Takayama, Koichiro
    Kanazawa, Yuji
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 565 - 568
  • [39] Research Progress of Software Defect Prediction
    Gong L.-N.
    Jiang S.-J.
    Jiang L.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (10): : 3090 - 3114
  • [40] Progress in Automated Software Defect Prediction
    Ostrand, Thomas J.
    Weyuker, Elaine J.
    [J]. HARDWARE AND SOFTWARE: VERIFICATION AND TESTING, PROCEEDINGS, 2009, 5394 : 200 - 204