A Feature Selection-Based K-NN Model for Fast Software Defect Prediction

Cited by: 4
Authors
Awotunde, Joseph Bamidele [1]
Misra, Sanjay [2]
Adeniyi, Abidemi Emmanuel [2]
Abiodun, Moses Kazeem [1, 3]
Kaushik, Manju [4]
Lawrence, Morolake Oladayo [5]
Affiliations
[1] Univ Ilorin, Dept Comp Sci, Ilorin, Nigeria
[2] Ostfold Univ Coll, Dept Comp Sci & Commun, Halden, Norway
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Nigeria
[4] Amity Univ, Amity Inst Informat Technol, Jaipur, Rajasthan, India
[5] Baze Univ, Dept Comp Sci, Abuja, Nigeria
Keywords
Software defect prediction; Machine learning; Extreme gradient boost; Feature selection; Prediction; Software development life cycle
DOI
10.1007/978-3-031-10542-5_4
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Software Defect Prediction (SDP) is a technique for predicting software defects during the software development life cycle. Many studies have addressed SDP, but the performance of the proposed methods varies across datasets, which makes them unreliable for SDP on unseen software projects. A hybrid technique that combines feature selection with machine learning (ML) can be more efficient, because it draws on several methods to achieve better prediction accuracy on a given dataset than an individual classifier. The major issues with individual ML-based models for SDP are long detection time, vulnerability of the software project, and high dimensionality of the feature parameters. This study therefore proposes a hybrid model that uses a feature selection-enabled Extreme Gradient Boost (XGB) classifier to address these challenges. The cleaned NASA MDP datasets were used to implement the proposed model, and performance metrics including F-score, accuracy, and Matthews correlation coefficient (MCC) were used to evaluate it. Compared with state-of-the-art methods without feature selection, the proposed model performs better on the metrics used, and the results show that it outperformed all the other prediction techniques considered.
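The abstract names F-score, accuracy, and MCC as evaluation metrics. As a minimal sketch, these can be computed from the counts of a binary confusion matrix (defective vs. non-defective modules). The function name `binary_metrics`, the choice of F1 as the F-score, and the example counts are illustrative assumptions, not values or code from the paper.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, F-score (F1), and MCC for binary confusion-matrix counts.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives, and true negatives. Undefined ratios fall back to 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_score = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )

    # Matthews correlation coefficient: ranges from -1 to 1 and stays
    # informative on imbalanced data, which is common in defect datasets.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {"accuracy": accuracy, "f_score": f_score, "mcc": mcc}


# Hypothetical counts for illustration only (not results from the paper).
example = binary_metrics(tp=30, fp=10, fn=20, tn=140)
print(example)
```

Reporting MCC alongside accuracy is a common choice when defective modules are a small minority, since accuracy alone can look high for a classifier that rarely predicts the defective class.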
Pages: 49-61
Number of pages: 13