RFC: a feature selection algorithm for software defect prediction

被引:0
|
作者
XU Xiaolong [1 ]
CHEN Wen [2 ]
WANG Xinheng [3 ]
机构
[1] Jiangsu Key Laboratory of Big Data Security & Intelligent Processing, Nanjing University of Posts and Telecommunications
[2] Institute of Big Data Research at Yancheng, Nanjing University of Posts and Telecommunications
[3] School of Computing and Engineering, University of West London
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP311.5 [软件工程];
学科分类号
081202 ; 0835 ;
摘要
Software defect prediction(SDP) is used to perform the statistical analysis of historical defect data to find out the distribution rule of historical defects, so as to effectively predict defects in the new software. However, there are redundant and irrelevant features in the software defect datasets affecting the performance of defect predictors. In order to identify and remove the redundant and irrelevant features in software defect datasets, we propose Relief F-based clustering(RFC), a clusterbased feature selection algorithm. Then, the correlation between features is calculated based on the symmetric uncertainty. According to the correlation degree, RFC partitions features into k clusters based on the k-medoids algorithm, and finally selects the representative features from each cluster to form the final feature subset. In the experiments, we compare the proposed RFC with classical feature selection algorithms on nine National Aeronautics and Space Administration(NASA) software defect prediction datasets in terms of area under curve(AUC) and Fvalue. The experimental results show that RFC can effectively improve the performance of SDP.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 50 条
  • [41] An Attribute Selection Process for Software Defect Prediction
    Khan, Jobaer Islam
    Ul Gias, Alim
    Siddik, Md. Saeed
    Rahman, Md. Habibur
    Khaled, Shah Mostafa
    Shoyaib, Mohammad
    2014 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2014,
  • [42] A Framework for Software Defect Prediction and Metric Selection
    Huda, Shamsul
    Alyahya, Sultan
    Ali, Mohsin
    Ahmad, Shafiq
    Abawajy, Jemal
    Al-Dossari, Hmood
    Yearwood, John
    IEEE ACCESS, 2018, 6 : 2844 - 2858
  • [43] Hybrid feature selection method for predicting software defect
    Anju, A.J.
    Judith, J.E.
    Journal of Engineering and Applied Science, 2024, 71 (01):
  • [44] Cross-project defect prediction method based on genetic algorithm feature selection
    Hu, Zhixi
    Zhu, Yi
    ENGINEERING REPORTS, 2023, 5 (12)
  • [45] An Optimized Feature Selection Method Using Ensemble Classifiers in Software Defect Prediction for Healthcare Systems
    Mohammad, Uzma Ghulam
    Imtiaz, Salma
    Shakya, Manoj
    Almadhor, Ahmad
    Anwar, Fareeha
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [46] A Feature Selection-Based K-NN Model for Fast Software Defect Prediction
    Awotunde, Joseph Bamidele
    Misra, Sanjay
    Adeniyi, Abidemi Emmanuel
    Abiodun, Moses Kazeem
    Kaushik, Manju
    Lawrence, Morolake Oladayo
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV, 2022, 13380 : 49 - 61
  • [47] An empirical study on pareto based multi-objective feature selection for software defect prediction
    Ni, Chao
    Chen, Xiang
    Wu, Fangfang
    Shen, Yuxiang
    Gu, Qing
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 215 - 238
  • [48] Software defect prediction using wrapper-based dynamic arithmetic optimization for feature selection
    Anand, Kunal
    Jena, Ajay Kumar
    Das, Himansu
    Askar, S. S.
    Abouhawwash, Mohamed
    CONNECTION SCIENCE, 2025, 37 (01)
  • [49] A comparative study of iterative and non-iterative feature selection techniques for software defect prediction
    Taghi M. Khoshgoftaar
    Kehan Gao
    Amri Napolitano
    Randall Wald
    Information Systems Frontiers, 2014, 16 : 801 - 822
  • [50] Improved Bayesian regularisation using neural networks based on feature selection for software defect prediction
    Jayanthi, R.
    Florence, M. Lilly
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2019, 60 (03) : 225 - 241