FECAR: A Feature Selection Framework for Software Defect Prediction

被引:44
|
作者
Liu, Shulong [1 ]
Chen, Xiang [1 ,2 ]
Liu, Wangshu [1 ]
Chen, Jiaqiang [1 ]
Gu, Qing [1 ]
Chen, Daoxu [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
[2] Nantong Univ, Sch Comp Sci & Technol, Nantong, Peoples R China
关键词
Software Defect Prediction; Feature Selection; Feature Clustering; Feature Ranking; CODE;
D O I
10.1109/COMPSAC.2014.66
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software defect prediction can classify new software entities into either buggy or clean. However the effectiveness of existing methods is influenced by irrelevant and redundant features. In this paper, we propose a new feature selection framework FECAR using FEature Clustering And feature Ranking. This framework firstly partitions original features into k clusters based on FF-Correlation measure. Then it selects relevant features from each cluster based on FC-Relevance measure. In empirical study, we choose Symmetric Uncertainty as FF-Correlation measure, and choose Information Gain, Chi-Square, and ReliefF as three different FC-Relevance measures. Based on some real projects Eclipse and NASA, we implemented our framework and performed empirical studies to investigate the redundancy rate and the performance of the trained defect predictors. Final results verify the effectiveness of our proposed framework and further provide a guideline for achieving cost-effective feature selection when using our framework.
引用
收藏
页码:426 / 435
页数:10
相关论文
共 50 条
  • [1] A Noise Tolerable Feature Selection Framework for Software Defect Prediction
    Liu, Wang-Shu
    Chen, Xiang
    Gu, Qing
    Liu, Shu-Long
    Chen, Dao-Xu
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2018, 41 (03): : 506 - 520
  • [2] Genetic Feature Selection for Software Defect Prediction
    Wahono, Romi Satria
    Herman, Nanna Suryana
    [J]. ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 239 - 244
  • [3] A Framework for Software Defect Prediction and Metric Selection
    Huda, Shamsul
    Alyahya, Sultan
    Ali, Mohsin
    Ahmad, Shafiq
    Abawajy, Jemal
    Al-Dossari, Hmood
    Yearwood, John
    [J]. IEEE ACCESS, 2018, 6 : 2844 - 2858
  • [4] Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning
    Ali, Misbah
    Mazhar, Tehseen
    Al-Rasheed, Amal
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Khan, Muhammad Amir
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [5] RFC: a feature selection algorithm for software defect prediction
    Xu Xiaolong
    Chen Wen
    Wang Xinheng
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (02) : 389 - 398
  • [6] Feature Selection in Software Defect Prediction: A Comparative Study
    Kakkar, Misha
    Jain, Sarika
    [J]. 2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 658 - 663
  • [7] Feature Selection with Imbalanced Data for Software Defect Prediction
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 235 - +
  • [8] RFC: a feature selection algorithm for software defect prediction
    XU Xiaolong
    CHEN Wen
    WANG Xinheng
    [J]. Journal of Systems Engineering and Electronics, 2021, 32 (02) : 389 - 398
  • [9] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    [J]. 2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480
  • [10] Analysis of Feature Selection Methods in Software Defect Prediction Models
    Ali, Misbah
    Mazhar, Tehseen
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Mohsin, Syed Muhammad
    Akber, Syed Muhammad Abrar
    Ali, Mohammed
    [J]. IEEE ACCESS, 2023, 11 : 145954 - 145974