RFC: a feature selection algorithm for software defect prediction

被引:0
|
作者
XU Xiaolong [1 ]
CHEN Wen [2 ]
WANG Xinheng [3 ]
机构
[1] Jiangsu Key Laboratory of Big Data Security & Intelligent Processing, Nanjing University of Posts and Telecommunications
[2] Institute of Big Data Research at Yancheng, Nanjing University of Posts and Telecommunications
[3] School of Computing and Engineering, University of West London
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP311.5 [软件工程];
学科分类号
摘要
Software defect prediction(SDP) is used to perform the statistical analysis of historical defect data to find out the distribution rule of historical defects, so as to effectively predict defects in the new software. However, there are redundant and irrelevant features in the software defect datasets affecting the performance of defect predictors. In order to identify and remove the redundant and irrelevant features in software defect datasets, we propose Relief F-based clustering(RFC), a clusterbased feature selection algorithm. Then, the correlation between features is calculated based on the symmetric uncertainty. According to the correlation degree, RFC partitions features into k clusters based on the k-medoids algorithm, and finally selects the representative features from each cluster to form the final feature subset. In the experiments, we compare the proposed RFC with classical feature selection algorithms on nine National Aeronautics and Space Administration(NASA) software defect prediction datasets in terms of area under curve(AUC) and Fvalue. The experimental results show that RFC can effectively improve the performance of SDP.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 50 条
  • [1] RFC: a feature selection algorithm for software defect prediction
    Xu Xiaolong
    Chen Wen
    Wang Xinheng
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (02) : 389 - 398
  • [2] Feature selection using firefly algorithm in software defect prediction
    Anbu, M.
    Mala, G. S. Anandha
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 5): : 10925 - 10934
  • [3] Feature selection using firefly algorithm in software defect prediction
    M. Anbu
    G. S. Anandha Mala
    Cluster Computing, 2019, 22 : 10925 - 10934
  • [4] Software Defect Prediction using Feature Selection and Random Forest Algorithm
    Ibrahim, Dyana Rashid
    Ghnemat, Rawan
    Hudaib, Amjad
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 252 - 257
  • [5] Genetic Feature Selection for Software Defect Prediction
    Wahono, Romi Satria
    Herman, Nanna Suryana
    ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 239 - 244
  • [6] Feature-Grouping-Based Two Steps Feature Selection Algorithm in Software Defect Prediction
    Du, Yuntao
    Zhang, Lu
    Shi, Jiahao
    Tang, Jingjuan
    Yin, Ying
    ICAIP 2018: 2018 THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN IMAGE PROCESSING, 2018, : 173 - 178
  • [7] FECAR: A Feature Selection Framework for Software Defect Prediction
    Liu, Shulong
    Chen, Xiang
    Liu, Wangshu
    Chen, Jiaqiang
    Gu, Qing
    Chen, Daoxu
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 426 - 435
  • [8] Feature Selection with Imbalanced Data for Software Defect Prediction
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 235 - +
  • [9] Feature Selection in Software Defect Prediction: A Comparative Study
    Kakkar, Misha
    Jain, Sarika
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 658 - 663
  • [10] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480