A Novel Feature Selection Method for Software Fault Prediction Model

被引:0
|
作者
Cui, Can [1 ]
Liu, Bin [1 ]
Li, Guoqi [1 ]
机构
[1] Beihang Univ, Sch Reliabil & Syst Engn, 37 Xueyuan Rd, Beijing 100191, Peoples R China
关键词
classification; data preprocessing; feature selection (FS); machine learning; software fault prediction model; QUALITY; METRICS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Software fault prediction (SFP) is an active issue in software engineering (SE). At present, machine learning (ML) has been successfully applied to SFP classification problems. However, one of the challenges for building software fault prediction models (SFPM) is processing high dimensional datasets, which include many irrelevant and redundant features. To address this issue, feature selection techniques, mainly contain wrapper methods and filter methods, are used. In the paper, we report an empirical study aimed at providing a novel approach to select feature for SFP. First of all, a novel feature selection method based on correlation-based feature subset selection (CFS) is proposed. In stage 1, we use the classical CFS to selected features. Then in stage 2, we propose a method for calculating similarity of feature occurrence frequency to further decrease the usefulness features. Second, to validate the novel FS approach, we compare our method with other three FS techniques. For comparison, 38 releases of 10 Java open source projects collected from the PROMISE repository are used in our proposed method. In addition, 10 releases of 10 projects, a total of 10 different software fault data sets are randomly selected. All the selected data subsets after FS approaches are applied to five typical ML classifiers. The final prediction performance results suggest that our proposed method performs mostly better than other three FS methods. Therefore, the novel feature selection approach is feasible. To sum up, we can use the method to delete irrelevant and redundant features to gain useful data subsets and construct well-performed SFPM. The results of SFP can provide useful advice for other SE activities, such as software testing, software quality assurance. Although the current method is effective, it still has some limitations. Our future work is to test the statistical significance of the classification results to further prove the feasibility of the idea.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Hybrid Feature Selection Method for Software Fault Prediction
    Jian, Yiheng
    Yu, Xiao
    Xu, Zhou
    Ma, Ziyi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (10) : 1966 - 1975
  • [2] An AIS Based Feature Selection Method For Software Fault Prediction
    Soleimani, A.
    Asdaghi, F.
    [J]. 2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [3] Metaheuristic feature selection for software fault prediction
    Kumar, Kulamala Vinod
    Kumari, Priyanka
    Rao, Madhuri
    Mohapatra, Durga Prasad
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05): : 1013 - 1020
  • [4] FECS: a Cluster based Feature Selection Method for Software Fault Prediction with Noises
    Liu, Wangshu
    Liu, Shulong
    Gu, Qing
    Chen, Xiang
    Chen, Daoxu
    [J]. 39TH ANNUAL IEEE COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2015), VOL 2, 2015, : 276 - 281
  • [5] A feature selection model for prediction of software defects
    Kumar, Amit
    Kumar, Yugal
    Kukkar, Ashima
    [J]. INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2020, 13 (01) : 28 - 39
  • [6] Majority Vote Feature Selection Algorithm in Software Fault Prediction
    Borandag, Emin
    Ozcift, Akin
    Kilinc, Deniz
    Yucalar, Fatih
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2019, 16 (02) : 515 - 539
  • [7] Combining feature selection, feature learning and ensemble learning for software fault prediction
    Hung Duy Tran
    Le Thi My Hanh
    Nguyen Thanh Binh
    [J]. PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 78 - 85
  • [8] A Novel Feature Selection Method for Fault Diagnosis
    Voulgaris, Zacharias
    Sconyers, Chris
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 262 - 269
  • [9] Feature Selection Using Golden Jackal Optimization for Software Fault Prediction
    Das, Himansu
    Prajapati, Sanjay
    Gourisaria, Mahendra Kumar
    Pattanayak, Radha Mohan
    Alameen, Abdalla
    Kolhar, Manjur
    [J]. MATHEMATICS, 2023, 11 (11)
  • [10] Novel Feature Selection Algorithm for Thermal Prediction Model
    Abad, Javad Mohebbi Najm
    Soleimani, Ali
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (10) : 1831 - 1844