A Novel Feature Selection Method for Software Fault Prediction Model

Cited by: 0

Authors:

Cui, Can [1]

Liu, Bin [1]

Li, Guoqi [1]

Affiliation:

[1] Beihang Univ, Sch Reliabil & Syst Engn, 37 Xueyuan Rd, Beijing 100191, Peoples R China

Source:

2019 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2019) - R & M IN THE SECOND MACHINE AGE - THE CHALLENGE OF CYBER PHYSICAL SYSTEMS | 2019

Keywords:

classification; data preprocessing; feature selection (FS); machine learning; software fault prediction model; QUALITY; METRICS;

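DOI:

Not available

Chinese Library Classification number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract: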

Software fault prediction (SFP) is an active topic in software engineering (SE). Machine learning (ML) has been successfully applied to SFP classification problems. However, one of the challenges in building software fault prediction models (SFPMs) is processing high-dimensional datasets, which include many irrelevant and redundant features. To address this issue, feature selection (FS) techniques, mainly wrapper methods and filter methods, are used. In this paper, we report an empirical study aimed at providing a novel approach to selecting features for SFP. First, a novel feature selection method based on correlation-based feature subset selection (CFS) is proposed. In stage 1, we use the classical CFS to select features. Then, in stage 2, we propose a method for calculating the similarity of feature occurrence frequency to further remove useless features. Second, to validate the novel FS approach, we compare our method with three other FS techniques. For the comparison, 38 releases of 10 Java open-source projects collected from the PROMISE repository are used. In addition, 10 releases of 10 projects, a total of 10 different software fault datasets, are randomly selected. All the data subsets selected by the FS approaches are applied to five typical ML classifiers. The final prediction performance results suggest that our proposed method mostly performs better than the three other FS methods. Therefore, the novel feature selection approach is feasible. To sum up, the method can be used to delete irrelevant and redundant features, obtain useful data subsets, and construct well-performing SFPMs. The results of SFP can provide useful advice for other SE activities, such as software testing and software quality assurance. Although the current method is effective, it still has some limitations. Our future work is to test the statistical significance of the classification results to further demonstrate the feasibility of the idea.
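
The abstract does not give the details of either stage. As a rough illustration of stage 1 only, the sketch below implements the standard CFS merit heuristic, merit = k * r_cf / sqrt(k + k * (k - 1) * r_ff), with a greedy forward search. Several choices here are assumptions rather than the authors' setup: absolute Pearson correlation stands in for the correlation measure (CFS implementations commonly use symmetrical uncertainty on discretized features), the function names `cfs_merit` and `cfs_forward_select` and the `tol` parameter are hypothetical, and features are assumed to be non-constant. Stage 2 (similarity of feature occurrence frequency) is not shown because the abstract does not specify it.

```python
import numpy as np

def cfs_merit(X, y, subset):
    """CFS merit of a feature subset: k*r_cf / sqrt(k + k*(k-1)*r_ff)."""
    k = len(subset)
    if k == 0:
        return 0.0
    # Mean absolute feature-class correlation (Pearson, as an assumption).
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    # Mean absolute feature-feature correlation over distinct pairs.
    if k == 1:
        r_ff = 0.0
    else:
        pairs = [abs(np.corrcoef(X[:, a], X[:, b])[0, 1])
                 for i, a in enumerate(subset) for b in subset[i + 1:]]
        r_ff = np.mean(pairs)
    return k * r_cf / np.sqrt(k + k * (k - 1) * r_ff)

def cfs_forward_select(X, y, tol=1e-12):
    """Greedy forward search: repeatedly add the feature that raises the merit most."""
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        score, j = max((cfs_merit(X, y, selected + [j]), j) for j in remaining)
        if score <= best + tol:
            break  # no candidate improves the merit beyond rounding noise
        selected.append(j)
        remaining.remove(j)
        best = score
    return selected, best
```

Calling `cfs_forward_select(X, y)` on a NumPy feature matrix `X` (rows are modules, columns are metrics) and a binary fault label vector `y` returns the selected column indices and the merit of that subset. The merit rewards features correlated with the label and penalizes features correlated with each other, which is how CFS discards redundant metrics.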


Pages: 6

50 items in total

[1] A Hybrid Feature Selection Method for Software Fault Prediction
Jian, Yiheng
Yu, Xiao
Xu, Zhou
Ma, Ziyi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (10) : 1966 - 1975
[2] An AIS Based Feature Selection Method For Software Fault Prediction
Soleimani, A.
Asdaghi, F.
[J]. 2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014
[3] Metaheuristic feature selection for software fault prediction
Kumar, Kulamala Vinod
Kumari, Priyanka
Rao, Madhuri
Mohapatra, Durga Prasad
[J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05) : 1013 - 1020
[4] FECS: a Cluster based Feature Selection Method for Software Fault Prediction with Noises
Liu, Wangshu
Liu, Shulong
Gu, Qing
Chen, Xiang
Chen, Daoxu
[J]. 39TH ANNUAL IEEE COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2015), VOL 2, 2015 : 276 - 281
[5] A feature selection model for prediction of software defects
Kumar, Amit
Kumar, Yugal
Kukkar, Ashima
[J]. INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2020, 13 (01) : 28 - 39
[6] Majority Vote Feature Selection Algorithm in Software Fault Prediction
Borandag, Emin
Ozcift, Akin
Kilinc, Deniz
Yucalar, Fatih
[J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2019, 16 (02) : 515 - 539
[7] Combining feature selection, feature learning and ensemble learning for software fault prediction
Hung Duy Tran
Le Thi My Hanh
Nguyen Thanh Binh
[J]. PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019 : 78 - 85
[8] A Novel Feature Selection Method for Fault Diagnosis
Voulgaris, Zacharias
Sconyers, Chris
[J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 262 - 269
[9] Feature Selection Using Golden Jackal Optimization for Software Fault Prediction
Das, Himansu
Prajapati, Sanjay
Gourisaria, Mahendra Kumar
Pattanayak, Radha Mohan
Alameen, Abdalla
Kolhar, Manjur
[J]. MATHEMATICS, 2023, 11 (11)
[10] Novel Feature Selection Algorithm for Thermal Prediction Model
Abad, Javad Mohebbi Najm
Soleimani, Ali
[J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (10) : 1831 - 1844
