New Feature Selection Algorithm Based on Feature Stability and Correlation

Cited by: 3
Author
Al-Shalabi, Luai [1]
Affiliation
[1] Arab Open Univ, Fac Comp Studies, Al Ardia 92400, Kuwait
Keywords
Feature extraction; Classification algorithms; Machine learning algorithms; Dimensionality reduction; Correlation; Filtering theory; Information filters; feature selection; stability; FILTER; CLASSIFICATION; REDUCTION; DATASETS
DOI
10.1109/ACCESS.2022.3140209
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Analyzing large, high-dimensional datasets with many rows and columns increases the load on machine learning algorithms. Such data are likely to contain noise, which degrades the performance of those algorithms. Feature selection (FS) is one of the most essential machine learning techniques for addressing this problem. It tries to identify and eliminate as much irrelevant information as possible, retaining only a minimum subset of appropriate features. It plays an important role in improving the accuracy of machine learning algorithms, and it also reduces computational complexity, run time, storage, and cost. In this paper, a new feature selection algorithm based on feature stability and correlation is proposed to select an effective minimum subset of appropriate features. The efficiency of the proposed algorithm was evaluated by comparing it with other state-of-the-art dimensionality reduction (DR) algorithms on benchmark datasets. The evaluation criteria included the size of the minimum subset, the classification accuracy, the F-measure, and the area under the curve (AUC). The results showed that the proposed algorithm leads the compared algorithms in reducing a given dataset while retaining high predictive accuracy.
Pages: 4699-4713
Page count: 15