Research on Feature Selection Methods based on Random Forest

被引:7
|
作者
Wang, Zhuo [1 ]
机构
[1] Nanchang Univ, Sch Software, Nanchang, Jiangxi, Peoples R China
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2023年 / 30卷 / 02期
关键词
feature selection; irrelevant; random forest; redundant; ALGORITHM; CLASSIFICATION; PREDICTION;
D O I
10.17559/TV-20220823104912
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Aiming to deal with the irrelevant or redundant features, this paper proposes eight kinds of feature selection methods. The first seven feature selection methods include CART and Random Forests (CART-RF), CHIAD and Random Forests (CHIAD-RF), SVM and Random Forests (SVM-RF), Bayesian Network and Random Forests (BN-RF), neural Network and Random Forests (NN-RF), K-Means and Random Forests (K-Means-RF) and Kohonen and Random Forests (Kohonen-RF). These methods use CART, CHAID, SVM, BN, NN, K-Means and Kohonen to evaluate the importance and ranking of features, and then obtain feature subsets through RF algorithm. The eighth method is named hybrid integration methods and random forests (Integrate-RF). Integrate-RF uses the average importance of the seven methods and the optimal features subset can be selected based on the OOB data classification error rate. Experimental results indicate that feature selection methods proposed in this article can effectively select features and reduce the data dimension.
引用
收藏
页码:623 / 633
页数:11
相关论文
共 50 条
  • [1] Feature selection algorithm based on random forest
    Yao, Deng-Ju
    Yang, Jing
    Zhan, Xiao-Juan
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2014, 44 (01): : 137 - 141
  • [2] A review of random forest-based feature selection methods for data science education and applications
    Iranzad, Reza
    Liu, Xiao
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [3] IoT Intrusion Detection Using Modified Random Forest Based on Double Feature Selection Methods
    Hussein, Adil Yousef
    Falcarin, Paolo
    Sadiq, Ahmed T.
    [J]. EMERGING TECHNOLOGY TRENDS IN INTERNET OF THINGS AND COMPUTING, TIOTC 2021, 2022, : 61 - 78
  • [4] Train delays prediction based on feature selection and random forest
    Ji, Yuanyuan
    Zheng, Wei
    Dong, Hairong
    Gao, Pengfei
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [5] A New Noisy Random Forest Based Method for Feature Selection
    Akhiat, Yassine
    Manzali, Youness
    Chahhou, Mohamed
    Zinedine, Ahmed
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2021, 21 (02) : 10 - 28
  • [6] Distance Correlation-Based Feature Selection in Random Forest
    Ratnasingam, Suthakaran
    Munoz-Lopez, Jose
    [J]. ENTROPY, 2023, 25 (09)
  • [7] Microgrid fault classification based on random forest feature selection
    Wang, Changhong
    Gao, Yanjie
    Tang, Min
    [J]. REVIEWS OF ADHESION AND ADHESIVES, 2023, 11 (02): : 220 - 237
  • [8] Intrusion Detection Model Based on Feature Selection and Random Forest
    Dong, Rui Hong
    Shui, Yong Li
    Zhang, Qiu Yu
    [J]. International Journal of Network Security, 2021, 23 (06) : 985 - 996
  • [9] Random Forest-based feature selection for emotion recognition
    Gharsalli, Sonia
    Emile, Bruno
    Laurent, Helene
    Desquesnes, Xavier
    Vivet, Damien
    [J]. 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, THEORY, TOOLS AND APPLICATIONS 2015, 2015, : 268 - 272
  • [10] Research on the Application of Random Forest-based Feature Selection Algorithm in Data Mining Experiments
    Wang, Huan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 505 - 518