Identifying feature relevance using a random forest

被引:0
|
作者
Rogers, Jeremy [1 ]
Gunn, Steve [1 ]
机构
[1] Univ Southampton, Image Speech & Intelligent Res Grp, Sch Elect & Comp Sci, Southampton SO9 5NH, Hants, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is known that feature selection and feature relevance can benefit the performance and interpretation of machine learning algorithms. Here we consider feature selection within a Random Forest framework. A feature selection technique is introduced that combines hypothesis testing with an approximation to the expected performance of an irrelevant feature during Random Forest construction. It is demonstrated that the lack of implicit feature selection within Random Forest has an adverse effect on the accuracy and efficiency of the algorithm. It is also shown that irrelevant features can slow the rate of error convergence and a theoretical justification of this effect is given.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [31] Feature selection algorithm based on random forest
    Yao, Deng-Ju
    Yang, Jing
    Zhan, Xiao-Juan
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2014, 44 (01): : 137 - 141
  • [32] EcmPred: Prediction of extracellular matrix proteins based on random forest with maximum relevance minimum redundancy feature selection
    Kandaswamy, Krishna Kumar
    Pugalenthi, Ganesan
    Kalies, Kai-Uwe
    Hartmann, Enno
    Martinetz, Thomas
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 317 : 377 - 383
  • [33] Feature-Weighting and Clustering Random Forest
    Liu, Zhenyu
    Wen, Tao
    Sun, Wei
    Zhang, Qilong
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 257 - 265
  • [34] Intra-feature Random Forest Clustering
    Cohen, Michael
    [J]. MACHINE LEARNING, OPTIMIZATION, AND BIG DATA, MOD 2017, 2018, 10710 : 41 - 49
  • [35] Random-Sets for Dealing with Uncertainties in Relevance Feature
    Alharbi, Abdullah Semran
    Abul Bashar, Md
    Li, Yuefeng
    [J]. AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11320 : 656 - 668
  • [36] APPLICATION OF RANDOM FOREST IN IDENTIFYING WINTER WHEAT USING LANDSAT8 IMAGERY
    Li, Xu
    Lv, Xifeng
    He, Yufeng
    Zhou, Baoping
    Deng, Jinmei
    Qin, Anzhen
    [J]. ENGENHARIA AGRICOLA, 2021, 41 (06): : 619 - 633
  • [37] Identifying the influencing factors of soil nitrous acid emissions using random forest model
    School of Electrical and Photoelectronic Engineering, West Anhui University, Luan
    237012, China
    不详
    237012, China
    不详
    230031, China
    [J]. Atmos. Environ., 2024,
  • [38] Identifying early permanent teeth caries factors in children using random forest algorithm
    Masaebi, Fatemeh
    Ghorbani, Zahra
    Looha, Mehdi Azizmohammad
    Deghatipour, Marzie
    Mohammadzadeh, Morteza
    Ahsaie, Mitra Ghazizadeh
    Asadi, Fariba
    Zayeri, Farid
    [J]. FRONTIERS IN DENTAL MEDICINE, 2024, 5
  • [39] Identifying important microbial biomarkers for the diagnosis of colon cancer using a random forest approach
    Cao, Lichao
    Wei, Shangqing
    Yin, Zongyi
    Chen, Fang
    Ba, Ying
    Weng, Qi
    Zhang, Jiahao
    Zhang, Hezi
    [J]. HELIYON, 2024, 10 (02)
  • [40] Identifying commuters based on random forest of smartcard data
    Mei, Zhenyu
    Ding, Wenchao
    Feng, Chi
    Shen, Liting
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (04) : 207 - 212