Identifying feature relevance using a random forest

被引:0
|
作者
Rogers, Jeremy [1 ]
Gunn, Steve [1 ]
机构
[1] Univ Southampton, Image Speech & Intelligent Res Grp, Sch Elect & Comp Sci, Southampton SO9 5NH, Hants, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is known that feature selection and feature relevance can benefit the performance and interpretation of machine learning algorithms. Here we consider feature selection within a Random Forest framework. A feature selection technique is introduced that combines hypothesis testing with an approximation to the expected performance of an irrelevant feature during Random Forest construction. It is demonstrated that the lack of implicit feature selection within Random Forest has an adverse effect on the accuracy and efficiency of the algorithm. It is also shown that irrelevant features can slow the rate of error convergence and a theoretical justification of this effect is given.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [1] iEnhancer-RF: Identifying enhancers and their strength by enhanced feature representation using random forest
    Lim, Dae Yeong
    Khanal, Jhabindra
    Tayara, Hilal
    Chong, Kil To
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 212
  • [2] Lightweight surrogate random forest support for model simplification and feature relevance
    Kim, Sangwon
    Jeong, Mira
    Ko, Byoung Chul
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 471 - 481
  • [3] Lightweight surrogate random forest support for model simplification and feature relevance
    Sangwon Kim
    Mira Jeong
    Byoung Chul Ko
    [J]. Applied Intelligence, 2022, 52 : 471 - 481
  • [4] Feature selection and classification of leukocytes using random forest
    Saraswat, Mukesh
    Arya, K. V.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2014, 52 (12) : 1041 - 1052
  • [5] Feature selection and classification of leukocytes using random forest
    Mukesh Saraswat
    K. V. Arya
    [J]. Medical & Biological Engineering & Computing, 2014, 52 : 1041 - 1052
  • [6] CLASSIFICATION OF URBAN ENVIRONMENTS USING FEATURE EXTRACTION AND RANDOM FOREST
    dos Anjos, Camila Souza
    Lacerda, Marielcio Goncalves
    Andrade, Leidiane do Livramento
    Salles, Roberto Neves
    [J]. 2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1205 - 1208
  • [7] Quantifying Feature Importance for Detecting Depression using Random Forest
    AlSagri, Hatoon
    Ykhlef, Mourad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 628 - 635
  • [8] Identifying Skype Traffic by Random Forest
    Li Jun
    Zhang Shunyi
    Xuan Ye
    Sun Yanfei
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 2841 - +
  • [9] Computational Method for Identifying Malonylation Sites by Using Random Forest Algorithm
    Wang, ShaoPeng
    Li, JiaRui
    Sun, Xijun
    Zhang, Yu-Hang
    Huang, Tao
    Cai, Yu-Dong
    [J]. COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2020, 23 (04) : 304 - 312
  • [10] iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
    Zhao, Dongxu
    Teng, Zhixia
    Li, Yanjuan
    Chen, Dong
    [J]. FRONTIERS IN GENETICS, 2021, 12