Analysis of a Random Forests Model

被引:0
|
作者
Biau, Gerard [1 ,2 ]
机构
[1] Univ Paris 06, LSTA & LPMA, F-75252 Paris 05, France
[2] Ecole Normale Super, DMA, F-75230 Paris 05, France
关键词
random forests; randomization; sparsity; dimension reduction; consistency; rate of convergence; REGRESSION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Random forests are a scheme proposed by Leo Breiman in the 2000's for building a predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we offer an in-depth analysis of a random forests model suggested by Breiman (2004), which is very close to the original algorithm. We show in particular that the procedure is consistent and adapts to sparsity, in the sense that its rate of convergence depends only on the number of strong features and not on how many noise variables are present.
引用
收藏
页码:1063 / 1095
页数:33
相关论文
共 50 条
  • [1] Analysis of a random forests model
    Biau, Gérard
    Journal of Machine Learning Research, 2012, 13 : 1063 - 1095
  • [2] Sharp Analysis of a Simple Model for Random Forests
    Klusowski, Jason M.
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 757 - +
  • [3] Random Forests and Networks Analysis
    Avena, Luca
    Castell, Fabienne
    Gaudilliere, Alexandre
    Melot, Clothilde
    JOURNAL OF STATISTICAL PHYSICS, 2018, 173 (3-4) : 985 - 1027
  • [4] Random Forests and Networks Analysis
    Luca Avena
    Fabienne Castell
    Alexandre Gaudillière
    Clothilde Mélot
    Journal of Statistical Physics, 2018, 173 : 985 - 1027
  • [5] Model Class Reliance for Random Forests
    Smith, Gavin
    Mansilla, Roberto
    Goulding, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Random forests for genomic data analysis
    Chen, Xi
    Ishwaran, Hemant
    GENOMICS, 2012, 99 (06) : 323 - 329
  • [7] Selective of informative metabolites using random forests based on model population analysis
    Huang, Jian-Hua
    Yan, Jun
    Wu, Qing-Hua
    Ferro, Miguel Duarte
    Yi, Lun-Zhao
    Lu, Hong-Mei
    Xu, Qing-Song
    Liang, Yi-Zeng
    TALANTA, 2013, 117 : 549 - 555
  • [8] LANGUAGE MODEL ADAPTATION USING RANDOM FORESTS
    Deoras, Anoop
    Jelinek, Frederick
    Su, Yi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5198 - 5201
  • [9] Seasonal Analysis and Prediction of Wind Energy Using Random Forests and ARX Model Structures
    Lin, Yujie
    Kruger, Uwe
    Zhang, Junping
    Wang, Qi
    Lamont, Lisa
    El Chaar, Lana
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (05) : 1994 - 2002
  • [10] Random Forests for mapping and analysis of microkinetics models
    Partopour, Behnam
    Paffenroth, Randy C.
    Dixon, Anthony G.
    COMPUTERS & CHEMICAL ENGINEERING, 2018, 115 : 286 - 294