Analysis of a Random Forests Model

被引:0
|
作者
Biau, Gerard [1 ,2 ]
机构
[1] Univ Paris 06, LSTA & LPMA, F-75252 Paris 05, France
[2] Ecole Normale Super, DMA, F-75230 Paris 05, France
关键词
random forests; randomization; sparsity; dimension reduction; consistency; rate of convergence; REGRESSION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Random forests are a scheme proposed by Leo Breiman in the 2000's for building a predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we offer an in-depth analysis of a random forests model suggested by Breiman (2004), which is very close to the original algorithm. We show in particular that the procedure is consistent and adapts to sparsity, in the sense that its rate of convergence depends only on the number of strong features and not on how many noise variables are present.
引用
收藏
页码:1063 / 1095
页数:33
相关论文
共 50 条
  • [41] Calibrating Random Forests
    Bostrom, Henrik
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 121 - 126
  • [42] On the asymptotics of random forests
    Scornet, Erwan
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 146 : 72 - 83
  • [43] Critical random forests
    Martin, James B.
    Yeo, Dominic
    ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2018, 15 (02): : 913 - 960
  • [44] Multivariate random forests
    Segal, Mark
    Xiao, Yuanyuan
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 80 - 87
  • [45] Random Tessellation Forests
    Ge, Shufei
    Wang, Shijia
    Teh, Yee Whye
    Wang, Liangliang
    Elliott, Lloyd T.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] Random Similarity Forests
    Piernik, Maciej
    Brzezinski, Dariusz
    Zawadzki, Pawel
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 53 - 69
  • [47] Random Forests in Chapel
    Albrecht, Benjamin
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 676 - 676
  • [48] Random Survival Forests
    Taylor, Jeremy M. G.
    JOURNAL OF THORACIC ONCOLOGY, 2011, 6 (12) : 1974 - 1975
  • [49] Neural Random Forests
    Biau, Gerard
    Scornet, Erwan
    Welbl, Johannes
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2019, 81 (02): : 347 - 386
  • [50] Dynamic Random Forests
    Bernard, Simon
    Adam, Sebastien
    Heutte, Laurent
    PATTERN RECOGNITION LETTERS, 2012, 33 (12) : 1580 - 1586