Towards Convergence Rate Analysis of Random Forests for Classification

被引:0
|
作者
Gao, Wei [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
关键词
REGRESSION; RECOGNITION; HEIGHT; TREES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random forests have been one of the successful ensemble algorithms in machine learning. The basic idea is to construct a large number of random trees individually and make prediction based on an average of their predictions. The great successes have attracted much attention on the consistency of random forests, mostly focusing on regression. This work takes one step towards convergence rates of random forests for classification. We present the first finite- sample rate O(n(-1/(8d+2))) on the convergence of pure random forests for classification, which can be improved to be of O(n(-1/(3.87d+2))) by considering the midpoint splitting mechanism. We introduce another variant of random forests, which follow Breiman's original random forests but with different mechanisms on splitting dimensions and positions. We get a convergence rate O(n(-1/(d+2))(ln n)(1/(d+2))) for the variant of random forests, which reaches the minimax rate, except for a factor (ln n)(1/(d+2)), of the optimal plug-in classifier under the L-Lipschitz assumption. We achieve tighter convergence rate O(J ln n/n) under proper assumptions over structural data.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Data Calibration Based on Multisensor Using Classification Analysis: A Random Forests Approach
    Xing, Xue
    Yu, Dexin
    Zhang, Wei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [32] ENSEMBLE DIVERSITY ANALYSIS ON REMOTE SENSING DATA CLASSIFICATION USING RANDOM FORESTS
    Boukir, Samia
    Mellor, Andrew
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1302 - 1306
  • [33] Weak convergence and rate of convergence of MIMO capacity random variable
    Raghavan, Vasanthan
    Sayeed, Akbar M.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (08) : 3799 - 3809
  • [34] Rate of Convergence Towards Hartree Dynamics
    Chen, Li
    Lee, Ji Oon
    Schlein, Benjamin
    JOURNAL OF STATISTICAL PHYSICS, 2011, 144 (04) : 872 - 903
  • [35] Rate of Convergence Towards Hartree Dynamics
    Li Chen
    Ji Oon Lee
    Benjamin Schlein
    Journal of Statistical Physics, 2011, 144 : 872 - 903
  • [36] Cautious Classification with Data Missing Not at Random Using Generative Random Forests
    Llerena, Julissa Villanueva
    Maua, Denis Deratani
    Antonucci, Alessandro
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2021, 2021, 12897 : 284 - 298
  • [37] Random Forests and Networks Analysis
    Avena, Luca
    Castell, Fabienne
    Gaudilliere, Alexandre
    Melot, Clothilde
    JOURNAL OF STATISTICAL PHYSICS, 2018, 173 (3-4) : 985 - 1027
  • [38] Analysis of a random forests model
    Biau, Gérard
    Journal of Machine Learning Research, 2012, 13 : 1063 - 1095
  • [39] Random Forests and Networks Analysis
    Luca Avena
    Fabienne Castell
    Alexandre Gaudillière
    Clothilde Mélot
    Journal of Statistical Physics, 2018, 173 : 985 - 1027
  • [40] Analysis of a Random Forests Model
    Biau, Gerard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 1063 - 1095