Multinomial random forest

被引:57
|
作者
Bai, Jiawang [1 ,2 ]
Li, Yiming [1 ]
Li, Jiawei [1 ]
Yang, Xue [1 ,2 ]
Jiang, Yong [1 ,2 ]
Xia, Shu-Tao [1 ,2 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Peng Cheng Lab, PCL Res Ctr Networks & Commun, Shenzhen, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Random forest; Consistency; Differential privacy; Classification; CONSISTENCY; ENSEMBLE;
D O I
10.1016/j.patcog.2021.108331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the impressive performance of random forests (RF), its theoretical properties have not been thoroughly understood. In this paper, we propose a novel RF framework, dubbed multinomial random forest (MRF), to analyze its consistency and privacy-preservation . Instead of deterministic greedy split rule or with simple randomness, the MRF adopts two impurity-based multinomial distributions to randomly select a splitting feature and a splitting value, respectively. Theoretically, we prove the consistency of MRF and analyze its privacy-preservation within the framework of differential privacy. We also demonstrate with multiple datasets that its performance is on par with the standard RF. To the best of our knowledge, MRF is the first consistent RF variant that has comparable performance to the standard RF. The code is available at https://github.com/jiawangbai/Multinomial- Random-Forest . (c) 2021 Published by Elsevier Ltd.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A note on the estimation of the multinomial logit model with random effects
    Chen, Z
    Kuo, L
    AMERICAN STATISTICIAN, 2001, 55 (02): : 89 - 95
  • [22] A SIMPLE ALGORITHM FOR GENERATING MULTINOMIAL RANDOM VECTORS WITH EXTRAVARIATION
    MOREL, JG
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1992, 21 (04) : 1255 - 1268
  • [23] Several generation methods of multinomial distributed random number
    Lei, Tian
    He, Linxi
    Zhang, Zhigang
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 98 - 102
  • [24] Multinomial-sampling models for random genetic drift
    Nagylaki, T
    GENETICS, 1997, 145 (02) : 485 - 491
  • [25] Sequence-Based Estimation of Multinomial Random Variables
    Oommen, B. John
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : XIII - XIV
  • [26] Multinomial distributions applied to random sampling of particulate materials
    Geelhoed, B
    Glass, HJ
    STATISTICA NEERLANDICA, 2002, 56 (01) : 58 - 76
  • [27] How common random numbers affect multinomial selection
    Miller, JO
    Bauer, KW
    PROCEEDINGS OF THE 1997 WINTER SIMULATION CONFERENCE, 1997, : 342 - 347
  • [28] Specific Random Trees for Random Forest
    Liu, Zhi
    Sun, Zhaocai
    Wang, Hongjun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03) : 739 - 741
  • [29] Generalized multinomial probit Model: Accommodating constrained random parameters
    Paleti, Rajesh
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2018, 118 : 248 - 262
  • [30] Nonparametric identification and estimation of random coefficients in multinomial choice models
    Fox, Jeremy T.
    Gandhi, Amit
    RAND JOURNAL OF ECONOMICS, 2016, 47 (01): : 118 - 139