Multinomial random forest

被引:57
|
作者
Bai, Jiawang [1 ,2 ]
Li, Yiming [1 ]
Li, Jiawei [1 ]
Yang, Xue [1 ,2 ]
Jiang, Yong [1 ,2 ]
Xia, Shu-Tao [1 ,2 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Peng Cheng Lab, PCL Res Ctr Networks & Commun, Shenzhen, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Random forest; Consistency; Differential privacy; Classification; CONSISTENCY; ENSEMBLE;
D O I
10.1016/j.patcog.2021.108331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the impressive performance of random forests (RF), its theoretical properties have not been thoroughly understood. In this paper, we propose a novel RF framework, dubbed multinomial random forest (MRF), to analyze its consistency and privacy-preservation . Instead of deterministic greedy split rule or with simple randomness, the MRF adopts two impurity-based multinomial distributions to randomly select a splitting feature and a splitting value, respectively. Theoretically, we prove the consistency of MRF and analyze its privacy-preservation within the framework of differential privacy. We also demonstrate with multiple datasets that its performance is on par with the standard RF. To the best of our knowledge, MRF is the first consistent RF variant that has comparable performance to the standard RF. The code is available at https://github.com/jiawangbai/Multinomial- Random-Forest . (c) 2021 Published by Elsevier Ltd.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Intelligent classification of coal structure using multinomial logistic regression, random forest and fully connected neural network with multisource geophysical logging data
    Wang, Zihao
    Cai, Yidong
    Liu, Dameng
    Qiu, Feng
    Sun, Fengrui
    Zhou, Yingfang
    INTERNATIONAL JOURNAL OF COAL GEOLOGY, 2023, 268
  • [42] Using Random Forest To Model the Domain Applicability of Another Random Forest Model
    Sheridan, Robert P.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) : 2837 - 2850
  • [43] Modelling natural forest expansion on a landscape level by multinomial logistic regression
    Corona, P.
    Calvani, P.
    Mugnozza, G. S.
    Pompei, E.
    PLANT BIOSYSTEMS, 2008, 142 (03): : 509 - 517
  • [44] LIMIT-THEOREMS ARISING FROM SEQUENCES OF MULTINOMIAL RANDOM VECTORS
    PANGANIBAN, RL
    JOURNAL OF MULTIVARIATE ANALYSIS, 1989, 28 (02) : 204 - 210
  • [45] Boundary Crossing Random Walks, Clinical Trials, and Multinomial Sequential Estimation
    Bibbona, Enrico
    Rubba, Alessandro
    SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2012, 31 (01): : 99 - 107
  • [46] Random Effects Multinomial Processing Tree Models: A Maximum Likelihood Approach
    Steffen Nestler
    Edgar Erdfelder
    Psychometrika, 2023, 88 : 809 - 829
  • [47] Measuring learning in LISP: An application of the random coefficients multinomial logit model
    Draney, KL
    Wilson, M
    Pirolli, P
    OBJECTIVE MEASUREMENT: THEORY INTO PRACTICE, VOL 3, 1996, : 195 - 218
  • [48] A self-consistency approach to multinomial logit model with random effects
    Wang, Shufang
    Tsodikov, Alex
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (07) : 1939 - 1947
  • [49] Forest volume decompositions and Abel-Cayley-Hurwitz multinomial expansions
    Pitman, J
    JOURNAL OF COMBINATORIAL THEORY SERIES A, 2002, 98 (01) : 175 - 191
  • [50] A new approach to random utility mdeling using the Dirichlet multinomial distribution
    Shonkwiler, JS
    Hanley, N
    ENVIRONMENTAL & RESOURCE ECONOMICS, 2003, 26 (03): : 401 - 416