Data-driven multinomial random forest: a new random forest variant with strong consistency

被引:3
|
作者
Chen, Junhao [1 ]
Wang, Xueli [1 ]
Lei, Fei [2 ]
机构
[1] Beijing Univ Technol, Sch Math Stat & Mech, Beijing, Peoples R China
[2] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
关键词
Random forest; Strong consistency; Classification; Regression; Machine learning; CLASSIFICATION;
D O I
10.1186/s40537-023-00874-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we modify the proof methods of some previously weakly consistent variants of random forest into strongly consistent proof methods, and improve the data utilization of these variants in order to obtain better theoretical properties and experimental performance. In addition, we propose the Data-driven Multinomial Random Forest (DMRF) algorithm, which has the same complexity with BreimanRF (proposed by Breiman) while satisfying strong consistency with probability 1. It has better performance in classification and regression tasks than previous RF variants that only satisfy weak consistency, and in most cases even surpasses BreimanRF in classification tasks. To the best of our knowledge, DMRF is currently a low-complexity and high-performing variation of random forest that achieves strong consistency with probability 1.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Data-driven multinomial random forest: a new random forest variant with strong consistency
    JunHao Chen
    XueLi Wang
    Fei Lei
    Journal of Big Data, 11
  • [2] Multinomial random forest
    Bai, Jiawang
    Li, Yiming
    Li, Jiawei
    Yang, Xue
    Jiang, Yong
    Xia, Shu-Tao
    PATTERN RECOGNITION, 2022, 122
  • [3] Data-driven disruption prediction using random forest in KSTAR
    Lee, Jeongwon
    Kim, Jayhyun
    Hahn, Sang-hee
    Han, Hyunsun
    Shin, Giwook
    Kim, Woong-Chae
    Yoon, Si-Woo
    FUSION ENGINEERING AND DESIGN, 2024, 199
  • [4] Data-driven random forest forecasting method of monthly electricity consumption
    Pang, Xinfu
    Luan, Changfeng
    Liu, Li
    Liu, Wei
    Zhu, Yuancheng
    ELECTRICAL ENGINEERING, 2022, 104 (04) : 2045 - 2059
  • [5] Data-driven random forest forecasting method of monthly electricity consumption
    Xinfu Pang
    Changfeng Luan
    Li Liu
    Wei Liu
    Yuancheng Zhu
    Electrical Engineering, 2022, 104 : 2045 - 2059
  • [6] Data-driven retrieval of spray details with random forest-based distance
    Peng, Chen
    Zhao, Zipeng
    Li, Chen
    Wang, Changbo
    Qin, Hong
    Quan, Hongyan
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2019, 30 (3-4)
  • [7] Data-driven Shoreline Change Forecasting on Eretan Beach Using Random Forest
    Gunawan, Putu Harry
    Iryanto
    Ghozali, Ahamd Lubis
    Ismantohadi, Eka
    Baizal, Z. K. Abdurahman
    Satrio, Ari
    2022 INTERNATIONAL CONFERENCE ON ADVANCED CREATIVE NETWORKS AND INTELLIGENT SYSTEMS, ICACNIS, 2022, : 105 - 108
  • [8] Data-Driven Fault Detection for SOFC system based on Random Forest and SVM
    Chen Meng-ting
    Fu Xiao-wei
    Deng Zhong-hua
    Li Xi
    Wu Xiao-long
    Xu Yuan-wu
    Xue Tao
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2829 - 2834
  • [9] A data-driven method of traffic emissions mapping with land use random forest models
    Wen, Yifan
    Wu, Ruoxi
    Zhou, Zihang
    Zhang, Shaojun
    Yang, Shengge
    Wallington, Timothy J.
    Shen, Wei
    Tan, Qinwen
    Deng, Ye
    Wu, Ye
    APPLIED ENERGY, 2022, 305
  • [10] Random Bits Forest: a Strong Classifier/Regressor for Big Data
    Yi Wang
    Yi Li
    Weilin Pu
    Kathryn Wen
    Yin Yao Shugart
    Momiao Xiong
    Li Jin
    Scientific Reports, 6