Analytical Split Value Calculation for Numerical Attributes in Hoeffding Trees with Misclassification-Based Impurity

Cited by: 0
Authors
Mirkhan M. [1]
Amir Haeri M. [1]
Meybodi M.R. [1]
Affiliations
[1] Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran
Keywords
Gaussian distribution; Hoeffding tree; Massive and streaming data; Misclassification error
DOI
10.1007/s40745-019-00225-4
Abstract
The Hoeffding tree is a method for building decision trees incrementally. A common approach to handling numerical attributes in Hoeffding trees is to represent their sufficient statistics as Gaussian distributions. Our contribution in this paper is to prove that, when Gaussian distributions serve as the sufficient statistics and misclassification error as the impurity measure, the best splitting values can be calculated exactly by an analytical method. Three different approaches for using this theorem are proposed, and all three are tested on both synthetic and real datasets. The experiments suggest that this approach can create smaller trees, learn faster, and achieve higher accuracy in most problems. © 2019, Springer-Verlag GmbH Germany, part of Springer Nature.
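The abstract does not state the theorem itself, so the following is only a minimal sketch of one way such an analytical split can arise, under stated assumptions: two classes, one numerical attribute, and each class summarized by a count, mean, and standard deviation. Under those assumptions, the misclassification error of a threshold split is piecewise smooth, and its interior stationary points lie where the count-weighted Gaussian densities are equal, which reduces to a quadratic equation. All function names below (`candidate_splits`, `split_error`, `best_split`) are hypothetical and not taken from the paper, and the paper's three proposed approaches are not reproduced here.

```python
import math


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of N(mu, sigma^2) at x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def candidate_splits(n1, mu1, s1, n2, mu2, s2, eps=1e-12):
    """Points x where n1 * N(x; mu1, s1) equals n2 * N(x; mu2, s2).

    Taking logarithms of both weighted densities yields a*x^2 + b*x + c = 0.
    """
    a = 1.0 / (2.0 * s2 ** 2) - 1.0 / (2.0 * s1 ** 2)
    b = mu1 / s1 ** 2 - mu2 / s2 ** 2
    c = (mu2 ** 2 / (2.0 * s2 ** 2) - mu1 ** 2 / (2.0 * s1 ** 2)
         + math.log((n1 * s2) / (n2 * s1)))
    if abs(a) < eps:
        # Equal variances: the equation is linear (or has no solution).
        return [] if abs(b) < eps else [-c / b]
    disc = b * b - 4.0 * a * c
    if disc < 0.0:
        # The weighted densities never cross.
        return []
    root = math.sqrt(disc)
    return sorted([(-b - root) / (2.0 * a), (-b + root) / (2.0 * a)])


def split_error(t, n1, mu1, s1, n2, mu2, s2):
    """Expected misclassification rate when each side of the threshold t
    predicts its majority class."""
    left1 = n1 * normal_cdf(t, mu1, s1)
    left2 = n2 * normal_cdf(t, mu2, s2)
    right1, right2 = n1 - left1, n2 - left2
    return (min(left1, left2) + min(right1, right2)) / (n1 + n2)


def best_split(n1, mu1, s1, n2, mu2, s2):
    """Return the candidate threshold with the lowest error, or None."""
    candidates = candidate_splits(n1, mu1, s1, n2, mu2, s2)
    if not candidates:
        return None
    return min(candidates,
               key=lambda t: split_error(t, n1, mu1, s1, n2, mu2, s2))


if __name__ == "__main__":
    # Two equally sized classes with unit variance and means 0 and 2.
    t = best_split(100, 0.0, 1.0, 100, 2.0, 1.0)
    print(t, split_error(t, 100, 0.0, 1.0, 100, 2.0, 1.0))
```

The point of the sketch is that no grid of candidate thresholds has to be evaluated: at most two closed-form candidates are checked per attribute. Extending this to more than two classes would require comparing every pair of class densities, which this sketch does not attempt.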
Pages: 645-665
Page count: 20