The impact of the change in the splitting method of decision trees on the prediction power

被引:0
|
作者
Chang, Youngjae [1 ,2 ]
机构
[1] Korea Natl Open Univ, Dept Stat & Data Sci, Seoul, South Korea
[2] Korea Natl Open Univ, Dept Stat & Data Sci, 86 Daehak Ro, Seoul 03087, South Korea
关键词
data mining; decision tree; prediction power; split variable;
D O I
10.5351/KJAS.2022.35.4.517
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the era of big data, various data mining techniques have been proposed as major analysis methodologies. As complex and diverse data is mass-produced, data mining techniques have attracted attention as a method that forms the foundation of data science. In this paper, we focused on the decision tree, which is frequently used in practice and easy to understand as one of representative data mining methods. Specifically, we analyzed the effect of the splitting method of decision trees on the model performance. We compared the prediction power and structures of decision tree models with different split methods based on various simulated data. The results show that the linear combination split method can improve the prediction accuracy of decision trees in the case of data simulated from nonlinear models with complex structure.
引用
收藏
页码:517 / 525
页数:9
相关论文
共 50 条
  • [1] Just Change on Change: Adaptive Splitting Time for Decision Trees in Data Stream Classification
    Assis, Daniel Nowak
    Barddal, Jean Paul
    Enembreck, Fabricio
    [J]. 39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 351 - 357
  • [2] A METHOD FOR EARLY-SPLITTING OF HEVC INTER BLOCKS BASED ON DECISION TREES
    Correa, Guilherme
    Assuncao, Pedro
    Agostini, Luciano
    da Silva Cruz, Luis A.
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 276 - 280
  • [3] A Discriminative Splitting Criterion for Phonetic Decision Trees
    Wiesler, Simon
    Heigold, Georg
    Nussbaum-Thom, Markus
    Schlueter, Ralf
    Ney, Hermann
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 54 - 57
  • [4] A discriminative splitting criterion for phonetic decision trees
    Wiesler, Simon
    Heigold, Georg
    Nußbaum-Thom, Markus
    Schlüter, Ralf
    Ney, Hermann
    [J]. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 2010, : 54 - 57
  • [5] A new metric splitting criterion for decision trees
    Simovici, Dan A.
    Jaroszewicz, Szymon
    [J]. INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2006, 21 (04) : 239 - 256
  • [6] Statistical analysis of various splitting criteria for decision trees
    Aaboub, Fadwa
    Chamlal, Hasna
    Ouaderhman, Tayeb
    [J]. JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [7] An eager splitting strategy for online decision trees in ensembles
    Manapragada, Chaitanya
    Gomes, Heitor M.
    Salehi, Mahsa
    Bifet, Albert
    Webb, Geoffrey, I
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (02) : 566 - 619
  • [8] Splitting Choice and Computational Complexity Analysis of Decision Trees
    Zhao, Xin
    Nie, Xiaokai
    [J]. ENTROPY, 2021, 23 (10)
  • [9] Implementation and Evaluation of Decision Trees with Range and Region Splitting
    Yasuhiko Morimoto
    Takeshi Fukuda
    Shinichi Morishita
    Takeshi Tokuyama
    [J]. Constraints, 1997, 2 (3-4) : 401 - 427
  • [10] Splitting with Confidence in Decision Trees with Application to Stream Mining
    De Rosa, Rocco
    Cesa-Bianchi, Nicolo
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,