A Unified Framework for Decision Tree on Continuous Attributes

Cited by: 5
Authors
Yan, Jianjian [1]
Zhang, Zhongnan [1]
Xie, Lingwei [1]
Zhu, Zhantu [1]
Affiliation
[1] Xiamen Univ, Software Sch, Xiamen 361005, Peoples R China
Source
IEEE ACCESS | 2019 / Vol. 7
Keywords
Decision tree; classification; unified framework; split criteria; CLASSIFICATION; NETWORKS; SVM;
DOI
10.1109/ACCESS.2019.2892083
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The standard decision tree algorithms and their derived methods are usually built on frequency information. However, they still face a dilemma, or multichotomous question, for continuous attributes when two or more candidate cut points have a splitting performance equal or close to the optimal value, such as the maximal information gain ratio or the minimal Gini index. In this paper, we propose a unified framework model to deal with this question. We then design two algorithms based on Splitting Performance and the number of Expected Segments, called SPES1 and SPES2, which determine the optimal cut point as follows. First, several candidate cut points are selected whose splitting performances are the closest to the optimal. Second, we compute the number of expected segments for each candidate cut point. Finally, we combine these two measures by introducing a weighting factor alpha to determine the optimal one among the candidate cut points. To validate the effectiveness of our methods, we evaluate them on 25 benchmark datasets. The experimental results demonstrate that the classification accuracies of the proposed algorithms exceed those of the current state-of-the-art methods for tackling the multichotomous question, by about 5% in some cases. In particular, under the proposed methods, the number of candidate cut points converges to a certain extent.
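The three steps in the abstract can be illustrated with a rough sketch. This is not the SPES1 or SPES2 algorithm: the abstract does not define how the number of expected segments is computed or how alpha enters the combination, so everything below is an assumption made for illustration. The sketch uses the Gini index as the splitting performance, keeps the `k` best-scoring cuts as candidates, takes the segment measure as a caller-supplied function (`label_runs` is a toy placeholder), and combines the two measures as a plain weighted sum in which lower is better. All function and parameter names are hypothetical.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def split(values, labels, cut):
    """Partition the labels by whether the attribute value is <= cut."""
    left = [y for x, y in zip(values, labels) if x <= cut]
    right = [y for x, y in zip(values, labels) if x > cut]
    return left, right


def split_gini(values, labels, cut):
    """Size-weighted Gini index of the two children (lower is better)."""
    left, right = split(values, labels, cut)
    n = len(labels)
    return len(left) / n * gini(left) + len(right) / n * gini(right)


def label_runs(left, right):
    """Toy placeholder for the segment measure: runs of equal labels per child.

    ASSUMPTION: the paper's definition of "expected segments" is not given
    in the abstract; this stand-in only makes the sketch executable.
    """
    def runs(ys):
        return sum(1 for i, y in enumerate(ys) if i == 0 or y != ys[i - 1])
    return runs(left) + runs(right)


def choose_cut(values, labels, alpha=0.5, k=3, segments=label_runs):
    """Pick a cut point for one continuous attribute.

    Assumes at least two distinct attribute values.
    """
    pairs = sorted(zip(values, labels))
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    # Possible cuts: midpoints between consecutive distinct values.
    cuts = [(a + b) / 2 for a, b in zip(xs, xs[1:]) if a != b]
    # Step 1: keep the k cuts whose Gini index is closest to the optimum.
    candidates = sorted((split_gini(xs, ys, c), c) for c in cuts)[:k]

    # Steps 2 and 3: compute the segment measure and combine it with the
    # splitting performance through alpha (assumed form: weighted sum).
    def combined(item):
        score, cut = item
        return alpha * score + (1 - alpha) * segments(*split(xs, ys, cut))

    return min(candidates, key=combined)[1]


values = [1, 2, 3, 4, 5, 6, 7, 8]
labels = list("aababaaa")
print(choose_cut(values, labels, alpha=1.0))  # -> 5.5 (pure Gini choice)
```

With `alpha=1.0` the segment measure is ignored and the sketch reduces to the ordinary minimal-Gini choice; smaller values of alpha let the secondary measure break ties or near-ties among the candidates. The two terms are not normalized here, so their relative scale is another simplification.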
Pages: 11924 - 11933
Number of pages: 10
Related papers
50 in total
  • [31] Discretizing Numerical Attributes in Decision Tree for Big Data Analysis
    Zhang, Yiqun
    Cheung, Yiu-Ming
    2014 IEEE International Conference on Data Mining Workshop (ICDMW), 2014, : 1150 - 1156
  • [32] Constructing X-of-N attributes for decision tree learning
    Zheng, ZJ
    MACHINE LEARNING, 2000, 40 (01) : 35 - 75
  • [33] A lazy algorithm for decision tree induction based on importance of attributes
    Wang, JF
    Wang, XZ
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1549 - 1552
  • [34] Grow and Merge: A Unified Framework for Continuous Categories Discovery
    Zhang, Xinwei
    Jiang, Jianwen
    Feng, Yutong
    Wu, Zhi-fan
    Zhao, Xibin
    Wan, Hai
    Tang, Mingqian
    Jin, Rong
    Gao, Yue
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [35] A DECISION-TREE BASED CONTINUOUS LEARNING FRAMEWORK FOR REAL-TIME PREDICTION OF RUNWAY CAPACITIES
    Andy, Lam Jun Guang
    Alam, Sameer
    Piplani, Rajesh
    Lilith, Nimrod
    Dhief, Imen
    2021 INTEGRATED COMMUNICATIONS NAVIGATION AND SURVEILLANCE CONFERENCE (ICNS), 2021,
  • [36] Online Adaptive Clustering in a Decision Tree Framework
    Basak, Jayanta
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3626 - 3629
  • [37] A Decision Tree Framework for Spatiotemporal Sequence Prediction
    Kim, Taehwan
    Yue, Yisong
    Taylor, Sarah
    Matthews, Iain
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 577 - 586
  • [38] Bayesian evidence framework for decision tree learning
    Chatpatanasiri, R
    Kijsirikul, B
    Bayesian Inference and Maximum Entropy Methods in Science and Engineering, 2005, 803 : 88 - 95
  • [39] A pairwise decision tree framework for hyperspectral classification
    Chen, J.
    Wang, R.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2007, 28 (12) : 2821 - 2830
  • [40] A unified framework for tree search decoding: Rediscovering the sequential decoder
    Murugan, AD
    El Gamal, H
    Damen, MO
    Caire, G
    2005 IEEE 6th Workshop on Signal Processing Advances in Wireless Communications, 2005, : 761 - 765