Building multi-way decision trees with numerical attributes

Cited by: 29
Authors
Berzal, F [1]
Cubero, JC [1]
Marín, N [1]
Sánchez, D [1]
Affiliation
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
Keywords
supervised learning; classification; decision trees; numerical attributes; hierarchical clustering
DOI
10.1016/j.ins.2003.09.018
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Decision trees are probably the most popular and commonly used classification model. They are built recursively following a top-down approach (from general concepts to particular examples) by repeated splits of the training dataset. When this dataset contains numerical attributes, binary splits are usually performed by choosing the threshold value that minimizes the impurity measure used as splitting criterion (e.g. the C4.5 gain ratio criterion or CART's Gini index). In this paper we propose the use of multi-way splits for continuous attributes in order to reduce the tree complexity without decreasing classification accuracy. This can be done by intertwining a hierarchical clustering algorithm with the usual greedy decision tree learning. © 2003 Elsevier Inc. All rights reserved.
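As a rough illustration of the conventional binary split that the abstract contrasts with multi-way splits, the sketch below scans candidate thresholds on one numerical attribute and keeps the one with the lowest weighted Gini impurity. It is not the method proposed in the paper; the function names `gini` and `best_binary_threshold` and the midpoint choice for candidate thresholds are assumptions made for this example.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels: 1 - sum of squared class shares."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_binary_threshold(values, labels):
    """Return (threshold, weighted_impurity) for the best split `value <= threshold`.

    Hypothetical helper: candidate thresholds are midpoints between
    consecutive distinct sorted values (an assumption of this sketch).
    Returns (None, inf) when all values are equal and no split exists.
    """
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    n = len(pairs)
    best = (None, float("inf"))
    for i in range(1, n):
        lo, hi = pairs[i - 1][0], pairs[i][0]
        if lo == hi:
            continue  # no threshold can separate equal values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        # Impurity of the two branches, weighted by branch size.
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best[1]:
            best = ((lo + hi) / 2, score)
    return best


if __name__ == "__main__":
    threshold, impurity = best_binary_threshold(
        [1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"]
    )
    print(threshold, impurity)
```

A multi-way split would replace the single threshold with several cut points on the same attribute; how those intervals are chosen via hierarchical clustering is described in the paper itself and is not reproduced here.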
Pages: 73-90
Page count: 18
Related papers
50 in total
  • [1] Boosting with multi-way branching in decision trees
    Mansour, Y
    McAllester, D
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 300 - 306
  • [2] MML inference of decision graphs with multi-way joins and dynamic attributes
    Tan, PJ
    Dowe, DL
    [J]. AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 269 - 281
  • [3] Maintaining optimal multi-way splits for numerical attributes in data streams
    Elomaa, Tapio
    Lehtinen, Petri
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 544 - 553
  • [4] A simple boosting algorithm using multi-way branching decision trees
    Hatano, K
    [J]. THEORY OF COMPUTING SYSTEMS, 2004, 37 (04) : 503 - 518
  • [5] A Simple Boosting Algorithm Using Multi-Way Branching Decision Trees
    Kohei Hatano
    [J]. Theory of Computing Systems, 2004, 37 : 503 - 518
  • [6] Multi-way space partitioning trees
    Duncan, CA
    [J]. ALGORITHMS AND DATA STRUCTURES, PROCEEDINGS, 2003, 2748 : 219 - 230
  • [7] OPTIMAL MULTI-WAY SEARCH-TREES
    GOTLIEB, L
    [J]. SIAM JOURNAL ON COMPUTING, 1981, 10 (03) : 422 - 433
  • [8] Relaxed multi-way trees with group updates
    Larsen, KS
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 657 - 670
  • [9] Fuzzy decision trees and numerical attributes
    Zeidler, J
    Schlosser, M
    Ittner, A
    Posthoff, C
    [J]. FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 1996, : 985 - 990
  • [10] Numerical attributes in decision trees: A hierarchical approach
    Berzal, F
    Cubero, JC
    Marín, N
    Sánchez, D
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS V, 2003, 2810 : 198 - 207