Decision tree induction with a constrained number of leaf nodes

被引:29
|
作者
Wu, Chia-Chi [1 ]
Chen, Yen-Liang [2 ]
Liu, Yi-Hung [3 ]
Yang, Xiang-Yu [2 ]
机构
[1] Adv Res Inst, Inst Informat Ind, Taipei 115, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
[3] Shantou Univ, Sch Business, Shantou 515063, Guangdong, Peoples R China
关键词
Classification; Data mining; Decision tree; Constraint tree; EMOTION RECOGNITION; CLASSIFIER; ALGORITHMS; SYSTEM;
D O I
10.1007/s10489-016-0785-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advantages of being easy to understand and efficient to compute, the decision tree method has long been one of the most popular classifiers. Decision trees constructed with existing approaches, however, tend to be huge and complex, and consequently are difficult to use in practical applications. In this study, we deal with the problem of tree complexity by allowing users to specify the number of leaf nodes, and then construct a decision tree that allows maximum classification accuracy with the given number of leaf nodes. A new algorithm, the Size Constrained Decision Tree (SCDT), is proposed with which to construct a decision tree, paying close attention on how to efficiently use the limited number of leaf nodes. Experimental results show that the SCDT method can successfully generate a simpler decision tree and offers better accuracy.
引用
收藏
页码:673 / 685
页数:13
相关论文
共 50 条
  • [1] Decision tree induction with a constrained number of leaf nodes
    Chia-Chi Wu
    Yen-Liang Chen
    Yi-Hung Liu
    Xiang-Yu Yang
    [J]. Applied Intelligence, 2016, 45 : 673 - 685
  • [2] Performance Estimation of Decision Tree Excluding Insignificant Leaf Nodes
    Jeon, Hae Sook
    Lee, Won Don
    [J]. 2014 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2014, : 122 - 127
  • [3] Induction of decision tree with fuzzy number-valued attribute
    Huang, DM
    Yang, R
    Wang, XZ
    Ha, MH
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1446 - 1450
  • [4] Time-constrained cost-sensitive decision tree induction
    Chen, Yen-Liang
    Wu, Chia-Chi
    Tang, Kwei
    [J]. INFORMATION SCIENCES, 2016, 354 : 140 - 152
  • [5] ON THE AVERAGE NUMBER OF NODES IN A SUBTREE OF A TREE
    JAMISON, RE
    [J]. JOURNAL OF COMBINATORIAL THEORY SERIES B, 1983, 35 (03) : 207 - 223
  • [6] ON THE NUMBER OF INDEPENDENT SETS OF NODES IN A TREE
    DUTTON, R
    CHANDRASEKHARAN, N
    BRIGHAM, R
    [J]. FIBONACCI QUARTERLY, 1993, 31 (02): : 98 - 104
  • [7] Automated induction with constrained tree automata
    Bouhoula, Adel
    Jacquemard, Florent
    [J]. AUTOMATED REASONING, PROCEEDINGS, 2008, 5195 : 539 - 554
  • [8] Induction of ordinal decision tree
    Lee, JWT
    Liu, DZ
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 2220 - 2224
  • [9] κ-Anonymous decision tree induction
    Friedman, Arik
    Schuster, Assaf
    Wolff, Ran
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 151 - 162
  • [10] Decision tree induction with CBR
    Selvamani, BR
    Khemani, D
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 786 - 791