The use of vicinal-risk minimization for training decision trees

Cited by: 8
Authors
Cao, Yilong [1]
Rockett, Peter I. [1]
Affiliation
[1] Univ Sheffield, Dept Elect & Elect Engn, Sheffield S1 3JD, S Yorkshire, England
Keywords
Decision trees; Vicinal-risk minimization; Classification; CLASSIFIERS
DOI
10.1016/j.asoc.2015.02.043
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose the use of Vapnik's vicinal risk minimization (VRM) for training decision trees to approximately maximize decision margins. We implement VRM by propagating uncertainties in the input attributes into the labeling decisions, thereby performing a global regularization over the decision tree structure. During training, a decision tree is constructed to minimize the total probability of misclassifying the labeled training examples, a process that approximately maximizes the margins of the resulting classifier. We perform the necessary minimization using a meta-heuristic (genetic programming) and present results over a range of synthetic and real benchmark datasets. We demonstrate the statistical superiority of VRM training over both conventional empirical risk minimization (ERM) and the well-known C4.5 algorithm on these datasets. We also conclude that there is no statistical difference between trees trained by ERM and those trained using C4.5. Training with VRM is shown to be more stable and repeatable than training with ERM. (C) 2015 Elsevier B.V. All rights reserved.
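The idea of propagating input uncertainty into the labeling decision can be illustrated with a small sketch. This is not the authors' implementation: it assumes axis-parallel splits, an isotropic Gaussian vicinity of width `sigma` around each training point, and a hypothetical tuple-based tree encoding, and it leaves out the genetic-programming search entirely. Under those assumptions each leaf is a hyper-rectangle, so the probability mass a point's vicinity places in a leaf is a product of Gaussian CDF differences, and the vicinal risk is the mean probability of landing in a wrongly labeled leaf.

```python
import math

def norm_cdf(z):
    """Standard normal CDF; handles +/- infinity."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Hypothetical tree encoding (an assumption for this sketch):
#   ("leaf", label)
#   ("split", feature_index, threshold, left_subtree, right_subtree)
# A sample goes left when x[feature_index] < threshold.

def leaf_boxes(tree, bounds=None):
    """Yield (label, bounds) per leaf; bounds maps feature -> (lo, hi)."""
    bounds = bounds or {}
    if tree[0] == "leaf":
        yield tree[1], bounds
        return
    _, f, t, left, right = tree
    lo, hi = bounds.get(f, (-math.inf, math.inf))
    yield from leaf_boxes(left, {**bounds, f: (lo, min(hi, t))})
    yield from leaf_boxes(right, {**bounds, f: (max(lo, t), hi)})

def misclassification_prob(tree, x, y, sigma):
    """Probability that a point drawn from N(x, sigma^2 I) gets a label != y."""
    p_correct = 0.0
    for label, bounds in leaf_boxes(tree):
        if label != y:
            continue
        mass = 1.0
        for f, (lo, hi) in bounds.items():
            upper = norm_cdf((hi - x[f]) / sigma)
            lower = norm_cdf((lo - x[f]) / sigma)
            mass *= max(0.0, upper - lower)  # empty boxes carry no mass
        p_correct += mass
    return 1.0 - p_correct

def predict(tree, x):
    """Ordinary hard prediction, used for the empirical (0/1) risk."""
    while tree[0] == "split":
        _, f, t, left, right = tree
        tree = left if x[f] < t else right
    return tree[1]

def empirical_risk(tree, X, Y):
    return sum(predict(tree, x) != y for x, y in zip(X, Y)) / len(X)

def vicinal_risk(tree, X, Y, sigma):
    return sum(misclassification_prob(tree, x, y, sigma)
               for x, y in zip(X, Y)) / len(X)

# A one-split tree and a point sitting exactly on the decision boundary.
stump = ("split", 0, 0.0, ("leaf", 0), ("leaf", 1))
X, Y = [[0.0], [5.0]], [1, 1]
erm = empirical_risk(stump, X, Y)          # both points are classified correctly
vrm = vicinal_risk(stump, X, Y, sigma=1.0) # the boundary point is penalized
```

In this toy case the empirical risk is zero even though one point lies on the split, whereas the vicinal risk charges that point a misclassification probability of 0.5. Minimizing the vicinal quantity therefore pushes thresholds away from training points, which is the margin-maximizing behavior the abstract describes.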
Pages: 185-195
Number of pages: 11