Model selection in omnivariate decision trees using Structural Risk Minimization

Cited by: 10
Author
Yildiz, Olcay Taner [1]
Affiliation
[1] Isik Univ, Dept Comp Engn, TR-34980 Istanbul, Turkey
Keywords
Classification; Machine learning; Model selection; VC-dimension; Structural Risk Minimization; Decision tree; Construction; Induction; Dimension
DOI
10.1016/j.ins.2011.07.028
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As opposed to trees that use a single type of decision node, an omnivariate decision tree contains nodes of different types. We propose to use Structural Risk Minimization (SRM) to choose between node types in omnivariate decision tree construction, so that the complexity of a node matches the complexity of the data reaching that node. To apply SRM for model selection, one needs the VC-dimension of the candidate models. In this paper, we first derive the VC-dimension of the univariate model and estimate the VC-dimension of all three models (univariate, linear multivariate and quadratic multivariate) experimentally. Second, we compare SRM with other model selection techniques, including Akaike's Information Criterion (AIC), Bayesian Information Criterion (BIC) and cross-validation (CV), on standard datasets from the UCI and Delve repositories. We see that SRM induces omnivariate trees that have a small percentage of multivariate nodes, located close to the root, and that generalize better than, or at least as accurately as, trees constructed using the other model selection techniques. © 2011 Published by Elsevier Inc.
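The abstract does not state the bound used for SRM, so the following is only a minimal sketch of how SRM-based node selection could work. It assumes one common form of the VC generalization bound (the one popularized by Cherkassky and Mulier), with constants `a1`, `a2` and confidence term `nu` chosen here for illustration. The function names, the training errors, and the VC-dimension values in the usage example are hypothetical and are not taken from the paper.

```python
import math


def srm_bound(train_error, vc_dim, n, a1=1.0, a2=1.0, nu=None):
    """Upper bound on generalization error for a model with VC-dimension
    vc_dim trained on n samples (assumed form, not the paper's stated one).

    bound = E_t + (eps / 2) * (1 + sqrt(1 + 4 * E_t / eps))
    eps   = a1 * (h * (ln(a2 * n / h) + 1) - ln(nu)) / n
    """
    if nu is None:
        nu = 1.0 / math.sqrt(n)  # a conventional choice, assumed here
    eps = a1 * (vc_dim * (math.log(a2 * n / vc_dim) + 1.0) - math.log(nu)) / n
    return train_error + (eps / 2.0) * (1.0 + math.sqrt(1.0 + 4.0 * train_error / eps))


def select_node_type(candidates, n):
    """Pick the candidate with the smallest bound.

    candidates maps a node-type name to (train_error, vc_dim).
    """
    return min(candidates, key=lambda name: srm_bound(*candidates[name], n))


# Hypothetical node with d = 10 inputs. The VC-dimension of a linear
# discriminant is d + 1 = 11; 66 is the parameter count of a quadratic
# discriminant, used as an upper bound; 5 is only a placeholder for the
# univariate node, whose actual value the paper derives.
small_sample = {
    "univariate": (0.20, 5),
    "linear": (0.18, 11),
    "quadratic": (0.17, 66),
}
large_sample = {
    "univariate": (0.40, 5),
    "linear": (0.05, 11),
    "quadratic": (0.04, 66),
}
print(select_node_type(small_sample, n=200))
print(select_node_type(large_sample, n=2000))
```

With few samples the complexity term dominates and the simplest node wins despite its slightly higher training error; with more data and a large error gap, a more complex node can be justified. This is consistent with the abstract's observation that multivariate nodes appear near the root, where the most data is available.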
Pages: 5214-5226
Page count: 13