Ordinal Variable Selection in Decision Trees

被引:0
|
作者
Kim, Hyunjoong [1 ]
机构
[1] Yonsei Univ, Dept Appl Stat, Shinchon Dong 134, Seoul 120749, South Korea
关键词
Decision Trees; Nonparametric statistics; Cramer-von Mises test; Ordinal variable; CART;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The most important component in decision tree algorithm is the rule for split variable selection. Many earlier algorithms such as CART and C4.5 use greedy search algorithm for variable selection. Recently, many methods were developed to cope with the weakness of greedy search algorithm. Most algorithms have different selection criteria depending on the type of variables: continuous or nominal. However, ordinal type variables are usually treated as continuous ones. This approach did not cause any trouble for the methods using greedy search algorithm. However, it may cause problems for the newer algorithms because they use statistical methods valid for continuous or nominal types only. In this paper, we propose a ordinal variable selection method that uses Cramer-von Mises testing procedure. We performed comparisons among CART, C4.5, QUEST, CRUISE, and the new method. It was shown that the new method has a good variable selection power for ordinal type variables.
引用
收藏
页码:149 / 161
页数:13
相关论文
共 50 条
  • [1] Ordinal Decision Trees
    HU Qinghua
    [J]. 浙江海洋大学学报(自然科学版), 2010, 29 (05) : 450 - 461
  • [2] Split criterions for variable selection using decision trees
    Abellan, Joaquin
    Masegosa, Andres R.
    [J]. SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, PROCEEDINGS, 2007, 4724 : 489 - +
  • [3] Growing decision trees in an ordinal setting
    Cao-Van, K
    De Baets, B
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2003, 18 (07) : 733 - 750
  • [4] Variable selection heuristics and optimum decision trees - An experimental study-
    Miyakawa, M
    Otsu, N
    Rosenberg, NG
    [J]. ISMVL 2002: 32ND IEEE INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC, PROCEEDINGS, 2002, : 238 - 244
  • [5] Using Decision Trees to Improve Variable Selection for Building Composite Indicators
    Otoiu, Adrian
    Titan, Emilia
    [J]. STATISTIKA-STATISTICS AND ECONOMY JOURNAL, 2020, 100 (03) : 296 - 308
  • [6] ORDINAL DECISION TREES BASED ON FUZZY RANK ENTROPY
    Wang, Xin
    Zhai, Junhai
    Chen, Jiankai
    Wang, Xizhao
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2015, : 208 - 213
  • [7] Variable Selection with Regression Trees
    Chang, Youngjae
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (02) : 357 - 366
  • [8] Random forest for ordinal responses: Prediction and variable selection
    Janitza, Silke
    Tutz, Gerhard
    Boulesteix, Anne-Laure
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 96 : 57 - 73
  • [9] Variable Importance using Decision Trees
    Kazemitabar, S. Jalil
    Amini, Arash A.
    Bloniarz, Adam
    Talwalkar, Ameet
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [10] Variable consistency monotonic decision trees
    Giove, S
    Greco, S
    Matarazzo, B
    Slowinski, R
    [J]. ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2002, 2475 : 247 - 254