Discriminant analysis and factorial multiple splits in recursive partitioning for data mining

被引:0
|
作者
Mola, F
Siciliano, R
机构
[1] Univ Cagliari, Dipartimento Econ, I-09100 Cagliari, Italy
[2] Univ Naples Federico II, Dipartimento Matemat & Stat, I-80126 Naples, Italy
来源
MULTIPLE CLASSIFIER SYSTEMS | 2002年 / 2364卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The framework of this paper is supervised statistical learning in data raining. In particular, multiple sets of inputs are used to predict an output on the basis of a training set. A typical data mining problem is to deal with large sets of within-groups correlated inputs compared to the number of observed objects. Standard tree-based procedures offer unstable and not interpretable solutions especially in case of complex relationships. For that multiple splits defined upon a suitable combination of inputs are required. This paper provides a methodology to build up a tree-based model which nodes splitting is due to factorial multiple splitting variables. A recursive partitioning algorithm is introduced considering a two-stage splitting criterion based on linear discriminant functions. As a result, an automated and fast procedure allows to look for factorial multiple splits able to capture suitable directions in the variability among the sets of inputs. Real world applications are discussed and the results of a simulation study are shown to describe fruitful properties of the proposed methodology.
引用
收藏
页码:118 / 126
页数:9
相关论文
共 50 条
  • [1] Canonical variates for recursive partitioning in data mining
    Cappelli, C
    Conversano, C
    [J]. COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2002, : 213 - 218
  • [2] Data mining for response shift patterns in multiple sclerosis patients using recursive partitioning tree analysis
    Li, Yuelin
    Schwartz, Carolyn E.
    [J]. QUALITY OF LIFE RESEARCH, 2011, 20 (10) : 1543 - 1553
  • [3] Data mining for response shift patterns in multiple sclerosis patients using recursive partitioning tree analysis
    Yuelin Li
    Carolyn E. Schwartz
    [J]. Quality of Life Research, 2011, 20 : 1543 - 1553
  • [4] A ROBUST RECURSIVE PARTITIONING ALGORITHM FOR MINING MULTIPLE POPULATIONS
    Alvir, J.
    Cabrera, J.
    Caridi, F.
    Nguyen, H.
    Roberts, C.
    [J]. FRONTIERS OF APPLIED AND COMPUTATIONAL MATHEMATICS, 2008, : 75 - +
  • [5] Erratum to: Data mining for response shift patterns in multiple sclerosis patients using recursive partitioning tree analysis
    Yuelin Li
    Carolyn E. Schwartz
    [J]. Quality of Life Research, 2012, 21 : 183 - 183
  • [6] Gliomas classification by multivariate analysis of in vivo MRI/MRSI data based on recursive partitioning tree and discriminant analysis
    Li, XJ
    Ying, L
    Nelson, SJ
    [J]. 2002 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, PROCEEDINGS, 2002, : 209 - 212
  • [7] Data mining for response shift patterns in multiple sclerosis patients using recursive partitioning tree analysis (vol 20, pg 1543, 2011)
    Li, Yuelin
    Schwartz, Carolyn E.
    [J]. QUALITY OF LIFE RESEARCH, 2012, 21 (01) : 183 - 183
  • [9] RESEARCH ON MOST DISCRIMINANT CHARACTERISTICS BY DISCRIMINANT FACTORIAL ANALYSIS
    SMADJA, J
    GAYDOU, EM
    [J]. ANALUSIS, 1991, 19 (01) : 39 - 40
  • [10] Application of multiple-Y recursive partitioning methods to mining large databases.
    Roush, DM
    Zhang, L
    Stockfisch, TP
    Rao, SN
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 223 : U501 - U501