Measure inducing classification and regression trees for functional data

被引:8
|
作者
Belli, Edoardo [1 ]
Vantini, Simone [1 ]
机构
[1] Politecn Milan, MOX Dept Math, Milan, Italy
关键词
constrained convex optimization; decision trees; functional data analysis; high-dimensional data; splitting rule; weight function; SMOOTHING SPLINES ESTIMATORS; UNBIASED VARIABLE SELECTION; DECISION TREES; LEAST-SQUARES; FORESTS; MODELS;
D O I
10.1002/sam.11569
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a tree-based algorithm (mu CART) for classification and regression problems in the context of functional data analysis, which allows to leverage measure learning and multiple splitting rules at the node level, with the objective of reducing error while retaining the interpretability of a tree. For each internal node, our main contribution is the idea of learning a weighted functional L-2 space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the functional predictors, in order to determine the binary split. The approach is designed to manage multiple functional predictors and/or responses, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with additional scalar and categorical predictors, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional predictors defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real-world applications.
引用
收藏
页码:553 / 569
页数:17
相关论文
共 50 条
  • [31] The feature extraction of water-quality data by using classification and regression trees
    Kaihara, M
    Takahashi, N
    Sato, S
    Higuchi, Y
    Yusa, S
    [J]. BUNSEKI KAGAKU, 2003, 52 (05) : 331 - 336
  • [32] Classification and Regression Trees on Aggregate Data Modeling: An Application in Acute Myocardial Infarction
    Quantin, C.
    Billard, L.
    Touati, M.
    Andreu, N.
    Cottin, Y.
    Zeller, M.
    Afonso, F.
    Battaglia, G.
    Seck, D.
    Le Teuff, G.
    Diday, E.
    [J]. JOURNAL OF PROBABILITY AND STATISTICS, 2011, 2011
  • [33] Classification and regression trees: A powerful yet simple technique for ecological data analysis
    De'ath, G
    Fabricius, KE
    [J]. ECOLOGY, 2000, 81 (11) : 3178 - 3192
  • [34] Data reduction based on NN-kNN measure for NN classification and regression
    Shuang An
    Qinghua Hu
    Changzhong Wang
    Ge Guo
    Piyu Li
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 765 - 781
  • [35] Data reduction based on NN-kNN measure for NN classification and regression
    An, Shuang
    Hu, Qinghua
    Wang, Changzhong
    Guo, Ge
    Li, Piyu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (03) : 765 - 781
  • [36] A Comparison of Classification/Regression Trees and Logistic Regression in Failure Models
    Irimia-Dieguez, A. I.
    Blanco-Oliver, A.
    Vazquez-Cueto, M. J.
    [J]. 2ND GLOBAL CONFERENCE ON BUSINESS, ECONOMICS, MANAGEMENT AND TOURISM, 2015, 23 : 9 - 14
  • [37] A Comparison of Classification/Regression Trees and Logistic Regression in Failure Models
    Irimia-Dieguez, A. I.
    Blanco-Oliver, A.
    Vazquez-Cueto, M. J.
    [J]. 4TH WORLD CONFERENCE ON BUSINESS, ECONOMICS AND MANAGEMENT (WCBEM-2015), 2015, 26 : 23 - 28
  • [38] Predictive performance of missing data methods for logistic regression, classification trees and neural networks
    Schmid, CH
    Terrin, N
    Griffith, JL
    D'Agostino, RB
    Selker, HP
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2001, 71 (02) : 115 - 140
  • [39] Mapping wetlands using ASTER data: a comparison between classification trees and logistic regression
    Pantaleoni, E.
    Wynne, R. H.
    Galbraith, J. M.
    Campbell, J. B.
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2009, 30 (13) : 3423 - 3440
  • [40] A similarity measure to assess the stability of classification trees
    Briand, Benedicte
    Ducharme, Gilles R.
    Parache, Vanessa
    Mercat-Rommens, Catherine
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (04) : 1208 - 1217