Measure inducing classification and regression trees for functional data

被引:8
|
作者
Belli, Edoardo [1 ]
Vantini, Simone [1 ]
机构
[1] Politecn Milan, MOX Dept Math, Milan, Italy
关键词
constrained convex optimization; decision trees; functional data analysis; high-dimensional data; splitting rule; weight function; SMOOTHING SPLINES ESTIMATORS; UNBIASED VARIABLE SELECTION; DECISION TREES; LEAST-SQUARES; FORESTS; MODELS;
D O I
10.1002/sam.11569
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a tree-based algorithm (mu CART) for classification and regression problems in the context of functional data analysis, which allows to leverage measure learning and multiple splitting rules at the node level, with the objective of reducing error while retaining the interpretability of a tree. For each internal node, our main contribution is the idea of learning a weighted functional L-2 space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the functional predictors, in order to determine the binary split. The approach is designed to manage multiple functional predictors and/or responses, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with additional scalar and categorical predictors, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional predictors defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real-world applications.
引用
收藏
页码:553 / 569
页数:17
相关论文
共 50 条