Measure inducing classification and regression trees for functional data

被引：8

作者：

Belli, Edoardo ^{[1
]}

Vantini, Simone ^{[1
]}

机构：

[1] Politecn Milan, MOX Dept Math, Milan, Italy

来源：

STATISTICAL ANALYSIS AND DATA MINING | 2022年 / 15卷 / 05期

关键词：

constrained convex optimization; decision trees; functional data analysis; high-dimensional data; splitting rule; weight function; SMOOTHING SPLINES ESTIMATORS; UNBIASED VARIABLE SELECTION; DECISION TREES; LEAST-SQUARES; FORESTS; MODELS;

D O I：

10.1002/sam.11569

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a tree-based algorithm (mu CART) for classification and regression problems in the context of functional data analysis, which allows to leverage measure learning and multiple splitting rules at the node level, with the objective of reducing error while retaining the interpretability of a tree. For each internal node, our main contribution is the idea of learning a weighted functional L-2 space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the functional predictors, in order to determine the binary split. The approach is designed to manage multiple functional predictors and/or responses, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with additional scalar and categorical predictors, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional predictors defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real-world applications.

引用

页码：553 / 569

页数：17

共 50 条

[1] Missing data imputation using classification and regression trees
Chen, Cheng-Yang
Chang, Yu-Wei
[J]. PEERJ COMPUTER SCIENCE, 2024, 10
[2] Classification and regression trees
Martin Krzywinski
Naomi Altman
[J]. Nature Methods, 2017, 14 : 757 - 758
[3] Classification and regression trees
Speybroeck, N.
[J]. INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2012, 57 (01) : 243 - 246
[4] Classification and regression trees
Loh, Wei-Yin
[J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 14 - 23
[5] Classification and regression trees
Krzywinski, Martin
Altman, Naomi
[J]. NATURE METHODS, 2017, 14 (08) : 755 - 756
[6] Optimal classification and nonparametric regression for functional data
Meister, Alexander
[J]. BERNOULLI, 2016, 22 (03) : 1729 - 1744
[7] The Application of Classification and Regression Trees Algorithm in the Production Data of Mounter
Zhang Lu
Shang Yan-Ling
[J]. MANUFACTURING SCIENCE AND TECHNOLOGY, PTS 1-8, 2012, 383-390 : 4312 - +
[8] Multivariate data analysis and modeling through classification and regression trees
Siciliano, R
Mola, F
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 32 (3-4) : 285 - 301
[9] CLASSIFICATION AND REGRESSION TREES (CART)
YEH, CH
[J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1991, 12 (01) : 95 - 96
[10] CORT: Classification or regression trees
Scott, CD
Willett, RM
Nowak, RD
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL VI, PROCEEDINGS: SIGNAL PROCESSING THEORY AND METHODS, 2003, : 153 - 156

← 1 2 3 4 5 →