Tree-based ensembles for multi-output regression: Comparing multivariate approaches with separate univariate ones

被引:9
|
作者
Schmid, Lena [1 ]
Gerharz, Alexander [1 ]
Groll, Andreas [1 ]
Pauly, Markus [1 ,2 ]
机构
[1] TU Dortmund Univ, Dept Stat, D-44227 Dortmund, Germany
[2] UA Ruhr, Res Ctr Trustworthy Data Sci & Secur, D-44227 Dortmund, Germany
关键词
Machine learning; Multi -output regression; Multivariate trees; VARIABLE SELECTION; RANDOM FOREST; CLASSIFICATION; MATCHES; DESIGNS;
D O I
10.1016/j.csda.2022.107628
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tree-based ensembles such as the Random Forest are modern classics among statistical learning methods. In particular, they are used for predicting univariate responses. In case of multiple outputs the question arises whether it is better to separately fit univariate models or directly follow a multivariate approach. For the latter, several possibilities exist that are, e.g. based on modified splitting or stopping rules for multi-output regression. These methods are compared in extensive simulations and a real data example to help in answering the primary question when to use multivariate ensemble techniques instead of univariate ones.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] Deep tree-ensembles for multi-output prediction
    Nakano, Felipe Kenji
    Pliakos, Konstantinos
    Vens, Celine
    [J]. PATTERN RECOGNITION, 2022, 121
  • [2] Regularization-based model tree for multi-output regression
    Jeong, Jun-Yong
    Kang, Ju-Seok
    Jun, Chi-Hyuck
    [J]. INFORMATION SCIENCES, 2020, 507 : 240 - 255
  • [3] Multivariate output global sensitivity analysis using multi-output support vector regression
    Kai Cheng
    Zhenzhou Lu
    Kaichao Zhang
    [J]. Structural and Multidisciplinary Optimization, 2019, 59 : 2177 - 2187
  • [4] Multivariate output global sensitivity analysis using multi-output support vector regression
    Cheng, Kai
    Lu, Zhenzhou
    Zhang, Kaichao
    [J]. STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2019, 59 (06) : 2177 - 2187
  • [5] A Partially Linear Tree-based Regression Model for Multivariate Outcomes
    Yu, Kai
    Wheeler, William
    Li, Qizhai
    Bergen, Andrew W.
    Caporaso, Neil
    Chatterjee, Nilanjan
    Chen, Jinbo
    [J]. BIOMETRICS, 2010, 66 (01) : 89 - 96
  • [6] Multivariate Gaussian and Student-t process regression for multi-output prediction
    Chen, Zexun
    Wang, Bo
    Gorban, Alexander N.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 3005 - 3028
  • [7] Multivariate Gaussian and Student-t process regression for multi-output prediction
    Zexun Chen
    Bo Wang
    Alexander N. Gorban
    [J]. Neural Computing and Applications, 2020, 32 : 3005 - 3028
  • [8] Online tree-based ensembles and option trees for regression on evolving data streams
    Ikonomovska, Elena
    Gama, Joao
    Dzeroski, Saso
    [J]. NEUROCOMPUTING, 2015, 150 : 458 - 470
  • [9] Transformer based Multi-output Regression Learning for Wastewater Treatment
    Huang, Zi-lan
    Zhang, Luo-yi
    Zhang, Yi
    Qian, Shu-wei
    Wang, Chong-jun
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 698 - 703
  • [10] Correction to: Multivariate Gaussian and Student-t process regression for multi-output prediction
    Zexun Chen
    Bo Wang
    Alexander N. Gorban
    [J]. Neural Computing and Applications, 2020, 32 : 11963 - 11963