Regression Trees and Ensemble for Multivariate Outcomes

被引:1
|
作者
Reynolds, Evan L. [1 ]
Callaghan, Brian C. [1 ]
Gaies, Michael [2 ]
Banerjee, Mousumi [3 ]
机构
[1] Univ Michigan, Dept Neurol, Ann Arbor, MI 48109 USA
[2] Univ Cincinnati, Dept Pediat, Ann Arbor, MI USA
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI USA
基金
美国国家卫生研究院;
关键词
Multivariate outcomes; regression trees; Mahalanobis distance; clinical interpretability; machine learning; NEUROPATHY; PREVALENCE; OBESITY;
D O I
10.1007/s13571-023-00301-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Tree-based methods have become one of the most flexible, intuitive, and powerful analytic tools for exploring complex data structures. The best documented, and arguably most popular uses of tree-based methods are in biomedical research, where multivariate outcomes occur commonly (e.g. diastolic and systolic blood pressure and nerve conduction measures in studies of neuropathy). Existing tree-based methods for multivariate outcomes do not appropriately take into account the correlation that exists in such data. In this paper, we develop goodness-of-split measures for building multivariate regression trees for continuous multivariate outcomes. We propose two general approaches: minimizing within-node homogeneity and maximizing between-node separation. Within-node homogeneity is measured using the average Mahalanobis distance and the determinant of the variance-covariance matrix. Between-node separation is measured using the Mahalanobis distance, Euclidean distance and standardized Euclidean distance. To enhance prediction accuracy we extend the single multivariate regression tree to an ensemble of multivariate trees. Extensive simulations are presented to examine the properties of our goodness-of-split measures. Finally, the proposed methods are illustrated using two clinical datasets of neuropathy and pediatric cardiac surgery.
引用
收藏
页码:77 / 109
页数:33
相关论文
共 50 条
  • [1] Regression Trees and Ensemble for Multivariate Outcomes
    Evan L. Reynolds
    Brian C. Callaghan
    Michael Gaies
    Mousumi Banerjee
    Sankhya B, 2023, 85 : 77 - 109
  • [2] Multivariate trees for mixed outcomes
    Dine, Abdessamad
    Larocque, Denis
    Bellavance, Francois
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (11) : 3795 - 3804
  • [3] Splitting variable selection for multivariate regression trees
    Hsiao, Wei-Cheng
    Shih, Yu-Shan
    STATISTICS & PROBABILITY LETTERS, 2007, 77 (03) : 265 - 271
  • [4] Multivariate regression trees for analysis of abundance data
    Larsen, DR
    Speckman, PL
    BIOMETRICS, 2004, 60 (02) : 543 - 549
  • [5] An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines
    Choubin, Bahram
    Moradi, Ehsan
    Golshan, Mohammad
    Adamowski, Jan
    Sajedi-Hosseini, Farzaneh
    Mosavi, Amir
    SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 651 : 2087 - 2096
  • [6] Multivariate wind power curve modeling using multivariate adaptive regression splines and regression trees
    Mushtaq, Khurram
    Zou, Runmin
    Waris, Asim
    Yang, Kaifeng
    Wang, Ji
    Iqbal, Javaid
    Jameel, Mohammed
    PLOS ONE, 2023, 18 (08):
  • [7] One Millisecond Face Alignment with an Ensemble of Regression Trees
    Kazemi, Vahid
    Sullivan, Josephine
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1867 - 1874
  • [8] Bayesian additive regression trees for multivariate skewed responses
    Um, Seungha
    Linero, Antonio R.
    Sinha, Debajyoti
    Bandyopadhyay, Dipankar
    STATISTICS IN MEDICINE, 2023, 42 (03) : 246 - 263
  • [9] Spatial Multivariate Trees for Big Data Bayesian Regression
    Peruzzi, Michele
    Dunson, David B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [10] TAIL FORECASTING WITH MULTIVARIATE BAYESIAN ADDITIVE REGRESSION TREES
    Clark, Todd E.
    Huber, Florian
    Koop, Gary
    Marcellino, Massimiliano
    Pfarrhofer, Michael
    INTERNATIONAL ECONOMIC REVIEW, 2023, 64 (03) : 979 - 1022