Covariance regression with random forests

被引:2
|
作者
Alakus, Cansu [1 ]
Larocque, Denis [1 ]
Labbe, Aurelie [1 ]
机构
[1] HEC Montreal, Dept Decis Sci, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Covariance regression; Multivariate response; Random forests; Variable importance; DIMENSION; MATRICES; TESTS; EQUALITY; TSH;
D O I
10.1186/s12859-023-05377-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Capturing the conditional covariances or correlations among the elements of a multivariate response vector based on covariates is important to various fields including neuroscience, epidemiology and biomedicine. We propose a new method called Covariance Regression with Random Forests (CovRegRF) to estimate the covariance matrix of a multivariate response given a set of covariates, using a random forest framework. Random forest trees are built with a splitting rule specially designed to maximize the difference between the sample covariance matrix estimates of the child nodes. We also propose a significance test for the partial effect of a subset of covariates. We evaluate the performance of the proposed method and significance test through a simulation study which shows that the proposed method provides accurate covariance matrix estimates and that the Type-1 error is well controlled. An application of the proposed method to thyroid disease data is also presented. CovRegRF is implemented in a freely available R package on CRAN.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Prediction of Torpedo Initial Velocity Based on Random Forests Regression
    Zhang, Ling
    Wang, Pukai
    Jiang, Tianyuan
    Fan, Gehua
    Dan, Caihong
    [J]. 2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL I, 2015, : 337 - 339
  • [32] Distributional Random Forests: Heterogeneity Adjustment and Multivariate Distributional Regression
    Cevid, Domagoj
    Michel, Loris
    Naf, Jeffrey
    Buhlmann, Peter
    Meinshausen, Nicolai
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [33] Robust Regression Random Forests by Small and Noisy Training Data
    Min, Lev, V
    Kovalev, Maxim S.
    Coolen, Frank P. A.
    [J]. PROCEEDINGS OF 2019 XXII INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2019, : 134 - 137
  • [34] Constructing covariance functions for random regression models for growth in Gelbvieh beef cattle
    Legarra, A
    Misztal, I
    Bertrand, JK
    [J]. JOURNAL OF ANIMAL SCIENCE, 2004, 82 (06) : 1564 - 1571
  • [35] A regression approach for estimating the parameters of the covariance function of a stationary spatial random process
    Hyun, Jung Won
    Burman, Prabir
    Paul, Debashis
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (08) : 2330 - 2344
  • [36] Penalized semiparametric Cox regression model on XGBoost and random survival forests
    Wang, Yating
    Su, Jinxia
    Zhao, Xuejing
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3095 - 3103
  • [37] Causal Random Forests Model Using Instrumental Variable Quantile Regression
    Chen, Jau-er
    Hsiang, Chen-Wei
    [J]. ECONOMETRICS, 2019, 7 (04)
  • [38] Comparing spatial regression to random forests for large environmental data sets
    Fox, Eric W.
    Ver Hoef, Jay M.
    Olsen, Anthony R.
    [J]. PLOS ONE, 2020, 15 (03):
  • [39] Accurate Object Detection with Joint Classification-Regression Random Forests
    Schulter, Samuel
    Leistner, Christian
    Wohlhart, Paul
    Roth, Peter M.
    Bischof, Horst
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 923 - 930
  • [40] Regression with missing data, a comparison study of techniques based on random forests
    Gomez-Mendez, Irving
    Joly, Emilien
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (12) : 1924 - 1949