Variable selection in multivariate multiple regression

被引:11
|
作者
Variyath, Asokan Mulayath [1 ]
Brobbey, Anita [1 ]
机构
[1] Mem Univ Newfoundland, Dept Math & Stat, St John, NF, Canada
来源
PLOS ONE | 2020年 / 15卷 / 07期
关键词
MIXED DISCRETE; MODELING SLUMP; CONCRETE; RESPONSES; SEVERITY; MIXTURE;
D O I
10.1371/journal.pone.0236067
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Introduction In many practical situations, we are interested in the effect of covariates on correlated multiple responses. In this paper, we focus on estimation and variable selection in multi-response multiple regression models. Correlation among the response variables must be modeled for valid inference. Method We used an extension of the generalized estimating equation (GEE) methodology to simultaneously analyze binary, count, and continuous outcomes with nonlinear functions. Variable selection plays an important role in modeling correlated responses because of the large number of model parameters that must be estimated. We propose a penalized-likelihood approach based on the extended GEEs for simultaneous parameter estimation and variable selection. Results and conclusions We conducted a series of Monte Carlo simulations to investigate the performance of our method, considering different sample sizes and numbers of response variables. The results showed that our method works well compared to treating the responses as uncorrelated. We recommend using an unstructured correlation model with the Bayesian information criterion (BIC) to select the tuning parameters. We demonstrated our method using data from a concrete slump test.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] VARIABLE SELECTION IN MULTIVARIATE MULTIPLE-REGRESSION
    SMITH, DW
    GILL, DS
    HAMMOND, JJ
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1985, 22 (3-4) : 217 - 227
  • [2] ON VARIABLE SELECTION IN MULTIVARIATE REGRESSION
    SPARKS, RS
    ZUCCHINI, W
    COUTSOURIDES, D
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1985, 14 (07) : 1569 - 1587
  • [3] Variable selection for multivariate logistic regression models
    Chen, MH
    Dey, DK
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 111 (1-2) : 37 - 55
  • [4] Variable Selection and Redundancy in Multivariate Regression Models
    Westad, Frank
    Marini, Federico
    [J]. FRONTIERS IN ANALYTICAL SCIENCE, 2022, 2
  • [5] Splitting variable selection for multivariate regression trees
    Hsiao, Wei-Cheng
    Shih, Yu-Shan
    [J]. STATISTICS & PROBABILITY LETTERS, 2007, 77 (03) : 265 - 271
  • [6] Variable Selection in Multivariate Functional Linear Regression
    Yeh, Chi-Kuang
    Sang, Peijun
    [J]. STATISTICS IN BIOSCIENCES, 2023,
  • [7] A COMPUTATIONAL FRAMEWORK FOR VARIABLE SELECTION IN MULTIVARIATE REGRESSION
    BARRETT, BE
    GRAY, JB
    [J]. STATISTICS AND COMPUTING, 1994, 4 (03) : 203 - 212
  • [8] Multivariate Regression: The Pitfalls of Automated Variable Selection
    Sainani, Kristin L.
    [J]. PM&R, 2013, 5 (09) : 791 - 794
  • [9] RESPONSE VARIABLE SELECTION IN MULTIVARIATE LINEAR REGRESSION
    Khare, Kshitij
    Su, Zhihua
    [J]. STATISTICA SINICA, 2024, 34 (03) : 1325 - 1345
  • [10] A MATLAB toolbox for multivariate regression coupled with variable selection
    Consonni, Viviana
    Baccolo, Giacomo
    Gosetti, Fabio
    Todeschini, Roberto
    Ballabio, Davide
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213