Boosting multivariate structured additive distributional regression models

被引:5
|
作者
Stroemer, Annika [1 ]
Klein, Nadja [2 ,3 ]
Staerk, Christian [1 ]
Klinkhammer, Hannah [1 ,4 ]
Mayr, Andreas [1 ]
机构
[1] Univ Hosp Bonn, Dept Med Biometr Informat & Epidemiol, Bonn, Germany
[2] Tech Univ Dortmund, Chair Uncertainty Quantificat & Stat Learning, Res Ctr Trustworthy Data Sci & Secur UA Ruhr, Dortmund, Germany
[3] Tech Univ Dortmund, Dept Stat, Dortmund, Germany
[4] Univ Hosp Bonn, Inst Genom Stat & Bioinformat, Bonn, Germany
关键词
generalized additive models for location; scale and shape; model-based boosting; multivariate Gaussian distribution; multivariate logit model; multivariate Poisson distribution; semiparametric regression; VARIABLE SELECTION; POISSON REGRESSION; R PACKAGE; BIVARIATE; REGULARIZATION; ALGORITHMS;
D O I
10.1002/sim.9699
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We develop a model-based boosting approach for multivariate distributional regression within the framework of generalized additive models for location, scale, and shape. Our approach enables the simultaneous modeling of all distribution parameters of an arbitrary parametric distribution of a multivariate response conditional on explanatory variables, while being applicable to potentially high-dimensional data. Moreover, the boosting algorithm incorporates data-driven variable selection, taking various different types of effects into account. As a special merit of our approach, it allows for modeling the association between multiple continuous or discrete outcomes through the relevant covariates. After a detailed simulation study investigating estimation and prediction performance, we demonstrate the full flexibility of our approach in three diverse biomedical applications. The first is based on high-dimensional genomic cohort data from the UK Biobank, considering a bivariate binary response (chronic ischemic heart disease and high cholesterol). Here, we are able to identify genetic variants that are informative for the association between cholesterol and heart disease. The second application considers the demand for health care in Australia with the number of consultations and the number of prescribed medications as a bivariate count response. The third application analyses two dimensions of childhood undernutrition in Nigeria as a bivariate response and we find that the correlation between the two undernutrition scores is considerably different depending on the child's age and the region the child lives in.
引用
收藏
页码:1779 / 1801
页数:23
相关论文
共 50 条
  • [41] Boosting distributional copula regression for bivariate binary, discrete and mixed responses
    Sanchez, Guillermo Briseno
    Klein, Nadja
    Klinkhammer, Hannah
    Mayr, Andreas
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2025,
  • [42] Outliers in multivariate regression models
    Srivastava, MS
    von Rosen, D
    JOURNAL OF MULTIVARIATE ANALYSIS, 1998, 65 (02) : 195 - 208
  • [43] Predictions in multivariate regression models
    Stulajter, F
    TATRA MOUNTAINS MATHEMATICAL PUBLICATIONS, VOL 17, 1998, : 265 - 272
  • [44] Multivariate spatial regression models
    Dani, GA
    Moreira, ARB
    JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 91 (02) : 262 - 281
  • [45] Multivariate boundary regression models
    Selk, Leonie
    Tillier, Charles
    Marigliano, Orlando
    SCANDINAVIAN JOURNAL OF STATISTICS, 2022, 49 (01) : 400 - 426
  • [46] Additive logistic regression: A statistical view of boosting - Rejoinder
    Friedman, J
    Hastie, T
    Tibshirani, R
    ANNALS OF STATISTICS, 2000, 28 (02): : 400 - 407
  • [47] Online Boosting Algorithm for Regression with Additive and Multiplicative Updates
    Mirza, Ali H.
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [48] Additive logistic regression: A statistical view of boosting - Discussion
    Breiman, L
    ANNALS OF STATISTICS, 2000, 28 (02): : 374 - 377
  • [49] Structured Dimensionality Reduction for Additive Model Regression
    Fawzi, Alhussein
    Fiot, Jean-Baptiste
    Chen, Bei
    Sinn, Mathieu
    Frossard, Pascal
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (06) : 1589 - 1601
  • [50] Boosting Functional Regression Models with FDboost
    Brockhaus, Sarah
    Ruegamer, David
    Greven, Sonja
    JOURNAL OF STATISTICAL SOFTWARE, 2020, 94 (10): : 1 - 50