Boosting multivariate structured additive distributional regression models

Cited by: 5
Authors
Stroemer, Annika [1]
Klein, Nadja [2,3]
Staerk, Christian [1]
Klinkhammer, Hannah [1,4]
Mayr, Andreas [1]
Affiliations
[1] Univ Hosp Bonn, Dept Med Biometr Informat & Epidemiol, Bonn, Germany
[2] Tech Univ Dortmund, Chair Uncertainty Quantificat & Stat Learning, Res Ctr Trustworthy Data Sci & Secur UA Ruhr, Dortmund, Germany
[3] Tech Univ Dortmund, Dept Stat, Dortmund, Germany
[4] Univ Hosp Bonn, Inst Genom Stat & Bioinformat, Bonn, Germany
Keywords
generalized additive models for location, scale and shape; model-based boosting; multivariate Gaussian distribution; multivariate logit model; multivariate Poisson distribution; semiparametric regression; VARIABLE SELECTION; POISSON REGRESSION; R PACKAGE; BIVARIATE; REGULARIZATION; ALGORITHMS
DOI
10.1002/sim.9699
Chinese Library Classification
Q [Biological sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
We develop a model-based boosting approach for multivariate distributional regression within the framework of generalized additive models for location, scale, and shape. Our approach enables the simultaneous modeling of all distribution parameters of an arbitrary parametric distribution of a multivariate response conditional on explanatory variables, while remaining applicable to potentially high-dimensional data. Moreover, the boosting algorithm incorporates data-driven variable selection and takes different types of effects into account. A particular merit of the approach is that it allows modeling the association between multiple continuous or discrete outcomes through the relevant covariates. After a detailed simulation study investigating estimation and prediction performance, we demonstrate the full flexibility of our approach in three diverse biomedical applications. The first is based on high-dimensional genomic cohort data from the UK Biobank and considers a bivariate binary response (chronic ischemic heart disease and high cholesterol). Here, we are able to identify genetic variants that are informative for the association between cholesterol and heart disease. The second application considers the demand for health care in Australia, with the number of consultations and the number of prescribed medications as a bivariate count response. The third application analyzes two dimensions of childhood undernutrition in Nigeria as a bivariate response. We find that the correlation between the two undernutrition scores differs considerably depending on the child's age and the region the child lives in.
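The idea of boosting several distribution parameters with built-in variable selection can be illustrated with a minimal sketch. The code below is not the authors' algorithm or software: it is a simplified, univariate Gaussian location-scale example with linear base-learners, in which every iteration updates only the single parameter and covariate that reduce the negative log-likelihood most. The function names (`best_base_learner`, `boost_gaussian`), the step length `nu`, the number of iterations, and the simulated data are all assumptions made for illustration.

```python
import numpy as np

def best_base_learner(Z, u):
    """Fit one least-squares slope per column of Z to the gradient u
    and return the index and slope of the column with the smallest RSS."""
    slopes = Z.T @ u / np.sum(Z ** 2, axis=0)
    rss = np.sum((u[:, None] - Z * slopes) ** 2, axis=0)
    j = int(np.argmin(rss))
    return j, slopes[j]

def nll(y, eta_mu, eta_sigma):
    """Gaussian negative log-likelihood up to a constant, sigma = exp(eta_sigma)."""
    return np.sum(eta_sigma + 0.5 * (y - eta_mu) ** 2 * np.exp(-2.0 * eta_sigma))

def boost_gaussian(X, y, n_iter=500, nu=0.1):
    """Component-wise gradient boosting for mu and log(sigma) (illustrative sketch)."""
    n, p = X.shape
    # Column 0 is an intercept base-learner; the covariates are centred.
    Z = np.hstack([np.ones((n, 1)), X - X.mean(axis=0)])
    eta = {"mu": np.full(n, y.mean()), "sigma": np.full(n, np.log(y.std()))}
    coef = {"mu": np.zeros(p + 1), "sigma": np.zeros(p + 1)}
    for _ in range(n_iter):
        resid = y - eta["mu"]
        sigma2 = np.exp(2.0 * eta["sigma"])
        # Negative gradients of the loss with respect to each additive predictor.
        grads = {"mu": resid / sigma2, "sigma": resid ** 2 / sigma2 - 1.0}
        best = None
        for name, u in grads.items():
            j, slope = best_base_learner(Z, u)
            trial = dict(eta)
            trial[name] = eta[name] + nu * slope * Z[:, j]
            loss = nll(y, trial["mu"], trial["sigma"])
            if best is None or loss < best[0]:
                best = (loss, name, j, slope)
        # Apply only the single best update across both parameters.
        _, name, j, slope = best
        eta[name] = eta[name] + nu * slope * Z[:, j]
        coef[name][j] += nu * slope
    return coef

# Simulated data (assumption): x0 drives the mean, x1 drives the scale,
# and the remaining three covariates are pure noise.
rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 5))
y = 2.0 * X[:, 0] + np.exp(0.5 * X[:, 1]) * rng.normal(size=1000)
coef = boost_gaussian(X, y)
print("mu coefficients:   ", np.round(coef["mu"], 2))
print("sigma coefficients:", np.round(coef["sigma"], 2))
```

Covariates that are never selected keep a coefficient of exactly zero, which is how early stopping yields variable selection in this kind of scheme. Extending the sketch to a multivariate response would add further parameters (for example a correlation or dependence parameter) to the same loop, each with its own gradient and its own set of base-learners.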
Pages: 1779-1801
Page count: 23