Robust gradient boosting for generalized additive models for location, scale and shape

被引:0
|
作者
Speller, Jan [1 ]
Staerk, Christian [1 ]
Gude, Francisco [2 ]
Mayr, Andreas [1 ]
机构
[1] Univ Bonn, Med Fac, Dept Med Biometry Informat & Epidemiol, Venusberg Campus 1, D-53127 Bonn, Germany
[2] USC Univ Hosp, Clin Epidemiol Unit, Santiago De Compostela 15706, Galicia, Spain
关键词
Distributional regression; Gradient boosting; High-dimensional; Log-logistic; Robust; Variable selection; VARIABLE SELECTION; R PACKAGE; REGRESSION;
D O I
10.1007/s11634-023-00555-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Due to the increasing complexity and dimensionality of data sources, it is favorable that methodological approaches yield robust results so that corrupted observations do not jeopardize overall conclusions. We propose a modelling approach which is robust towards outliers in the response variable for generalized additive models for location, scale and shape (GAMLSS). We extend a recently proposed robustification of the log-likelihood to gradient boosting for GAMLSS, which is based on trimming low log-likelihood values via a log-logistic function to a boundary depending on a robustness constant. We recommend a data-driven choice for the involved robustness constant based on a quantile of the unconditioned response variable and investigate the choice in a simulation study for low- and high-dimensional data situations. The versatile application possibilities of robust gradient boosting for GAMLSS are illustrated via three biomedical examples-including the modelling of thyroid hormone levels, spatial effects for functional magnetic resonance brain imaging and a high-dimensional application with gene expression levels for cancer cell lines.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Robust Estimation of Mean and Dispersion Functions in Extended Generalized Additive Models
    Croux, Christophe
    Gijbels, Irene
    Prosdocimi, Ilaria
    [J]. BIOMETRICS, 2012, 68 (01) : 31 - 44
  • [42] Predicting tree species presence and basal area in Utah: A comparison of stochastic gradient boosting, generalized additive models, and tree-based methods
    Moisen, Gretchen G.
    Freeman, Elizabeth A.
    Blackard, Jock A.
    Frescino, Tracey S.
    Zimmermann, Niklaus E.
    Edwards, Thomas C., Jr.
    [J]. ECOLOGICAL MODELLING, 2006, 199 (02) : 176 - 187
  • [43] Nonparametric additive location-scale models for interval censored data
    Lambert, Philippe
    [J]. STATISTICS AND COMPUTING, 2013, 23 (01) : 75 - 90
  • [44] Nonparametric additive location-scale models for interval censored data
    Philippe Lambert
    [J]. Statistics and Computing, 2013, 23 : 75 - 90
  • [45] A new large-scale learning algorithm for generalized additive models
    Bin Gu
    Chenkang Zhang
    Zhouyuan Huo
    Heng Huang
    [J]. Machine Learning, 2023, 112 : 3077 - 3104
  • [46] A new large-scale learning algorithm for generalized additive models
    Gu, Bin
    Zhang, Chenkang
    Huo, Zhouyuan
    Huang, Heng
    [J]. MACHINE LEARNING, 2023, 112 (09) : 3077 - 3104
  • [47] Gradient boosting for linear mixed models
    Griesbach, Colin
    Saefken, Benjamin
    Waldmann, Elisabeth
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2021, 17 (02): : 317 - 329
  • [48] Orthogonal Gradient Boosting for Simpler Additive Rule Ensembles
    Yang, Fan
    Le Bodic, Pierre
    Kamp, Michael
    Boley, Mario
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [49] Robust location and scale estimation based on the univariate generalized t (GT) distribution
    Arslan, O
    Genç, AI
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2003, 32 (08) : 1505 - 1525
  • [50] Signal regression models for location, scale and shape with an application to stock returns
    Brockhaus, Sarah
    Fuest, Andreas
    Mayr, Andreas
    Greven, Sonja
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2018, 67 (03) : 665 - 686