LASSO type penalized spline regression for binary data

被引:25
|
作者
Mullah, Muhammad Abu Shadeque [1 ]
Hanley, James A. [1 ]
Benedetti, Andrea [1 ,2 ,3 ]
机构
[1] McGill Univ, Dept Epidemiol Biostat & Occupat Hlth, Montreal, PQ, Canada
[2] McGill Univ, Dept Med, Montreal, PQ, Canada
[3] McGill Univ, Montreal Chest Inst, Resp Epidemiol & Clin Res Unit, Hlth Ctr, Montreal, PQ, Canada
关键词
Penalized splines; Generalized linear mixed models; Ridge regression; Least absolute shrinkage and selection operator (LASSO); Markov chain Monte Carlo; SEMIPARAMETRIC MIXED MODELS; BAYESIAN-INFERENCE;
D O I
10.1186/s12874-021-01234-9
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Generalized linear mixed models (GLMMs), typically used for analyzing correlated data, can also be used for smoothing by considering the knot coefficients from a regression spline as random effects. The resulting models are called semiparametric mixed models (SPMMs). Allowing the random knot coefficients to follow a normal distribution with mean zero and a constant variance is equivalent to using a penalized spline with a ridge regression type penalty. We introduce the least absolute shrinkage and selection operator (LASSO) type penalty in the SPMM setting by considering the coefficients at the knots to follow a Laplace double exponential distribution with mean zero. Methods We adopt a Bayesian approach and use the Markov Chain Monte Carlo (MCMC) algorithm for model fitting. Through simulations, we compare the performance of curve fitting in a SPMM using a LASSO type penalty to that of using ridge penalty for binary data. We apply the proposed method to obtain smooth curves from data on the relationship between the amount of pack years of smoking and the risk of developing chronic obstructive pulmonary disease (COPD). Results The LASSO penalty performs as well as ridge penalty for simple shapes of association and outperforms the ridge penalty when the shape of association is complex or linear. Conclusion We demonstrated that LASSO penalty captured complex dose-response association better than the Ridge penalty in a SPMM.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] LASSO type penalized spline regression for binary data
    Muhammad Abu Shadeque Mullah
    James A. Hanley
    Andrea Benedetti
    [J]. BMC Medical Research Methodology, 21
  • [2] Data-driven selection of the spline dimension in penalized spline regression
    Kauermann, Goeran
    Opsomer, Jean D.
    [J]. BIOMETRIKA, 2011, 98 (01) : 225 - 230
  • [3] Bootstrapping for Penalized Spline Regression
    Kauermann, Goeran
    Claeskens, Gerda
    Opsomer, J. D.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (01) : 126 - 146
  • [4] Hazard regression for interval-censored data with penalized spline
    Cai, TX
    Betensky, RA
    [J]. BIOMETRICS, 2003, 59 (03) : 570 - 579
  • [5] Influence on Smoothness in Penalized Likelihood Regression for Binary Data
    Robert Jernigan
    Julie O’Connell
    [J]. Computational Statistics, 2001, 16 : 481 - 504
  • [6] On knot placement for penalized spline regression
    Yao, Fang
    Lee, Thomas C. M.
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2008, 37 (03) : 259 - 267
  • [7] On knot placement for penalized spline regression
    Fang Yao
    Thomas C. M. Lee
    [J]. Journal of the Korean Statistical Society, 2008, 37 : 259 - 267
  • [8] Influence on smoothness in penalized likelihood regression for binary data
    Jernigan, R
    O'Connell, J
    [J]. COMPUTATIONAL STATISTICS, 2001, 16 (04) : 481 - 504
  • [9] COORDINATE DESCENT ALGORITHMS FOR LASSO PENALIZED REGRESSION
    Wu, Tong Tong
    Lange, Kenneth
    [J]. ANNALS OF APPLIED STATISTICS, 2008, 2 (01): : 224 - 244
  • [10] Trimmed LASSO regression estimator for binary response data
    Sun, Hongwei
    Cui, Yuehua
    Gao, Qian
    Wang, Tong
    [J]. STATISTICS & PROBABILITY LETTERS, 2020, 159