Debiased lasso for generalized linear models with a diverging number of covariates

被引:6
|
作者
Xia, Lu [1 ]
Nan, Bin [2 ]
Li, Yi [3 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Calif Irvine, Dept Stat, Irvine, CA 92717 USA
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
asymptotics; bias correction; high-dimensional regression; lung cancer; statistical inference; NONCONCAVE PENALIZED LIKELIHOOD; GENOME-WIDE ASSOCIATION; P-REGRESSION PARAMETERS; VARIABLE SELECTION; LUNG-CANCER; CONFIDENCE-INTERVALS; ASYMPTOTIC-BEHAVIOR; M-ESTIMATORS; SUSCEPTIBILITY; REGULARIZATION;
D O I
10.1111/biom.13587
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Modeling and drawing inference on the joint associations between single-nucleotide polymorphisms and a disease has sparked interest in genome-wide associations studies. In the motivating Boston Lung Cancer Survival Cohort (BLCSC) data, the presence of a large number of single nucleotide polymorphisms of interest, though smaller than the sample size, challenges inference on their joint associations with the disease outcome. In similar settings, we find that neither the debiased lasso approach (van de Geer et al., 2014), which assumes sparsity on the inverse information matrix, nor the standard maximum likelihood method can yield confidence intervals with satisfactory coverage probabilities for generalized linear models. Under this "large n, diverging p" scenario, we propose an alternative debiased lasso approach by directly inverting the Hessian matrix without imposing the matrix sparsity assumption, which further reduces bias compared to the original debiased lasso and ensures valid confidence intervals with nominal coverage probabilities. We establish the asymptotic distributions of any linear combinations of the parameter estimates, which lays the theoretical ground for drawing inference. Simulations show that the proposed refined debiased estimating method performs well in removing bias and yields honest confidence interval coverage. We use the proposed method to analyze the aforementioned BLCSC data, a large-scale hospital-based epidemiology cohort study investigating the joint effects of genetic variants on lung cancer risks.
引用
收藏
页码:344 / 357
页数:14
相关论文
共 50 条
  • [1] Adaptive Lasso for generalized linear models with a diverging number of parameters
    Cui, Yan
    Chen, Xia
    Yan, Li
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (23) : 11826 - 11842
  • [2] Communication-efficient distributed estimator for generalized linear models with a diverging number of covariates
    Zhou, Ping
    Yu, Zhen
    Ma, Jingyi
    Tian, Maozai
    Fan, Ye
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 157
  • [3] GENERALIZED ADDITIVE PARTIAL LINEAR MODELS FOR CLUSTERED DATA WITH DIVERGING NUMBER OF COVARIATES USING GEE
    Lian, Heng
    Liang, Hua
    Wang, Lan
    [J]. STATISTICA SINICA, 2014, 24 (01) : 173 - 196
  • [4] ESTIMATION AND MODEL SELECTION IN GENERALIZED ADDITIVE PARTIAL LINEAR MODELS FOR CORRELATED DATA WITH DIVERGING NUMBER OF COVARIATES
    Wang, Li
    Xue, Lan
    Qu, Annie
    Liang, Hua
    [J]. ANNALS OF STATISTICS, 2014, 42 (02): : 592 - 624
  • [5] Asymptotic Properties of Maximum Quasi-Likelihood Estimators in Generalized Linear Models with Diverging Number of Covariates
    Qibing Gao
    Xiuli Du
    Xiuqing Zhou
    Fengchang Xie
    [J]. Journal of Systems Science and Complexity, 2018, 31 : 1362 - 1376
  • [6] Asymptotic Properties of Maximum Quasi-Likelihood Estimators in Generalized Linear Models with Diverging Number of Covariates
    GAO Qibing
    DU Xiuli
    ZHOU Xiuqing
    XIE Fengchang
    [J]. Journal of Systems Science & Complexity, 2018, 31 (05) : 1362 - 1376
  • [7] Asymptotic Properties of Maximum Quasi-Likelihood Estimators in Generalized Linear Models with Diverging Number of Covariates
    Gao Qibing
    Du Xiuli
    Zhou Xiuqing
    Xie Fengchang
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2018, 31 (05) : 1362 - 1376
  • [8] Bridge estimation for generalized linear models with a diverging number of parameters
    Wang, Mingqiu
    Song, Lixin
    Wang, Xiaoguang
    [J]. STATISTICS & PROBABILITY LETTERS, 2010, 80 (21-22) : 1584 - 1596
  • [9] Debiased lasso after sample splitting for estimation and inference in high-dimensional generalized linear models
    Vazquez, Omar
    Nan, Bin
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024,
  • [10] Robust rank-based variable selection in double generalized linear models with diverging number of parameters under adaptive Lasso
    Nguelifack, Brice M.
    Kemajou-Brown, Isabelle
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (11) : 2051 - 2072