Covariate Selection in High-Dimensional Generalized Linear Models With Measurement Error

被引:19
|
作者
Sorensen, Oystein [1 ]
Hellton, Kristoffer Herland [2 ]
Frigessi, Arnoldo [1 ,3 ]
Thoresen, Magne [1 ]
机构
[1] Univ Oslo, Oslo Ctr Biostat & Epidemiol, Dept Biostat, Oslo, Norway
[2] Univ Oslo, Dept Math, Oslo, Norway
[3] Oslo Univ Hosp, Oslo Ctr Biostat & Epidemiol, Res Support Serv, Oslo, Norway
关键词
Generalized linear model; High-dimensional inference; Matrix uncertainty selector; Measurement error; Sparse estimation; VARIABLE SELECTION; DANTZIG SELECTOR; LASSO; REGRESSION; SHRINKAGE; RECOVERY; NOISY;
D O I
10.1080/10618600.2018.1425626
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In many problems involving generalized linear models, the covariates are subject to measurement error. When the number of covariates p exceeds the sample size n, regularized methods like the lasso or Dantzig selector are required. Several recent papers have studied methods which correct for measurement error in the lasso or Dantzig selector for linear models in the p > n setting. We study a correction for generalized linear models, based on Rosenbaum and Tsybakov's matrix uncertainty selector. By not requiring an estimate of the measurement error covariance matrix, this generalized matrix uncertainty selector has a great practical advantage in problems involving high-dimensional data. We further derive an alternative method based on the lasso, and develop efficient algorithms for both methods. In our simulation studies of logistic and Poisson regression with measurement error, the proposed methods outperform the standard lasso and Dantzig selector with respect to covariate selection, by reducing the number of false positives considerably. We also consider classification of patients on the basis of gene expression data with noisy measurements. Supplementary materials for this article are available online.
引用
收藏
页码:739 / 749
页数:11
相关论文
共 50 条
  • [31] Penalized empirical likelihood for high-dimensional generalized linear models
    Chen, Xia
    Mao, Liyue
    [J]. STATISTICS AND ITS INTERFACE, 2021, 14 (02) : 83 - 94
  • [32] High-Dimensional Inference for Generalized Linear Models with Hidden Confounding
    Ouyang, Jing
    Tan, Kean Ming
    Xu, Gongjun
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [33] Overlapping group lasso for high-dimensional generalized linear models
    Zhou, Shengbin
    Zhou, Jingke
    Zhang, Bo
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (19) : 4903 - 4917
  • [34] Adaptive group Lasso for high-dimensional generalized linear models
    Mingqiu Wang
    Guo-Liang Tian
    [J]. Statistical Papers, 2019, 60 : 1469 - 1486
  • [35] Bayesian Subset Modeling for High-Dimensional Generalized Linear Models
    Liang, Faming
    Song, Qifan
    Yu, Kai
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (502) : 589 - 606
  • [36] Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm
    Marion Naveau
    Guillaume Kon Kam King
    Renaud Rincent
    Laure Sansonnet
    Maud Delattre
    [J]. Statistics and Computing, 2024, 34
  • [37] Inference in high dimensional linear measurement error models
    Li, Mengyan
    Li, Runze
    Ma, Yanyuan
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 184
  • [38] Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm
    Naveau, Marion
    King, Guillaume Kon Kam
    Rincent, Renaud
    Sansonnet, Laure
    Delattre, Maud
    [J]. STATISTICS AND COMPUTING, 2024, 34 (01)
  • [39] Variable selection in high-dimensional partly linear additive models
    Lian, Heng
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (04) : 825 - 839
  • [40] Group selection in high-dimensional partially linear additive models
    Wei, Fengrong
    [J]. BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2012, 26 (03) : 219 - 243