Covariate Selection in High-Dimensional Generalized Linear Models With Measurement Error

被引:19
|
作者
Sorensen, Oystein [1 ]
Hellton, Kristoffer Herland [2 ]
Frigessi, Arnoldo [1 ,3 ]
Thoresen, Magne [1 ]
机构
[1] Univ Oslo, Oslo Ctr Biostat & Epidemiol, Dept Biostat, Oslo, Norway
[2] Univ Oslo, Dept Math, Oslo, Norway
[3] Oslo Univ Hosp, Oslo Ctr Biostat & Epidemiol, Res Support Serv, Oslo, Norway
关键词
Generalized linear model; High-dimensional inference; Matrix uncertainty selector; Measurement error; Sparse estimation; VARIABLE SELECTION; DANTZIG SELECTOR; LASSO; REGRESSION; SHRINKAGE; RECOVERY; NOISY;
D O I
10.1080/10618600.2018.1425626
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In many problems involving generalized linear models, the covariates are subject to measurement error. When the number of covariates p exceeds the sample size n, regularized methods like the lasso or Dantzig selector are required. Several recent papers have studied methods which correct for measurement error in the lasso or Dantzig selector for linear models in the p > n setting. We study a correction for generalized linear models, based on Rosenbaum and Tsybakov's matrix uncertainty selector. By not requiring an estimate of the measurement error covariance matrix, this generalized matrix uncertainty selector has a great practical advantage in problems involving high-dimensional data. We further derive an alternative method based on the lasso, and develop efficient algorithms for both methods. In our simulation studies of logistic and Poisson regression with measurement error, the proposed methods outperform the standard lasso and Dantzig selector with respect to covariate selection, by reducing the number of false positives considerably. We also consider classification of patients on the basis of gene expression data with noisy measurements. Supplementary materials for this article are available online.
引用
收藏
页码:739 / 749
页数:11
相关论文
共 50 条
  • [21] Variable selection for proportional hazards models with high-dimensional covariates subject to measurement error
    Chen, Baojiang
    Yuan, Ao
    Yi, Grace Y.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (02): : 397 - 420
  • [22] Partial profile score feature selection in high-dimensional generalized linear interaction models
    Xu, Zengchao
    Luo, Shan
    Chen, Zehua
    [J]. STATISTICS AND ITS INTERFACE, 2022, 15 (04) : 433 - 447
  • [23] Cluster feature selection in high-dimensional linear models
    Lin, Bingqing
    Pang, Zhen
    Wang, Qihua
    [J]. RANDOM MATRICES-THEORY AND APPLICATIONS, 2018, 7 (01)
  • [24] Variable selection for high-dimensional generalized linear models with the weighted elastic-net procedure
    Wang, Xiuli
    Wang, Mingqiu
    [J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (05) : 796 - 809
  • [25] Testing generalized linear models with high-dimensional nuisance parameters
    Chen, Jinsong
    Li, Quefeng
    Chen, Hua Yun
    [J]. BIOMETRIKA, 2023, 110 (01) : 83 - 99
  • [26] Adaptive group Lasso for high-dimensional generalized linear models
    Wang, Mingqiu
    Tian, Guo-Liang
    [J]. STATISTICAL PAPERS, 2019, 60 (05) : 1469 - 1486
  • [27] GENERALIZED ADDITIVE PARTIAL LINEAR MODELS WITH HIGH-DIMENSIONAL COVARIATES
    Lian, Heng
    Liang, Hua
    [J]. ECONOMETRIC THEORY, 2013, 29 (06) : 1136 - 1161
  • [28] Generalized autoregressive linear models for discrete high-dimensional data
    Pandit, Parthe
    Sahraee-Ardakan, Mojtaba
    Amini, Arash A.
    Rangan, Sundeep
    Fletcher, Alyson K.
    [J]. IEEE Journal on Selected Areas in Information Theory, 2020, 1 (03): : 884 - 896
  • [29] AN ADAPTIVE TEST ON HIGH-DIMENSIONAL PARAMETERS IN GENERALIZED LINEAR MODELS
    Wu, Chong
    Xu, Gongjun
    Pan, Wei
    [J]. STATISTICA SINICA, 2019, 29 (04) : 2163 - 2186
  • [30] Transfer Learning Under High-Dimensional Generalized Linear Models
    Tian, Ye
    Feng, Yang
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2684 - 2697