Linear Model Selection When Covariates Contain Errors

被引:14
|
作者
Zhang, Xinyu [1 ]
Wang, Haiying [2 ]
Ma, Yanyuan [3 ]
Carroll, Raymond J. [4 ,5 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing, Peoples R China
[2] Univ New Hampshire, Dept Math & Stat, Durham, NH 03824 USA
[3] Penn State Univ, Dept Stat, State Coll, PA USA
[4] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[5] Univ Technol Sydney, Sch Math & Phys Sci, Broadway, NSW, Australia
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Errors in covariates; Loss efficiency; Measurement error; Model selection; Selection consistency; VARIABLE SELECTION; CROSS-VALIDATION; PREDICTION;
D O I
10.1080/01621459.2016.1219262
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Prediction precision is arguably the most relevant criterion of amodel in practice and is often a sought after property. A common difficulty with covariates measured with errors is the impossibility of performing prediction evaluation on the data even if a model is completely given without any unknown parameters. We bypass this inherent difficulty by using special properties on moment relations in linear regression models with measurement errors. The end product is a model selection procedure that achieves the same optimality properties that are achieved in classical linear regression models without covariate measurement error. Asymptotically, the procedure selects the model with the minimum prediction error in general, and selects the smallest correct model if the regression relation is indeed linear. Our model selection procedure is useful in prediction when future covariates without measurement error become available, for example, due to improved technology or better management and design of data collection procedures. Supplementary materials for this article are available online.
引用
收藏
页码:1553 / 1561
页数:9
相关论文
共 50 条
  • [1] Bayesian model selection for a linear model with grouped covariates
    Xiaoyi Min
    Dongchu Sun
    [J]. Annals of the Institute of Statistical Mathematics, 2016, 68 : 877 - 903
  • [2] Bayesian model selection for a linear model with grouped covariates
    Min, Xiaoyi
    Sun, Dongchu
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2016, 68 (04) : 877 - 903
  • [3] Sequential estimation in generalized linear models when covariates are subject to errors
    Yuan-chin Ivan Chang
    [J]. Metrika, 2011, 73 : 93 - 120
  • [4] Sequential estimation in generalized linear models when covariates are subject to errors
    Chang, Yuan-chin Ivan
    [J]. METRIKA, 2011, 73 (01) : 93 - 120
  • [5] Generalized linear mixed model for binary outcomes when covariates are subject to measurement errors and detection limits
    Xie, Xianhong
    Xue, Xiaonan
    Strickler, Howard D.
    [J]. STATISTICS IN MEDICINE, 2018, 37 (01) : 119 - 136
  • [6] Variable Selection for Generalized Linear Model with Highly Correlated Covariates
    Yue, Li Li
    Wang, Wei Tao
    Li, Gao Rong
    [J]. ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2024, 40 (06) : 1458 - 1480
  • [7] Model selection of generalized partially linear models with missing covariates
    Fu, Ying-Zi
    Chen, Xue-Dong
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (01) : 126 - 138
  • [8] Variable Selection for Generalized Linear Model with Highly Correlated Covariates
    Li Li YUE
    Wei Tao WANG
    Gao Rong LI
    [J]. Acta Mathematica Sinica,English Series, 2024, (06) : 1458 - 1480
  • [9] Goodness-of-fit test for partial functional linear model with errors in scalar covariates
    Zhang, Tong
    Sun, Zhihua
    Sun, Liuquan
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2023, 227 : 91 - 111
  • [10] Model selection strategies for identifying most relevant covariates in homoscedastic linear models
    Min, Aleksey
    Holzmann, Hajo
    Czado, Claudia
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3194 - 3211