Semiparametric estimation for measurement error models with validation data

被引:18
|
作者
Xu, Yuhang [1 ]
Kim, Jae Kwang [2 ,3 ]
Li, Yehua [2 ]
机构
[1] Univ Nebraska, Dept Stat, Lincoln, NE 68583 USA
[2] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
[3] Korea Adv Inst Sci & Technol, Dept Math Sci, Daejeon 34141, South Korea
基金
美国国家科学基金会;
关键词
Errors-in-variables; fractional imputation; kernel smoothing; primary data; semiparametric method; validation data; IN-COVARIABLES MODELS; LOGISTIC-REGRESSION; VARIABLE SELECTION; SCORE METHOD; INFERENCE;
D O I
10.1002/cjs.11314
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider regression problems where error-prone surrogates of true predictors are collected in a primary data set while accurate measurements of the predictors are available only in a relatively small validation data set. We propose a new class of semiparametric estimators for the regression coefficients based on expected estimating equations, where the relationship between the surrogates and the true predictors is modelled nonparametrically using a kernel smoother trained with the validation data. The new methods are developed under two different scenarios where the response variable is either observed or not observed in the validation data set. The proposed estimators have a natural connection with the fractional imputation method. They are consistent, asymptotically unbiased, and normal in both scenarios. Our simulation studies show that the proposed estimators are superior to competitors in terms of bias and mean squared error and are quite robust against the misspecification of the regression model and bandwidth selection. A real data application to the Korean Longitudinal Study of Aging is presented for illustration. The Canadian Journal of Statistics 45: 185-201; 2017 (c) 2017 Statistical Society of Canada
引用
收藏
页码:185 / 201
页数:17
相关论文
共 50 条