Semiparametric estimation of logistic regression model with missing covariates and outcome

被引:8
|
作者
Lee, Shen-Ming [2 ]
Li, Chin-Shang [1 ]
Hsieh, Shu-Hui [2 ]
Huang, Li-Hui [3 ]
机构
[1] Univ Calif Davis, Div Biostat, Dept Publ Hlth Sci, Davis, CA 95616 USA
[2] Feng Chia Univ, Dept Stat, Taichung 40724, Taiwan
[3] Overseas Chinese Univ, Dept Int Trade, Taichung, Taiwan
基金
美国国家卫生研究院;
关键词
Missing value; Logistic regression model; Missing outcome; Missing covariates; MEAN SCORE METHOD; VALIDATION SAMPLE; MEASUREMENT ERROR; MISCLASSIFICATION; INFERENCE;
D O I
10.1007/s00184-011-0345-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a semiparametric method to estimate logistic regression models with missing both covariates and an outcome variable, and propose two new estimators. The first, which is based solely on the validation set, is an extension of the validation likelihood estimator of Breslow and Cain (Biometrika 75:11-20, 1988). The second is a joint conditional likelihood estimator based on the validation and non-validation data sets. Both estimators are semiparametric as they do not require any model assumptions regarding the missing data mechanism nor the specification of the conditional distribution of the missing covariates given the observed covariates. The asymptotic distribution theory is developed under the assumption that all covariate variables are categorical. The finite-sample properties of the proposed estimators are investigated through simulation studies showing that the joint conditional likelihood estimator is the most efficient. A cable TV survey data set from Taiwan is used to illustrate the practical use of the proposed methodology.
引用
收藏
页码:621 / 653
页数:33
相关论文
共 50 条