A conditional model for incomplete covariates in parametric regression models

Cited by: 100
Authors
Lipsitz, SR [1]
Ibrahim, JG [1]
Affiliations
[1] HARVARD UNIV,SCH PUBL HLTH,DEPT BIOSTAT,BOSTON,MA 02115
Funding
National Institutes of Health (USA);
Keywords
EM-algorithm; missing at random; non-monotone missing data;
DOI
10.1093/biomet/83.4.916
Chinese Library Classification number
Q [Biological sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Incomplete covariate data arise in many data sets. When the missing covariates are categorical, a useful technique for obtaining parameter estimates is the EM algorithm by the method of weights proposed in Ibrahim (1990). This method requires the estimation of many nuisance parameters for the distribution of the covariates. Unfortunately, in data sets where the percentage of missing data is high and the missing covariate patterns are highly non-monotone, the estimates of the nuisance parameters can lead to highly unstable estimates of the parameters of interest. We propose a conditional model for the covariate distribution that has several modelling advantages for the E-step and provides a reduction in the number of nuisance parameters, thus providing more stable estimates in finite samples. We present a clinical trials example with six covariates, five of which have some missing values.
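The two ideas named in the abstract, a conditional model for the covariate distribution and the E-step of the method of weights, can be sketched in symbols. The notation below is an assumption made for illustration and does not appear in this record: $y_i$ is the response, $x_i = (x_{i1}, \dots, x_{ip})$ the categorical covariates, $\beta$ the regression parameters of interest, $\alpha = (\alpha_1, \dots, \alpha_p)$ the nuisance parameters, and $x_i^{(j)}$ the $j$-th possible completion of the missing covariates of subject $i$.

```latex
% Assumed notation; a sketch, not a quotation from the paper.
% Conditional model: the joint covariate distribution is written as a
% product of one-dimensional conditional distributions.
\[
  p(x_{i1}, \dots, x_{ip} \mid \alpha)
  = p(x_{ip} \mid x_{i1}, \dots, x_{i,p-1}, \alpha_p)
    \cdots
    p(x_{i2} \mid x_{i1}, \alpha_2)\,
    p(x_{i1} \mid \alpha_1).
\]
% E-step weights: the posterior probability of each completion
% x_i^{(j)} given the observed data of subject i.
\[
  w_{ij}
  = \frac{p(y_i \mid x_i^{(j)}, \beta)\, p(x_i^{(j)} \mid \alpha)}
         {\sum_{k} p(y_i \mid x_i^{(k)}, \beta)\, p(x_i^{(k)} \mid \alpha)}.
\]
```

As a hypothetical illustration of the reduction in nuisance parameters: if all $p$ covariates were binary, a saturated multinomial model would need $2^p - 1$ parameters, whereas a sequence of main-effects logistic regressions would need $1 + 2 + \cdots + p = p(p+1)/2$. For $p = 6$ that is 63 versus 21. The binary, main-effects setting is assumed here and is not stated in the abstract.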
Pages: 916-922
Number of pages: 7