Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse

被引:9
|
作者
Zhao, Jiwei [1 ]
Ma, Yanyuan [2 ]
机构
[1] SUNY Buffalo, Dept Biostat, 719 Kimball Tower, Buffalo, NY 14214 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Efficiency; Missing data; Nonignorable nonresponse; Pseudolikelihood estimator; MAXIMUM-LIKELIHOOD-ESTIMATION; EXPOSURE; MODELS; HEALTH;
D O I
10.1093/biomet/asy007
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Tang et al. (2003) considered a regression model with missing response, where the missingness mechanism depends on the value of the response variable and hence is nonignorable. They proposed three pseudolikelihood estimators, based on different treatments of the probability distribution of the completely observed covariates. The first assumes the distribution of the covariate to be known, the second estimates this distribution parametrically, and the third estimates the distribution nonparametrically. While it is not hard to show that the second estimator is more efficient than the first, Tang et al. (2003) only conjectured that the third estimator is more efficient than the first two. In this paper, we investigate the asymptotic behaviour of the third estimator by deriving a closed-form representation of its asymptotic variance. We then prove that the third estimator is more efficient than the other two. Our result can be straightforwardly applied to missingness mechanisms that are more general than that in Tang et al. (2003).
引用
收藏
页码:479 / 486
页数:8
相关论文
共 50 条