Jackknife model averaging for linear regression models with missing responses

被引:0
|
作者
Zeng, Jie [1 ]
Cheng, Weihu [2 ]
Hu, Guozhi [1 ]
机构
[1] Hefei Normal Univ, Sch Math & Stat, Hefei, Peoples R China
[2] Beijing Univ Technol, Fac Sci, Beijing, Peoples R China
关键词
Asymptotic optimality; Cross-validation; Model averaging; Missing data; Covariate balancing propensity score; INFORMATION CRITERION; EMPIRICAL LIKELIHOOD; CROSS-VALIDATION; PROPENSITY-SCORE; SELECTION; PREDICTION; INFERENCE;
D O I
10.1007/s42952-024-00259-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider model averaging estimation problem in the linear regression model with missing response data, that allows for model misspecification. Based on the 'complete' data set for the response variable after inverse propensity score weighted imputation, we construct a leave-one-out cross-validation criterion for allocating model weights, where the propensity score model is estimated by the covariate balancing propensity score method. We derive some theoretical results to justify the proposed strategy. Firstly, when all candidate outcome regression models are misspecified, our procedures are proved to achieve optimality in terms of asymptotically minimizing the squared loss. Secondly, when the true outcome regression model is among the set of candidate models, the resulting model averaging estimators of the regression parameters are shown to be root-n consistent. Simulation studies provide evidence of the superiority of our methods over other existing model averaging methods, even when the propensity score model is misspecified. As an illustration, the approach is further applied to study the CD4 data.
引用
收藏
页码:583 / 616
页数:34
相关论文
共 50 条