A ROBUST AND EFFICIENT APPROACH TO CAUSAL INFERENCE BASED ON SPARSE SUFFICIENT DIMENSION REDUCTION

被引:26
|
作者
Ma, Shujie [1 ]
Zhu, Liping [2 ]
Zhang, Zhiwei [1 ]
Tsai, Chih-Ling [3 ]
Carroll, Raymond J. [4 ,5 ]
机构
[1] Univ Calif Riverside, Dept Stat, Riverside, CA 92521 USA
[2] Renmin Univ China, Inst Stat & Big Data, Ctr Appl Stat, Beijing 100872, Peoples R China
[3] Univ Calif Davis, Grad Sch Management, Davis, CA 95616 USA
[4] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[5] Univ Technol, Sch Math Sci, Broadway, NSW 2007, Australia
来源
ANNALS OF STATISTICS | 2019年 / 47卷 / 03期
关键词
Average treatment effect; dimension reduction; high-dimensional data; multiple-index model; outcome regression; semiparametric efficiency; SLICED INVERSE REGRESSION; PROPENSITY SCORE; VEGETABLE INTAKE; MISSING DATA; LARGE-SAMPLE; BODY-MASS; SELECTION; INDEX; ASSOCIATION; MODEL;
D O I
10.1214/18-AOS1722
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A fundamental assumption used in causal inference with observational data is that treatment assignment is ignorable given measured confounding variables. This assumption of no missing confounders is plausible if a large number of baseline covariates are included in the analysis, as we often have no prior knowledge of which variables can be important confounders. Thus, estimation of treatment effects with a large number of covariates has received considerable attention in recent years. Most existing methods require specifying certain parametric models involving the outcome, treatment and confounding variables, and employ a variable selection procedure to identify confounders. However, selection of a proper set of confounders depends on correct specification of the working models. The bias due to model mis-specification and incorrect selection of confounding variables can yield misleading results. We propose a robust and efficient approach for inference about the average treatment effect via a flexible modeling strategy incorporating penalized variable selection. Specifically, we consider an estimator constructed based on an efficient influence function that involves a propensity score and an outcome regression. We then propose a new sparse sufficient dimension reduction method to estimate these two functions without making restrictive parametric modeling assumptions. The proposed estimator of the average treatment effect is asymptotically normal and semiparametrically efficient without the need for variable selection consistency. The proposed methods are illustrated via simulation studies and a biomedical application.
引用
收藏
页码:1505 / 1535
页数:31
相关论文
共 50 条