Semiparametric counterfactual density estimation

被引:4
|
作者
Kennedy, E. H. [1 ]
Balakrishnan, S. [1 ]
Wasserman, L. A. [1 ]
机构
[1] Carnegie Mellon Univ, Dept Stat & Data Sci, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Causal inference; Density estimation; Influence function; Model misspecification; Semiparametric theory; CAUSAL INFERENCE; MODELS; DECOMPOSITION; DIVERGENCE; REGRESSION; QUANTILES;
D O I
10.1093/biomet/asad017
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Causal effects are often characterized with averages, which can give an incomplete picture of the underlying counterfactual distributions. Here we consider estimating the entire counterfactual density and generic functionals thereof. We focus on two kinds of target parameters: density approximations and the distance between counterfactual densities. We study nonparametric efficiency bounds, giving results for smooth but otherwise generic models and distances. Importantly, we show how these bounds connect to means of particular nontrivial functions of counterfactuals, linking the problems of density and mean estimation. We propose doubly robust-style estimators, and study their rates of convergence, showing that they can be optimally efficient in large nonparametric models. We also give analogous methods for model selection and aggregation, when many models may be available and of interest. Our results all hold for generic models and distances, but we highlight results for L-2 projections on linear models and Kullbach-Leibler projections on exponential families. Finally, we illustrate our method by estimating the density of the CD4 count among patients with HIV, had all been treated with combination therapy versus zidovudine alone, as well as a density effect. Our methods are implemented in the R package npcausal on GitHub.
引用
收藏
页码:875 / 896
页数:22
相关论文
共 50 条