Causal Inference for a Population of Causally Connected Units

被引:34
|
作者
Van der Laan, Mark J. [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
networks; causal inference; targeted maximum likelihood estimation; stochastic intervention; efficient influence curve;
D O I
10.1515/jci-2013-0002
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Suppose that we observe a population of causally connected units. On each unit at each time-point on a grid we observe a set of other units the unit is potentially connected with, and a unit-specific longitudinal data structure consisting of baseline and time-dependent covariates, a time-dependent treatment, and a final outcome of interest. The target quantity of interest is defined as the mean outcome for this group of units if the exposures of the units would be probabilistically assigned according to a known specified mechanism, where the latter is called a stochastic intervention. Causal effects of interest are defined as contrasts of the mean of the unit-specific outcomes under different stochastic interventions one wishes to evaluate. This covers a large range of estimation problems from independent units, independent clusters of units, and a single cluster of units in which each unit has a limited number of connections to other units. The allowed dependence includes treatment allocation in response to data on multiple units and so called causal interference as special cases. We present a few motivating classes of examples, propose a structural causal model, define the desired causal quantities, address the identification of these quantities from the observed data, and define maximum likelihood based estimators based on cross-validation. In particular, we present maximum likelihood based super-learning for this network data. Nonetheless, such smoothed/regularized maximum likelihood estimators are not targeted and will thereby be overly bias w.r.t. the target parameter, and, as a consequence, generally not result in asymptotically normally distributed estimators of the statistical target parameter. To formally develop estimation theory, we focus on the simpler case in which the longitudinal data structure is a point-treatment data structure. We formulate a novel targeted maximum likelihood estimator of this estimand and show that the double robustness of the efficient influence curve implies that the bias of the targeted minimum loss-based estimation (TMLE) will be a second-order term involving squared differences of two nuisance parameters. In particular, the TMLE will be consistent if either one of these nuisance parameters is consistently estimated. Due to the causal dependencies between units, the data set may correspond with the realization of a single experiment, so that establishing a (e.g. normal) limit distribution for the targeted maximum likelihood estimators, and corresponding statistical inference, is a challenging topic. We prove two formal theorems establishing the asymptotic normality using advances in weak-convergence theory. We conclude with a discussion and refer to an accompanying technical report for extensions to general longitudinal data structures.
引用
收藏
页码:13 / 74
页数:62
相关论文
共 50 条
  • [1] Causal inference with interfering units for cluster and population level treatment allocation programs
    Papadogeorgou, Georgia
    Mealli, Fabrizia
    Zigler, Corwin M.
    [J]. BIOMETRICS, 2019, 75 (03) : 778 - 787
  • [2] Population heterogeneity and causal inference
    Xie, Yu
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (16) : 6262 - 6268
  • [3] Causal Belief Inference in Multiply Connected Networks
    Boussarsar, Oumaima
    Boukhris, Imen
    Elouedi, Zied
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, IPMU 2016, PT II, 2016, 611 : 291 - 302
  • [4] Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
    Sofrygin, Oleg
    van der Laan, Mark J.
    [J]. JOURNAL OF CAUSAL INFERENCE, 2017, 5 (01)
  • [5] Population intervention models in causal inference
    Hubbard, Alan E.
    Van der Laan, Mark J.
    [J]. BIOMETRIKA, 2008, 95 (01) : 35 - 47
  • [6] Causal and causally separable processes
    Oreshkov, Ognyan
    Giarmatzi, Christina
    [J]. NEW JOURNAL OF PHYSICS, 2016, 18
  • [7] Close (Causally Connected) Cousins? Evidence on the Causal Relationship between Political Trust and Social Trust
    Dinesen, Peter Thisted
    Sonderskov, Kim Mannemar
    Sohlberg, Jacob
    Esaiasson, Peter
    [J]. PUBLIC OPINION QUARTERLY, 2022, 86 (03) : 708 - 721
  • [8] Bridging Finite and Super Population Causal Inference
    Ding, Peng
    Li, Xinran
    Miratrix, Luke W.
    [J]. JOURNAL OF CAUSAL INFERENCE, 2017, 5 (02)
  • [9] Gaussian prepivoting for finite population causal inference
    Cohen, Peter L.
    Fogarty, Colin B.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (02) : 295 - 320
  • [10] Note on the delta method for finite population inference with applications to causal inference
    Pashley, Nicole E.
    [J]. STATISTICS & PROBABILITY LETTERS, 2022, 188