Targeted Maximum Likelihood Based Causal Inference: Part I

被引:62
|
作者
van der Laan, Mark J. [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
关键词
causal effect; causal graph; censored data; cross-validation; collaborative double robust; double robust; dynamic treatment regimens; efficient influence curve; estimating function; estimator selection; locally efficient; loss function; marginal structural models for dynamic treatments; maximum likelihood estimation; model selection; pathwise derivative; randomized controlled trials; sieve; super-learning; targeted maximum likelihood estimation; MARGINAL STRUCTURAL MODELS; ESTIMATORS;
D O I
10.2202/1557-4679.1211
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Given causal graph assumptions, intervention-specific counterfactual distributions of the data can be defined by the so called G-computation formula, which is obtained by carrying out these interventions on the likelihood of the data factorized according to the causal graph. The obtained G-computation formula represents the counterfactual distribution the data would have had if this intervention would have been enforced on the system generating the data. A causal effect of interest can now be defined as some difference between these counterfactual distributions indexed by different interventions. For example, the interventions can represent static treatment regimens or individualized treatment rules that assign treatment in response to time-dependent covariates, and the causal effects could be defined in terms of features of the mean of the treatment-regimen specific counterfactual outcome of interest as a function of the corresponding treatment regimens. Such features could be defined nonparametrically in terms of so called (nonparametric) marginal structural models for static or individualized treatment rules, whose parameters can be thought of as (smooth) summary measures of differences between the treatment regimen specific counterfactual distributions. In this article, we develop a particular targeted maximum likelihood estimator of causal effects of multiple time point interventions. This involves the use of loss-based super-learning to obtain an initial estimate of the unknown factors of the G-computation formula, and subsequently, applying a target-parameter specific optimal fluctuation function (least favorable parametric submodel) to each estimated factor, estimating the fluctuation parameter(s) with maximum likelihood estimation, and iterating this updating step of the initial factor till convergence. This iterative targeted maximum likelihood updating step makes the resulting estimator of the causal effect double robust in the sense that it is consistent if either the initial estimator is consistent, or the estimator of the optimal fluctuation function is consistent. The optimal fluctuation function is correctly specified if the conditional distributions of the nodes in the causal graph one intervenes upon are correctly specified. The latter conditional distributions often comprise the so called treatment and censoring mechanism. Selection among different targeted maximum likelihood estimators (e. g., indexed by different initial estimators) can be based on loss-based cross-validation such as likelihood based cross-validation or cross-validation based on another appropriate loss function for the distribution of the data. Some specific loss functions are mentioned in this article. Subsequently, a variety of interesting observations about this targeted maximum likelihood estimation procedure are made. This article provides the basis for the subsequent companion Part II-article in which concrete demonstrations for the implementation of the targeted MLE in complex causal effect estimation problems are provided.
引用
收藏
页数:45
相关论文
共 50 条
  • [1] Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies
    Schuler, Megan S.
    Rose, Sherri
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2017, 185 (01) : 65 - 73
  • [2] Transfering Targeted Maximum Likelihood Estimation for Causal Inference into Sports Science
    Dijkhuis, Talko B.
    Blaauw, Frank J.
    [J]. ENTROPY, 2022, 24 (08)
  • [3] An Application of Collaborative Targeted Maximum Likelihood Estimation in Causal Inference and Genomics
    Gruber, Susan
    van der Laan, Mark J.
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2010, 6 (01):
  • [4] Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis
    Helene C. W. Rytgaard
    Mark J. van der Laan
    [J]. Lifetime Data Analysis, 2024, 30 : 4 - 33
  • [5] Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis
    Rytgaard, Helene C. W.
    van der Laan, Mark J.
    [J]. LIFETIME DATA ANALYSIS, 2024, 30 (01) : 4 - 33
  • [6] Targeted maximum likelihood estimation of causal effects with interference: A simulation study
    Zivich, Paul N.
    Hudgens, Michael G.
    Brookhart, Maurice A.
    Moody, James
    Weber, David J.
    Aiello, Allison E.
    [J]. STATISTICS IN MEDICINE, 2022, 41 (23) : 4554 - 4577
  • [7] A Targeted Maximum Likelihood Estimator of a Causal Effect on a Bounded Continuous Outcome
    Gruber, Susan
    van der Laan, Mark J.
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2010, 6 (01):
  • [8] Empirical Likelihood in Causal Inference
    Zhang, Biao
    [J]. ECONOMETRIC REVIEWS, 2016, 35 (02) : 201 - 231
  • [9] Likelihood-based inference for bounds of causal parameters
    Lee, Woojoo
    Sjolander, Arvid
    Larsson, Anton
    Pawitan, Yudi
    [J]. STATISTICS IN MEDICINE, 2018, 37 (30) : 4695 - 4706
  • [10] SIMPLIFIED MAXIMUM LIKELIHOOD INFERENCE BASED ON THE LIKELIHOOD DECOMPOSITION FOR MISSING DATA
    Jung, Sangah
    Park, Sangun
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2013, 55 (03) : 271 - 283