Collaborative Double Robust Targeted Maximum Likelihood Estimation

被引:86
|
作者
van der Laan, Mark J. [1 ]
Gruber, Susan [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
关键词
asymptotic linearity; coarsening at random; causal effect; censored data; cross-validation; collaborative double robust; double robust; efficient influence curve; estimating function; estimator selection; influence curve; G-computation; locally efficient; loss-function; marginal structural model; maximum likelihood estimation; model selection; pathwise derivative; semiparametric model; sieve; super efficiency; super-learning; targeted maximum likelihood estimation; targeted nuisance parameter estimator selection; variable importance; DEMYSTIFYING DOUBLE ROBUSTNESS; MARGINAL STRUCTURAL MODELS; ALTERNATIVE STRATEGIES; CAUSAL INFERENCE;
D O I
10.2202/1557-4679.1181
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Collaborative double robust targeted maximum likelihood estimators represent a fundamental further advance over standard targeted maximum likelihood estimators of a pathwise differentiable parameter of a data generating distribution in a semiparametric model, introduced in van der Laan, Rubin (2006). The targeted maximum likelihood approach involves fluctuating an initial estimate of a relevant factor (Q) of the density of the observed data, in order to make a bias/variance tradeoff targeted towards the parameter of interest. The fluctuation involves estimation of a nuisance parameter portion of the likelihood, g. TMLE has been shown to be consistent and asymptotically normally distributed (CAN) under regularity conditions, when either one of these two factors of the likelihood of the data is correctly specified, and it is semiparametric efficient if both are correctly specified. In this article we provide a template for applying collaborative targeted maximum likelihood estimation (C-TMLE) to the estimation of pathwise differentiable parameters in semi-parametric models. The procedure creates a sequence of candidate targeted maximum likelihood estimators based on an initial estimate for Q coupled with a succession of increasingly non-parametric estimates for g. In a departure from current state of the art nuisance parameter estimation, C-TMLE estimates of g are constructed based on a loss function for the targeted maximum likelihood estimator of the relevant factor Q that uses the nuisance parameter to carry out the fluctuation, instead of a loss function for the nuisance parameter itself. Likelihood-based cross-validation is used to select the best estimator among all candidate TMLE estimators of Q(0) in this sequence. A penalized-likelihood loss function for Q is suggested when the parameter of interest is borderline-identifiable. We present theoretical results for "collaborative double robustness," demonstrating that the collaborative targeted maximum likelihood estimator is CAN even when Q and g are both misspecified, providing that g solves a specified score equation implied by the difference between the Q and the true Q(0). This marks an improvement over the current definition of double robustness in the estimating equation literature. We also establish an asymptotic linearity theorem for the C-DR-TMLE of the target parameter, showing that the C-DR-TMLE is more adaptive to the truth, and, as a consequence, can even be super efficient if the first stage density estimator does an excellent job itself with respect to the target parameter. This research provides a template for targeted efficient and robust loss-based learning of a particular target feature of the probability distribution of the data within large (infinite dimensional) semi-parametric models, while still providing statistical inference in terms of confidence intervals and p-values. This research also breaks with a taboo (e.g., in the propensity score literature in the field of causal inference) on using the relevant part of likelihood to fine-tune the fitting of the nuisance parameter/censoring mechanism/treatment mechanism.
引用
收藏
页数:71
相关论文
共 50 条
  • [41] Robust talker direction estimation based on weighted CSP analysis and maximum likelihood estimation
    Denda, Y
    Nishiura, T
    Yamashita, Y
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 1050 - 1057
  • [42] Robust Pitch Estimation Using l1-regularized Maximum Likelihood Estimation
    Huang, Feng
    Lee, Tan
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 378 - 381
  • [43] Collaborative targeted maximum likelihood estimation for variable importance measure: Illustration for functional outcome prediction in mild traumatic brain injuries
    Pirracchio, Romain
    Yue, John K.
    Manley, Geoffrey T.
    van der Laan, Mark J.
    Hubbard, Alan E.
    Gordon, Wayne A.
    Lingsma, Hester F.
    Maas, Andrew I. R.
    Mukherjee, Pratik
    Okonkwo, David O.
    Schnyer, David M.
    Valadka, Alex B.
    Yuh, Esther L.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (01) : 286 - 297
  • [44] Collaborative Target Tracking in WSNs Based on Maximum Likelihood Estimation and Kalman Filter
    Wang Xingbo
    Zhang Huanshui
    Jiang Xiangyuan
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4946 - 4951
  • [45] Regularized Weighted Collaborative Representation with Maximum Likelihood Estimation for Facial Expression Recognition
    Dang, Juan
    Xu, Hongji
    Ji, Mingyang
    Sun, Junfeng
    2017 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2017), VOL 2, 2017, : 55 - 58
  • [46] Maximum Likelihood Estimation of Asymmetric Double Type II Pareto Distributions
    Daniel Halvarsson
    Journal of Statistical Theory and Practice, 2020, 14
  • [48] Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis
    Helene C. W. Rytgaard
    Mark J. van der Laan
    Lifetime Data Analysis, 2024, 30 : 4 - 33
  • [49] An Application of Targeted Maximum Likelihood Estimation to the Meta-Analysis of Safety Data
    Gruber, Susan
    van der Laan, Mark J.
    BIOMETRICS, 2013, 69 (01) : 254 - 262
  • [50] Maximum likelihood estimation of the double exponential jump-diffusion process
    Ramezani C.A.
    Zeng Y.
    Annals of Finance, 2007, 3 (4) : 487 - 507