Bayesian nonparametric generative models for causal inference with missing at random covariates

被引:25
|
作者
Roy, Jason [1 ]
Lum, Kirsten J. [1 ]
Zeldow, Bret [1 ]
Dworkin, Jordan D. [1 ]
Re, Vincent Lo [1 ,2 ]
Daniels, Michael J. [3 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, Dept Med, Philadelphia, PA 19104 USA
[3] Univ Florida, Dept Stat, Gainesville, FL 32611 USA
关键词
Bayesian modeling; Causal effect; Cluster; Enriched Dirichlet process mixture model; Missing data; Observational studies; MARGINAL STRUCTURAL MODELS; DIRICHLET PROCESS MIXTURES;
D O I
10.1111/biom.12875
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a general Bayesian nonparametric (BNP) approach to causal inference in the point treatment setting. The joint distribution of the observed data (outcome, treatment, and confounders) is modeled using an enriched Dirichlet process. The combination of the observed data model and causal assumptions allows us to identify any type of causal effect-differences, ratios, or quantile effects, either marginally or for subpopulations of interest. The proposed BNP model is well-suited for causal inference problems, as it does not require parametric assumptions about the distribution of confounders and naturally leads to a computationally efficient Gibbs sampling algorithm. By flexibly modeling the joint distribution, we are also able to impute (via data augmentation) values for missing covariates within the algorithm under an assumption of ignorable missingness, obviating the need to create separate imputed data sets. This approach for imputing the missing covariates has the additional advantage of guaranteeing congeniality between the imputation model and the analysis model, and because we use a BNP approach, parametric models are avoided for imputation. The performance of the method is assessed using simulation studies. The method is applied to data from a cohort study of human immunodeficiency virus/hepatitis C virus co-infected patients.
引用
收藏
页码:1193 / 1202
页数:10
相关论文
共 50 条
  • [1] Bayesian nonparametric for causal inference and missing data
    Chen, Li-Pang
    [J]. BIOMETRICS, 2024, 80 (01)
  • [2] Nonparametric causal inference with confounders missing not at random
    Shan, Jiawei
    Yan, Xinyu
    [J]. STATISTICA NEERLANDICA, 2024,
  • [3] Bayesian methods for generalized linear models with covariates missing at random
    Ibrahim, JG
    Chen, MH
    Lipsitz, SR
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2002, 30 (01): : 55 - 78
  • [4] Bayesian Nonparametric Modeling for Causal Inference
    Hill, Jennifer L.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (01) : 217 - 240
  • [5] A Bayesian nonparametric approach to causal inference on quantiles
    Xu, Dandan
    Daniels, Michael J.
    Winterstein, Almut G.
    [J]. BIOMETRICS, 2018, 74 (03) : 986 - 996
  • [6] The how and why of Bayesian nonparametric causal inference
    Linero, Antonio R.
    Antonelli, Joseph L.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (01)
  • [7] Regression Analysis with Covariates Missing at Random: A Piece-wise Nonparametric Model for Missing Covariates
    Zhao, Yang
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (20) : 3736 - 3744
  • [8] Causal inference with confounders missing not at random
    Yang, S.
    Wang, L.
    Ding, P.
    [J]. BIOMETRIKA, 2019, 106 (04) : 875 - 888
  • [9] Efficient Nonparametric Causal Inference with Missing Exposure Information
    Kennedy, Edward H.
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2020, 16 (01):
  • [10] Causal Inference with Noisy and Missing Covariates via Matrix Factorization
    Kallus, Nathan
    Mao, Xiaojie
    Udell, Madeleine
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31