Causal Inference with Noisy and Missing Covariates via Matrix Factorization

Authors
Kallus, Nathan [1]
Mao, Xiaojie [1]
Udell, Madeleine [1]
Affiliations
[1] Cornell Univ, Ithaca, NY 14853 USA
Funding
National Science Foundation (USA)
Keywords
COMPLETION; BOUNDS
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Valid causal inference in observational studies often requires controlling for confounders. However, in practice measurements of confounders may be noisy, and can lead to biased estimates of causal effects. We show that we can reduce bias induced by measurement noise using a large number of noisy measurements of the underlying confounders. We propose the use of matrix factorization to infer the confounders from noisy covariates. This flexible and principled framework adapts to missing values, accommodates a wide variety of data types, and can enhance a wide variety of causal inference methods. We bound the error for the induced average treatment effect estimator and show it is consistent in a linear regression setting, using Exponential Family Matrix Completion preprocessing. We demonstrate the effectiveness of the proposed procedure in numerical experiments with both synthetic data and real clinical data.
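The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it swaps in a plain truncated SVD on fully observed Gaussian covariates for the Exponential Family Matrix Completion step, and it does not handle missing values. The functions `simulate`, `svd_factors`, and `ols_ate`, and all parameter values, are hypothetical names and choices made for illustration only.

```python
import numpy as np

def simulate(n=2000, p=100, r=3, tau=2.0, noise_sd=2.0, seed=0):
    """Synthetic data: r latent confounders seen only through p noisy covariates."""
    rng = np.random.default_rng(seed)
    U = rng.normal(size=(n, r))                        # latent confounders (unobserved)
    V = rng.normal(size=(p, r))                        # loadings
    X = U @ V.T + noise_sd * rng.normal(size=(n, p))   # noisy covariates
    propensity = 1.0 / (1.0 + np.exp(-U.sum(axis=1)))  # treatment depends on U
    T = (rng.random(n) < propensity).astype(float)
    Y = tau * T + 2.0 * U.sum(axis=1) + rng.normal(size=n)  # outcome depends on U
    return X, T, Y

def svd_factors(X, r):
    """Rank-r factor scores from a truncated SVD of the centered covariates.
    Assumption: a simple stand-in for the matrix factorization preprocessing."""
    Xc = X - X.mean(axis=0)
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :r] * s[:r]

def ols_ate(T, Z, Y):
    """Coefficient on T in a linear regression of Y on [1, T, Z]."""
    D = np.column_stack([np.ones(len(T)), T, Z])
    coef, *_ = np.linalg.lstsq(D, Y, rcond=None)
    return coef[1]

X, T, Y = simulate()
naive = Y[T == 1].mean() - Y[T == 0].mean()   # unadjusted difference in means
adjusted = ols_ate(T, svd_factors(X, 3), Y)   # adjusted for inferred factors
print(f"true effect 2.0, unadjusted {naive:.2f}, factor-adjusted {adjusted:.2f}")
```

In this simulated setting the unadjusted difference in means absorbs the confounding, while regressing on the inferred factors should bring the estimate much closer to the true effect. The rank `r` is taken as known here; in practice it would have to be chosen.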
Pages: 12