Tuning Random Forests for Causal Inference under Cluster-Level Unmeasured Confounding

被引：4

作者：

Suk, Youmi ^{[1
]}

Kang, Hyunseung ^{[2
]}

机构：

[1] Univ Virginia, Sch Data Sci, 31 Bonnycastle Dr, Charlottesville, VA 22903 USA

[2] Univ Wisconsin Madison, Dept Stat, Madison, WI USA

来源：

MULTIVARIATE BEHAVIORAL RESEARCH | 2023年 / 58卷 / 02期

关键词：

Causal inference; machine learning methods; unmeasured variables; omitted variable bias; fixed effects models; PROPENSITY SCORE; HETEROGENEITY; EXPOSURE;

D O I：

10.1080/00273171.2021.1994364

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Recently, there has been growing interest in using machine learning methods for causal inference due to their automatic and flexible ability to model the propensity score and the outcome model. However, almost all the machine learning methods for causal inference have been studied under the assumption of no unmeasured confounding and there is little work on handling omitted/unmeasured variable bias. This paper focuses on a machine learning method based on random forests known as Causal Forests and presents five simple modifications for tuning Causal Forests so that they are robust to cluster-level unmeasured confounding. Our simulation study finds that adjusting the default tuning procedure with the propensity score from fixed effects logistic regression or using variables that are centered to their cluster means produces estimates that are more robust to cluster-level unmeasured confounding. Also, when these parametric propensity score models are mis-specified, our modified machine learning methods remain robust to bias from cluster-level unmeasured confounders compared to existing parametric approaches based on propensity score weighting. We conclude by demonstrating our proposals in a real data study concerning the effect of taking an eighth-grade algebra course on math achievement scores from the Early Childhood Longitudinal Study.

引用

页码：408 / 440

页数：33

共 34 条

[1] Robust Machine Learning for Treatment Effects in Multilevel Observational Studies Under Cluster-level Unmeasured Confounding
Suk, Youmi
Kang, Hyunseung
[J]. PSYCHOMETRIKA, 2022, 87 (01) : 310 - 343
[2] Robust Machine Learning for Treatment Effects in Multilevel Observational Studies Under Cluster-level Unmeasured Confounding
Youmi Suk
Hyunseung Kang
[J]. Psychometrika, 2022, 87 : 310 - 343
[3] On the use of between-within models to adjust for confounding due to unmeasured cluster-level covariates
Brumback, Babette A.
Li, Li
Cai, Zhuangyu
[J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (05) : 3841 - 3854
[4] Sensitivity Analysis for Causal Inference under Unmeasured Confounding and Measurement Error Problems
Diaz, Ivan
van der Laan, Mark J.
[J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2013, 9 (02): : 149 - 160
[5] Differentiable Causal Discovery Under Unmeasured Confounding
Bhattacharya, Rohit
Nagarajan, Tushar
Malinsky, Daniel
Shpitser, Ilya
[J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[6] Causal forests versus inverse probability weighting for addressing cluster-level confounding in medical device and surgical epidemiology: A simulation study
Du, Mike
Khalid, Sara
Strauss, Victoria
Prieto-Alhambra, Daniel
[J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 309 - 309
[7] Cluster-level statistical inference in fMRI datasets: The unexpected behavior of random fields in high dimensions
Bansal, Ravi
Peterson, Bradley S.
[J]. MAGNETIC RESONANCE IMAGING, 2018, 49 : 101 - 115
[8] Cluster failure or power failure? Evaluating sensitivity in cluster-level inference
Noble, Stephanie
Scheinost, Dustin
Constable, R. Todd
[J]. NEUROIMAGE, 2020, 209
[9] Cluster-level statistical inference in fMRI datasets: The unexpected behavior of random fields in high dimensions
Bansal, Ravi
Peterson, Bradley S.
[J]. MAGNETIC RESONANCE IMAGING, 2022, 87 : 19 - 31
[10] Sensitivity analysis of unmeasured confounding in causal inference based on exponential tilting and super learner
Zhou, Mi
Yao, Weixin
[J]. JOURNAL OF APPLIED STATISTICS, 2023, 50 (03) : 744 - 760

← 1 2 3 4 →