Graphical Representation of Missing Data Problems

被引:46
|
作者
Thoemmes, Felix [1 ]
Mohan, Karthika [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
auxiliary variables; full information; graphical models; maximum likelihood; missing data; multiple imputation; MULTIPLE IMPUTATION; CAUSAL;
D O I
10.1080/10705511.2014.937378
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Rubin's classic missingness mechanisms are central to handling missing data and minimizing biases that can arise due to missingness. However, the formulaic expressions that posit certain independencies among missing and observed data are difficult to grasp. As a result, applied researchers often rely on informal translations of these assumptions. We present a graphical representation of missing data mechanism, formalized in Mohan, Pearl, and Tian (2013). We show that graphical models provide a tool for comprehending, encoding, and communicating assumptions about the missingness process. Furthermore, we demonstrate on several examples how graph-theoretical criteria can determine if biases due to missing data might emerge in some estimates of interests and which auxiliary variables are needed to control for such biases, given assumptions about the missingness process.
引用
收藏
页码:631 / 642
页数:12
相关论文
共 50 条