Robust Estimation of Gaussian Copula Causal Structure from Mixed Data with Missing Values

被引:3
|
作者
Cui, Ruifei [1 ]
Groot, Perry [1 ]
Heskes, Tom [1 ]
机构
[1] Radboud Univ Nijmegen, Inst Comp & Informat Sci, Nijmegen, Netherlands
关键词
DIRECTED ACYCLIC GRAPHS; PC-ALGORITHM; MODELS;
D O I
10.1109/ICDM.2017.101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of causal structure learning from data with missing values, assumed to be drawn from a Gaussian copula model. First, we extend the 'Rank PC' algorithm, designed for Gaussian copula models with purely continuous data (so-called nonparanormal models), to incomplete data by applying rank correlation to pairwise complete observations and replacing the sample size with an effective sample size in the conditional independence tests to account for the information loss from missing values. The resulting approach works when the data are missing completely at random (MCAR). Then, we propose a Gibbs sampling procedure to draw correlation matrix samples from mixed data under missingness at random (MAR). These samples are translated into an average correlation matrix, and an effective sample size, resulting in the 'Copula PC' algorithm for incomplete data. Simulation study shows that: 1) the usage of the effective sample size significantly improves the performance of 'Rank PC' and 'Copula PC'; 2) ` Copula PC' estimates a more accurate correlation matrix and causal structure than 'Rank PC' under MCAR and, even more so, under MAR. Also, we illustrate our methods on a real-world data set about gene expression.
引用
收藏
页码:835 / 840
页数:6
相关论文
共 50 条
  • [1] Learning causal structure from mixed data with missing values using Gaussian copula models
    Ruifei Cui
    Perry Groot
    Tom Heskes
    [J]. Statistics and Computing, 2019, 29 : 311 - 333
  • [2] Learning causal structure from mixed data with missing values using Gaussian copula models
    Cui, Ruifei
    Groot, Perry
    Heskes, Tom
    [J]. STATISTICS AND COMPUTING, 2019, 29 (02) : 311 - 333
  • [3] Gaussian Copula Precision Estimation with Missing Values
    Wang, Huahua
    Fazayeli, Farideh
    Chatterjee, Soumyadeep
    Banerjee, Arindam
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 978 - 986
  • [4] Missing Value Imputation for Mixed Data via Gaussian Copula
    Zhao, Yuxuan
    Udell, Madeleine
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 636 - 646
  • [5] Doubly robust estimation in missing data and causal inference models
    Bang, H
    [J]. BIOMETRICS, 2005, 61 (04) : 962 - 972
  • [6] EM algorithm in Gaussian copula with missing data
    Ding, Wei
    Song, Peter X. -K.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 101 : 1 - 11
  • [8] Robust and efficient estimation for the treatment effect in causal inference and missing data problems
    Lin, Huazhen
    Zhou, Fanyin
    Wang, Qiuxia
    Zhou, Ling
    Qin, Jing
    [J]. JOURNAL OF ECONOMETRICS, 2018, 205 (02) : 363 - 380
  • [9] Improved double-robust estimation in missing data and causal inference models
    Rotnitzky, Andrea
    Lei, Quanhong
    Sued, Mariela
    Robins, James M.
    [J]. BIOMETRIKA, 2012, 99 (02) : 439 - 456
  • [10] Gaussian Copula Mixed Models with Non-Ignorable Missing Outcomes
    Jafari, N.
    Tabrizi, E.
    Samani, E. Bahrami
    [J]. APPLICATIONS AND APPLIED MATHEMATICS-AN INTERNATIONAL JOURNAL, 2015, 10 (01): : 81 - 105