EM algorithm in Gaussian copula with missing data

被引:42
|
作者
Ding, Wei [1 ]
Song, Peter X. -K. [2 ]
机构
[1] Barclays Investment Bank, 745 7th Ave, New York, NY 10019 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Gaussian copula; EM algorithm; Misaligned missing data; Regression; IMPUTATION;
D O I
10.1016/j.csda.2016.01.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Rank-based correlation is widely used to measure dependence between variables when their marginal distributions are skewed. Estimation of such correlation is challenged by both the presence of missing data and the need for adjusting for confounding factors. In this paper, we consider a unified framework of Gaussian copula regression that enables us to estimate either Pearson correlation or rank-based correlation (e.g. Kendall's tau or Spearman's rho), depending on the types of marginal distributions. To adjust for confounding covariates, we utilize marginal regression models with univariate location-scale family distributions. We establish the EM algorithm for estimation of both correlation and regression parameters with missing values. For implementation, we propose an effective peeling procedure to carry out iterations required by the EM algorithm. We compare the performance of the EM algorithm method to the traditional multiple imputation approach through simulation studies. For structured types of correlations, such as exchangeable or first-order auto-regressive (AR-1) correlation, the EM algorithm outperforms the multiple imputation approach in terms of both estimation bias and efficiency. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] A penalized EM algorithm incorporating missing data mechanism for Gaussian parameter estimation
    Chen, Lin S.
    Prentice, Ross L.
    Wang, Pei
    [J]. BIOMETRICS, 2014, 70 (02) : 312 - 322
  • [2] Missing Value Imputation for Mixed Data via Gaussian Copula
    Zhao, Yuxuan
    Udell, Madeleine
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 636 - 646
  • [3] Quantile regression via the EM algorithm for joint modeling of mixed discrete and continuous data based on Gaussian copula
    S. Ghasemzadeh
    M. Ganjali
    T. Baghfalaki
    [J]. Statistical Methods & Applications, 2022, 31 : 1181 - 1202
  • [4] Copula Gaussian graphical models with penalized ascent Monte Carlo EM algorithm
    Abegaz, Fentaw
    Wit, Ernst
    [J]. STATISTICA NEERLANDICA, 2015, 69 (04) : 419 - 441
  • [5] Quantile regression via the EM algorithm for joint modeling of mixed discrete and continuous data based on Gaussian copula
    Ghasemzadeh, S.
    Ganjali, M.
    Baghfalaki, T.
    [J]. STATISTICAL METHODS AND APPLICATIONS, 2022, 31 (05): : 1181 - 1202
  • [6] Imputation Algorithm Based on Copula for Missing Value in Timeseries Data
    Afrianti, Y. S.
    Indratno, S. W.
    Pasaribu, U. S.
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON TECHNOLOGY, INFORMATICS, MANAGEMENT, ENGINEERING, AND ENVIRONMENT (TIME-E 2014), 2014, : 252 - 257
  • [7] Adjusting the EM algorithm for design of experiments with missing data
    Dodge, Y
    [J]. ITI 2004: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2004, : 9 - 12
  • [8] Identification of nonlinear systems with missing data by the EM algorithm
    Tanaka, M
    Dai, JS
    [J]. (SYSID'97): SYSTEM IDENTIFICATION, VOLS 1-3, 1998, : 645 - 650
  • [9] THE EM ALGORITHM FOR GRAPHICAL ASSOCIATION MODELS WITH MISSING DATA
    LAURITZEN, SL
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1995, 19 (02) : 191 - 201
  • [10] Gaussian Copula Precision Estimation with Missing Values
    Wang, Huahua
    Fazayeli, Farideh
    Chatterjee, Soumyadeep
    Banerjee, Arindam
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 978 - 986