Gaussian Graphical Model Estimation and Selection for High-Dimensional Incomplete Data Using Multiple Imputation and Horseshoe Estimators

被引:0
|
作者
Zhang, Yunxi [1 ]
Kim, Soeun [2 ]
机构
[1] Univ Mississippi, Med Ctr, Dept Data Sci, 2500 North State St, Jackson, MS 39216 USA
[2] Azusa Pacific Univ, Dept Math Phys & Stat, 901 Alosta Ave, Azusa, CA 91702 USA
关键词
missing data; Gaussian graphical model; sparse precision matrix; horseshoe prior; multiple imputation; Gibbs sampling; Bayesian; VARIABLE SELECTION;
D O I
10.3390/math12121837
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Gaussian graphical models have been widely used to measure the association networks for high-dimensional data; however, most existing methods assume fully observed data. In practice, missing values are inevitable in high-dimensional data and should be handled carefully. Under the Bayesian framework, we propose a regression-based approach to estimating sparse precision matrix for high-dimensional incomplete data. The proposed approach nests multiple imputation and precision matrix estimation with horseshoe estimators in a combined Gibbs sampling process. For fast and efficient selection using horseshoe priors, a post-iteration 2-means clustering strategy is employed. Through extensive simulations, we show the predominant selection and estimation performance of our approach compared to several prevalent methods. We further demonstrate the proposed approach to incomplete genetics data compared to alternative methods applied to completed data.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Variable selection for high-dimensional incomplete data using horseshoe estimation with data augmentation
    Zhang, Yunxi
    Kim, Soeun
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (12) : 4235 - 4251
  • [2] Multiple imputation and analysis for high-dimensional incomplete proteomics data
    Yin, Xiaoyan
    Levy, Daniel
    Willinger, Christine
    Adourian, Aram
    Larson, Martin G.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (08) : 1315 - 1326
  • [3] GRAPHICAL LASSO FOR HIGH-DIMENSIONAL COMPLEX GAUSSIAN GRAPHICAL MODEL SELECTION
    Tugnait, Jitendra K.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2952 - 2956
  • [4] Multiple imputation for high-dimensional mixed incomplete continuous and binary data
    He, Ren
    Belin, Thomas
    [J]. STATISTICS IN MEDICINE, 2014, 33 (13) : 2251 - 2262
  • [5] High-dimensional missing data imputation via undirected graphical model
    Lee, Yoonah
    Park, Seongoh
    [J]. STATISTICS AND COMPUTING, 2024, 34 (05)
  • [6] Joint estimation of multiple high-dimensional Gaussian copula graphical models
    He, Yong
    Zhang, Xinsheng
    Ji, Jiadong
    Liu, Bin
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2017, 59 (03) : 289 - 310
  • [7] High-dimensional joint estimation of multiple directed Gaussian graphical models
    Wang, Yuhao
    Segarra, Santiago
    Uhler, Caroline
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 2439 - 2483
  • [8] Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud
    Bu, Fanyu
    Chen, Zhikui
    Zhang, Qingchen
    Yang, Laurence T.
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (08): : 2977 - 2990
  • [9] Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud
    Fanyu Bu
    Zhikui Chen
    Qingchen Zhang
    Laurence T. Yang
    [J]. The Journal of Supercomputing, 2016, 72 : 2977 - 2990
  • [10] Imputation for incomplete high-dimensional multivariate normal data using a common factor model
    Song, JW
    Belin, TR
    [J]. STATISTICS IN MEDICINE, 2004, 23 (18) : 2827 - 2843