Latent Network Estimation and Variable Selection for Compositional Data Via Variational EM

Cited by: 15
Authors
Osborne, Nathan [1]
Peterson, Christine B. [2]
Vannucci, Marina [1]
Affiliations
[1] Rice Univ, Dept Stat, Houston, TX 77251 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
Keywords
Bayesian hierarchical model; Count data; EM algorithm; Graphical model; Microbiome data; Variational inference; BAYESIAN-INFERENCE; PROBIT MODELS; REGRESSION; GRAPHS; LASSO;
DOI
10.1080/10618600.2021.1935971
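Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes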
020208 ; 070103 ; 0714 ;
Abstract
Network estimation and variable selection have been extensively studied in the statistical literature, but only recently have those two challenges been addressed simultaneously. In this article, we seek to develop a novel method to simultaneously estimate network interactions and associations to relevant covariates for count data, and specifically for compositional data, which have a fixed sum constraint. We use a hierarchical Bayesian model with latent layers and employ spike-and-slab priors for both edge and covariate selection. For posterior inference, we develop a novel variational inference scheme with an expectation-maximization step, to enable efficient estimation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in its accuracy of network recovery. We show the practical utility of our model via an application to microbiome data. The human microbiome has been shown to contribute to many of the functions of the human body, and also to be linked with a number of diseases. In our application, we seek to better understand the interaction between microbes and relevant covariates, as well as the interaction of microbes with each other. We call our algorithm simultaneous inference for networks and covariates and provide a Python implementation, which is available online.
Pages: 163 - 175
Page count: 13
Related papers
50 items in total
  • [41] Order selection and sparsity in latent variable models via the ordered factor LASSO
    Hui, Francis K. C.
    Tanaka, Emi
    Warton, David I.
    BIOMETRICS, 2018, 74 (04) : 1311 - 1319
  • [42] Estimation and variable selection via frailty models with penalized likelihood
    Androulakis, E.
    Koukouvinos, C.
    Vonta, F.
    STATISTICS IN MEDICINE, 2012, 31 (20) : 2223 - 2239
  • [43] Variable selection and estimation for multivariate panel count data via the seamless-L0 penalty
    Zhang, Haixiang
    Sun, Jianguo
    Wang, Dehui
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2013, 41 (02) : 368 - 385
  • [44] Nonnegative estimation and variable selection via adaptive elastic-net for high-dimensional data
    Li, Ning
    Yang, Hu
    Yang, Jing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (12) : 4263 - 4279
  • [45] Simultaneous variable selection and estimation for survival data via the Gaussian seamless-L0 penalty
    Liu, Zili
    Wang, Hong
    STATISTICS IN MEDICINE, 2024, 43 (08) : 1509 - 1526
  • [46] Representation Learning for Dynamic Functional Connectivities via Variational Dynamic Graph Latent Variable Models
    Huang, Yicong
    Yu, Zhuliang
    ENTROPY, 2022, 24 (02)
  • [47] Hyperspectral Image Denoising via Clustering-Based Latent Variable in Variational Bayesian Framework
    Azimpour, Peyman
    Bahraini, Tahereh
    Yazdi, Hadi Sadoghi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (04) : 3266 - 3276
  • [48] Heterogenous data fusion via a probabilistic latent-variable model
    Yu, K
    Tresp, V
    ORGANIC AND PERVASIVE COMPUTING - ARCS 2004, 2004, 2981 : 20 - 30
  • [49] Maximum likelihood estimation for discrete latent variable models via evolutionary algorithms
    Brusa, Luca
    Pennoni, Fulvia
    Bartolucci, Francesco
    STATISTICS AND COMPUTING, 2024, 34 (02)
  • [50] Simultaneous treatment effect estimation and variable selection for observational data
    Ma, Eun-Yeol
    Lee, Uichin
    Kim, Heeyoung
    IISE TRANSACTIONS, 2025, 57 (04) : 380 - 392