Latent Network Estimation and Variable Selection for Compositional Data Via Variational EM

Cited by: 15
Authors
Osborne, Nathan [1 ]
Peterson, Christine B. [2 ]
Vannucci, Marina [1 ]
Affiliations
[1] Rice Univ, Dept Stat, Houston, TX 77251 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
Keywords
Bayesian hierarchical model; Count data; EM algorithm; Graphical model; Microbiome data; Variational inference; BAYESIAN-INFERENCE; PROBIT MODELS; REGRESSION; GRAPHS; LASSO;
DOI
10.1080/10618600.2021.1935971
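Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code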
020208 ; 070103 ; 0714 ;
Abstract
Network estimation and variable selection have been extensively studied in the statistical literature, but only recently have those two challenges been addressed simultaneously. In this article, we seek to develop a novel method to simultaneously estimate network interactions and associations with relevant covariates for count data, and specifically for compositional data, which have a fixed sum constraint. We use a hierarchical Bayesian model with latent layers and employ spike-and-slab priors for both edge and covariate selection. For posterior inference, we develop a novel variational inference scheme with an expectation-maximization step, to enable efficient estimation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in its accuracy of network recovery. We show the practical utility of our model via an application to microbiome data. The human microbiome has been shown to contribute to many of the functions of the human body, and also to be linked with a number of diseases. In our application, we seek to better understand the interaction between microbes and relevant covariates, as well as the interaction of microbes with each other. We call our algorithm simultaneous inference for networks and covariates and provide a Python implementation, which is available online.
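Two ideas named in the abstract can be illustrated with a minimal sketch: the fixed-sum constraint of compositional data, and the kind of inclusion weight that a spike-and-slab prior yields in an EM-style expectation step. This is not the authors' implementation. The function names, the choice of a continuous two-normal mixture for the spike and slab, and all default values (`prior_incl`, `spike_sd`, `slab_sd`) are assumptions made for illustration only.

```python
import math


def to_composition(counts):
    """Normalize raw counts so the parts sum to 1 (the fixed-sum constraint)."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must have a positive total")
    return [c / total for c in counts]


def normal_pdf(x, sd):
    """Density at x of a zero-mean normal with standard deviation sd."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def inclusion_probability(beta, prior_incl=0.5, spike_sd=0.1, slab_sd=3.0):
    """Probability that a coefficient came from the slab rather than the spike.

    Assumes (hypothetically) a mixture of two zero-mean normals: a narrow
    spike for negligible effects and a wide slab for relevant ones.
    """
    slab = prior_incl * normal_pdf(beta, slab_sd)
    spike = (1.0 - prior_incl) * normal_pdf(beta, spike_sd)
    return slab / (slab + spike)


# Made-up counts for three taxa in one sample.
composition = to_composition([120, 30, 50])
print(composition)

# A coefficient near zero is attributed to the spike, a large one to the slab.
for beta in (0.0, 2.0):
    print(beta, round(inclusion_probability(beta), 4))
```

In a scheme of this kind, the inclusion weights would then feed a maximization step that shrinks coefficients with low weight more strongly. How the paper's variational updates combine with that step is not specified in the abstract.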
Pages: 163-175
Number of pages: 13