Identification of microbial interaction network: zero-inflated latent Ising model based approach

被引:1
|
作者
Zhou, Jie [1 ]
Viles, Weston D. [3 ]
Lu, Boran [1 ]
Li, Zhigang [4 ]
Madan, Juliette C. [2 ]
Karagas, Margaret R. [2 ]
Gui, Jiang [1 ]
Hoen, Anne G. [1 ,2 ]
机构
[1] Dartmouth Coll, Geisel Sch Med, Dept Biomed Data Sci, Hanover, NH 03755 USA
[2] Dartmouth Coll, Geisel Sch Med, Dept Epidemiol, Hanover, NH 03755 USA
[3] Univ Southern Maine, Dept Math & Stat, Portland, ME USA
[4] Univ Florida, Dept Biostat, Gainesville, FL USA
基金
美国国家卫生研究院;
关键词
Gut microbiota; Microbial interaction network; Latent Ising model; Dynamic programming; High-dimensional data; Sparse estimation; SELECTION;
D O I
10.1186/s13040-020-00226-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Throughout their lifespans, humans continually interact with the microbial world, including those organisms which live in and on the human body. Research in this domain has revealed the extensive links between the human-associated microbiota and health. In particular, the microbiota of the human gut plays essential roles in digestion, nutrient metabolism, immune maturation and homeostasis, neurological signaling, and endocrine regulation. Microbial interaction networks are frequently estimated from data and are an indispensable tool for representing and understanding the conditional correlation between the microbes. In this high-dimensional setting, zero-inflation and unit-sum constraint for relative abundance data pose challenges to the reliable estimation of microbial interaction networks. Methods and Results To identify the microbial interaction network, thezero-inflated latent Ising(ZILI) model is proposed which assumes the distribution of relative abundance relies only on finite latent states and provides a novel way to solve issues induced by the unit-sum and zero-inflation constrains. A two-step algorithm is proposed for the model selection of ZILI. ZILI is evaluated through simulated data and subsequently applied to an infant gut microbiota dataset from New Hampshire Birth Cohort Study. The results are compared with results from Gaussian graphical model (GGM) and dichotomous Ising model (DIS). Providing ZILI is the true data-generating model, the simulation studies show that the two-step algorithm can identify the graphical structure effectively and is robust to a range of parameter settings. For the infant gut microbiota dataset, the final estimated networks from GGM and ZILI turn out to have significant overlap in which the ZILI tends to select the sparser network than those from GGM. From the shared subnetwork, a hub taxon Lachnospiraceae is identified whose involvement in human disease development has been discovered recently in literature. Conclusions Constrains induced by relative abundance of microbiota such as zero inflation and unit sum render the conditional correlation analysis unreliable for conventional methods such as GGM. The proposed optimal categoricalization based ZILI model provides an alternative yet elegant way to deal with these difficulties. The results from ZILI have reasonable biological interpretation. This model can also be used to study the microbial interaction in other body parts.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Identification of microbial interaction network: zero-inflated latent Ising model based approach
    Jie Zhou
    Weston D. Viles
    Boran Lu
    Zhigang Li
    Juliette C. Madan
    Margaret R. Karagas
    Jiang Gui
    Anne G. Hoen
    [J]. BioData Mining, 13
  • [2] A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies
    Deek, Rebecca A.
    Li, Hongzhe
    [J]. FRONTIERS IN GENETICS, 2021, 11
  • [3] A New Zero-Inflated Negative Binomial Methodology for Latent Category Identification
    Simon J. Blanchard
    Wayne S. DeSarbo
    [J]. Psychometrika, 2013, 78 : 322 - 340
  • [4] A New Zero-Inflated Negative Binomial Methodology for Latent Category Identification
    Blanchard, Simon J.
    DeSarbo, Wayne S.
    [J]. PSYCHOMETRIKA, 2013, 78 (02) : 322 - 340
  • [5] The LZIP: A Bayesian Latent Factor Model for Correlated Zero-Inflated Counts
    Neelon, Brian
    Chung, Dongjun
    [J]. BIOMETRICS, 2017, 73 (01) : 185 - 196
  • [6] A novel approach for zero-inflated count regression model: Zero-inflated Poisson generalized-Lindley linear model with applications
    Altun, Emrah
    Alqifari, Hana
    Eliwa, Mohamed S.
    [J]. AIMS MATHEMATICS, 2023, 8 (10): : 23272 - 23290
  • [7] A Zero-Inflated Logistic Normal Multinomial Model for Extracting Microbial Compositions
    Zeng, Yanyan
    Pang, Daolin
    Zhao, Hongyu
    Wang, Tao
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2356 - 2369
  • [8] A Zero-Inflated Model for Insurance Data
    Choi, Jong-Hoo
    Ko, In-Mi
    Cheon, Sooyoung
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (03) : 485 - 494
  • [9] THE DEVELOPMENT OF A ZERO-INFLATED RASCH MODEL
    Kim, Sungyeun
    Lee, Guemin
    [J]. JOURNAL OF THE KOREAN SOCIETY OF MATHEMATICAL EDUCATION SERIES B-PURE AND APPLIED MATHEMATICS, 2013, 20 (01): : 59 - 70
  • [10] Outlier identification and robust parameter estimation in a zero-inflated Poisson model
    Yang, Jun
    Xie, Min
    Goh, Thong Ngee
    [J]. JOURNAL OF APPLIED STATISTICS, 2011, 38 (02) : 421 - 430