A zero-inflated beta-binomial model for microbiome data analysis

被引:23
|
作者
Hu, Tao [1 ]
Gallins, Paul [1 ]
Zhou, Yi-Hui [1 ,2 ]
机构
[1] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Dept Biol Sci, Raleigh, NC 27695 USA
来源
STAT | 2018年 / 7卷 / 01期
关键词
count data; penalized generalized linear model; zero-inflated beta-binomial modelling; GUT MICROBIOTA; ASSOCIATION; UNIFRAC; HEALTH;
D O I
10.1002/sta4.185
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Microbiome is increasingly recognized as an important aspect of the health of host species, involved in many biological pathways and processes and potentially useful as health biomarkers. Taking advantage of high-throughput sequencing technologies, modern bacterial microbiome studies are metagenomic, interrogating thousands of taxa simultaneously. Several data analysis frameworks have been proposed for microbiome sequence read count data and for determining the most significant features. However, there is still room for improvement. We introduce a zero-inflated beta-binomial to model the distribution of microbiome count data and to determine association with a continuous or categorical phenotype of interest. The approach can exploit the mean-variance relationship to improve power and adjust for covariates. The proposed method is a mixture model with two components: (i) a zero model accounting for excess zeros and (ii) a count model to capture the remaining component by beta-binomial regression, allowing for overdispersion effects. Simulation studies show that our proposed method effectively controls type I error and has higher power than competing methods to detect taxa associated with phenotype. An R package ZIBBSeqDiscovery is available on R CRAN. Copyright (c) 2018 John Wiley & Sons, Ltd.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Estimation for zero-inflated beta-binomial regression model with missing response data
    Luo, Rong
    Paul, Sudhir
    [J]. STATISTICS IN MEDICINE, 2018, 37 (26) : 3789 - 3813
  • [2] A Bayesian zero-inflated beta-binomial model for longitudinal data with group-specific changepoints
    Wen, Chun-Che
    Baker, Nathaniel
    Paul, Rajib
    Hill, Elizabeth
    Hunt, Kelly
    Li, Hong
    Gray, Kevin
    Neelon, Brian
    [J]. STATISTICS IN MEDICINE, 2024, 43 (01) : 125 - 140
  • [3] A Bayesian zero-inflated negative binomial regression model for the integrative analysis of microbiome data
    Jiang, Shuang
    Xiao, Guanghua
    Koh, Andrew Y.
    Kim, Jiwoong
    Li, Qiwei
    Zhan, Xiaowei
    [J]. BIOSTATISTICS, 2021, 22 (03) : 522 - 540
  • [4] A multivariate zero-inflated binomial model for the analysis of correlated proportional data
    Deng, Dianliang
    Sun, Yiguang
    Tian, Guo-Liang
    [J]. JOURNAL OF APPLIED STATISTICS, 2022, 49 (11) : 2740 - 2766
  • [5] NBZIMM: negative binomial and zero-inflated mixed models, with application to microbiome/metagenomics data analysis
    Xinyan Zhang
    Nengjun Yi
    [J]. BMC Bioinformatics, 21
  • [6] NBZIMM: negative binomial and zero-inflated mixed models, with application to microbiome/metagenomics data analysis
    Zhang, Xinyan
    Yi, Nengjun
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [7] Parameter estimations of zero-inflated negative binomial model with incomplete data
    Pho, Kim-Hung
    Lukusa, T. Martin
    [J]. APPLIED MATHEMATICAL MODELLING, 2024, 129 : 207 - 231
  • [8] Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial
    Kyung Hyun Lee
    Claudia Pedroza
    Elenir B. C. Avritscher
    Ricardo A. Mosquera
    Jon E. Tyson
    [J]. Trials, 24
  • [9] Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial
    Lee, Kyung Hyun
    Pedroza, Claudia
    Avritscher, Elenir B. C.
    Mosquera, Ricardo A.
    Tyson, Jon E.
    [J]. TRIALS, 2023, 24 (01)
  • [10] A constrained marginal zero-inflated binomial regression model
    Ali, Essoham
    Diop, Aliou
    Dupuy, Jean-Francois
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (18) : 6396 - 6422