Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis
被引:47
|
作者:
Tang, Zheng-Zheng
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53715 USA
Wisconsin Inst Discovery, Madison, WI 53715 USAUniv Wisconsin, Dept Biostat & Med Informat, Madison, WI 53715 USA
Tang, Zheng-Zheng
[1
,2
]
Chen, Guanhua
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53715 USAUniv Wisconsin, Dept Biostat & Med Informat, Madison, WI 53715 USA
Chen, Guanhua
[1
]
机构:
[1] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53715 USA
[2] Wisconsin Inst Discovery, Madison, WI 53715 USA
There is heightened interest in using high-throughput sequencing technologies to quantify abundances of microbial taxa and linking the abundance to human diseases and traits. Proper modeling of multivariate taxon counts is essential to the power of detecting this association. Existing models are limited in handling excessive zero observations in taxon counts and in flexibly accommodating complex correlation structures and dispersion patterns among taxa. In this article, we develop a new probability distribution, zero-inflated generalized Dirichlet multinomial (ZIGDM), that overcomes these limitations in modeling multivariate taxon counts. Based on this distribution, we propose a ZIGDM regression model to link microbial abundances to covariates (e.g. disease status) and develop a fast expectation-maximization algorithm to efficiently estimate parameters in the model. The derived tests enable us to reveal rich patterns of variation in microbial compositions including differential mean and dispersion. The advantages of the proposed methods are demonstrated through simulation studies and an analysis of a gut microbiome dataset.
机构:
Changwon Natl Univ, Dept Stat, Chang Won 51140, South Korea
Seoul Natl Univ, Dept Stat, Seoul 08826, South KoreaChangwon Natl Univ, Dept Stat, Chang Won 51140, South Korea
Kim, Kipoong
Park, Jaesung
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Dept Stat, Seoul 08826, South KoreaChangwon Natl Univ, Dept Stat, Chang Won 51140, South Korea
Park, Jaesung
Jung, Sungkyu
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Dept Stat, Seoul 08826, South Korea
Seoul Natl Univ, Inst Data Innovat Sci, 1 Gwanak Ro, Seoul 08826, South KoreaChangwon Natl Univ, Dept Stat, Chang Won 51140, South Korea