A Statistical Perspective on the Challenges in Molecular Microbial Biology

被引:0
|
作者
Pratheepa Jeganathan
Susan P. Holmes
机构
[1] Stanford University,Department of Statistics
关键词
Microbial ecology; Bayesian data analysis; Hierarchical mixture models; Latent Dirichlet allocation; Bayesian nonparametric ordination; Sequencing data; Quality control;
D O I
暂无
中图分类号
学科分类号
摘要
High throughput sequencing (HTS)-based technology enables identifying and quantifying non-culturable microbial organisms in all environments. Microbial sequences have enhanced our understanding of the human microbiome, the soil and plant environment, and the marine environment. All molecular microbial data pose statistical challenges due to contamination sequences from reagents, batch effects, unequal sampling, and undetected taxa. Technical biases and heteroscedasticity have the strongest effects, but different strains across subjects and environments also make direct differential abundance testing unwieldy. We provide an introduction to a few statistical tools that can overcome some of these difficulties and demonstrate those tools on an example. We show how standard statistical methods, such as simple hierarchical mixture and topic models, can facilitate inferences on latent microbial communities. We also review some nonparametric Bayesian approaches that combine visualization and uncertainty quantification. The intersection of molecular microbial biology and statistics is an exciting new venue. Finally, we list some of the important open problems that would benefit from more careful statistical method development.
引用
收藏
页码:131 / 160
页数:29
相关论文
共 50 条