metaBIT, an integrative and automated metagenomic pipeline for analysing microbial profiles from high-throughput sequencing shotgun data

被引:28
|
作者
Louvel, Guillaume [1 ]
Sarkissian, Clio Der [1 ]
Hanghoj, Kristian [1 ]
Orlando, Ludovic [1 ,2 ]
机构
[1] Univ Copenhagen, Nat Hist Museum Denmark, Ctr Geogenet, Voldgade 5-7, DK-1350 Copenhagen, Denmark
[2] Univ Toulouse, UPS, Lab AMIS, CNRS,UMR 5288, 37 Allees Jules Guesde, F-31000 Toulouse, France
基金
新加坡国家研究基金会;
关键词
ancient DNA; metagenomics; microbial profiling; microbiome; shotgun sequencing; GENOME SEQUENCE; GUT MICROBIOTA; ANCIENT; DNA; COMMUNITIES; DIVERSITY; SAMPLES; TIME;
D O I
10.1111/1755-0998.12546
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Micro-organisms account for most of the Earth's biodiversity and yet remain largely unknown. The complexity and diversity of microbial communities present in clinical and environmental samples can now be robustly investigated in record times and prices thanks to recent advances in high-throughput DNA sequencing (HTS). Here, we develop metaBIT, an open-source computational pipeline automatizing routine microbial profiling of shotgun HTS data. Customizable by the user at different stringency levels, it performs robust taxonomy-based assignment and relative abundance calculation of microbial taxa, as well as cross-sample statistical analyses of microbial diversity distributions. We demonstrate the versatility of metaBIT within a range of published HTS data sets sampled from the environment (soil and seawater) and the human body (skin and gut), but also from archaeological specimens. We present the diversity of outputs provided by the pipeline for the visualization of microbial profiles (barplots, heatmaps) and for their characterization and comparison (diversity indices, hierarchical clustering and principal coordinates analyses). We show that metaBIT allows an automatic, fast and user-friendly profiling of the microbial DNA present in HTS shotgun data sets. The applications of metaBIT are vast, from monitoring of laboratory errors and contaminations, to the reconstruction of past and present microbiota, and the detection of candidate species, including pathogens.
引用
收藏
页码:1415 / 1427
页数:13
相关论文
共 50 条
  • [21] Microbial community analysis of mildewed cigar tobacco leaves from high-throughput sequencing data
    Wei, Min
    Shi, Youzhi
    Song, Xuyan
    Rong, Lin
    Li, Ziwei
    Li, Jing
    Wang, Bo
    Chen, Bifeng
    ANNALS OF MICROBIOLOGY, 2024, 74 (01)
  • [22] metaBEETL: high-throughput analysis of heterogeneous microbial populations from shotgun DNA sequences
    Ander, Christina
    Schulz-Trieglaff, Ole B.
    Stoye, Jens
    Cox, Anthony J.
    BMC BIOINFORMATICS, 2013, 14
  • [23] PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data
    Sharpton, Thomas J.
    Riesenfeld, Samantha J.
    Kembel, Steven W.
    Ladau, Joshua
    O'Dwyer, James P.
    Green, Jessica L.
    Eisen, Jonathan A.
    Pollard, Katherine S.
    PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (01)
  • [24] Evaluation of a Pooled Strategy for High-Throughput Sequencing of Cosmid Clones from Metagenomic Libraries
    Lam, Kathy N.
    Hall, Michael W.
    Engel, Katja
    Vey, Gregory
    Cheng, Jiujun
    Neufeld, Josh D.
    Charles, Trevor C.
    PLOS ONE, 2014, 9 (06):
  • [25] xIP-seq platform: an integrative framework for high-throughput sequencing data analysis
    Wang, Xin
    Teng, Mingmang
    Wang, Guohua
    Zhao, Yuming
    Han, Xu
    Feng, Weixing
    Li, Lang
    Sanford, Jeremy
    Liu, Yunlong
    2009 OHIO COLLABORATIVE CONFERENCE ON BIOINFORMATICS, PROCEEDINGS, 2009, : 26 - +
  • [26] Detecting Alu insertions from high-throughput sequencing data
    David, Matei
    Mustafa, Harun
    Brudno, Michael
    NUCLEIC ACIDS RESEARCH, 2013, 41 (17)
  • [27] EasyDIVER: A Pipeline for Assembling and Counting High-Throughput Sequencing Data from In Vitro Evolution of Nucleic Acids or Peptides
    Blanco, Celia
    Verbanic, Samuel
    Seelig, Burckhard
    Chen, Irene A.
    JOURNAL OF MOLECULAR EVOLUTION, 2020, 88 (06) : 477 - 481
  • [28] EasyDIVER: A Pipeline for Assembling and Counting High-Throughput Sequencing Data from In Vitro Evolution of Nucleic Acids or Peptides
    Celia Blanco
    Samuel Verbanic
    Burckhard Seelig
    Irene A. Chen
    Journal of Molecular Evolution, 2020, 88 : 477 - 481
  • [29] A Pipeline for the Error-Free Identification of Somatic Alu Insertions in High-Throughput Sequencing Data
    Nugmanov, G. A.
    Komkov, A. Y.
    Saliutina, M. V.
    Minervina, A. A.
    Lebedev, Y. B.
    Mamedov, I. Z.
    MOLECULAR BIOLOGY, 2019, 53 (01) : 138 - 146
  • [30] A Pipeline for the Error-Free Identification of Somatic Alu Insertions in High-Throughput Sequencing Data
    G. A. Nugmanov
    A. Y. Komkov
    M. V. Saliutina
    A. A. Minervina
    Y. B. Lebedev
    I. Z. Mamedov
    Molecular Biology, 2019, 53 : 138 - 146