metaBIT, an integrative and automated metagenomic pipeline for analysing microbial profiles from high-throughput sequencing shotgun data

Cited by: 28
Authors
Louvel, Guillaume [1]
Der Sarkissian, Clio [1]
Hanghøj, Kristian [1]
Orlando, Ludovic [1, 2]
Affiliations
[1] Univ Copenhagen, Nat Hist Museum Denmark, Ctr Geogenet, Voldgade 5-7, DK-1350 Copenhagen, Denmark
[2] Univ Toulouse, UPS, Lab AMIS, CNRS,UMR 5288, 37 Allees Jules Guesde, F-31000 Toulouse, France
Funding
National Research Foundation, Singapore;
Keywords
ancient DNA; metagenomics; microbial profiling; microbiome; shotgun sequencing; GENOME SEQUENCE; GUT MICROBIOTA; ANCIENT; DNA; COMMUNITIES; DIVERSITY; SAMPLES; TIME;
DOI
10.1111/1755-0998.12546
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Micro-organisms account for most of the Earth's biodiversity and yet remain largely unknown. The complexity and diversity of microbial communities present in clinical and environmental samples can now be robustly investigated in record times and prices thanks to recent advances in high-throughput DNA sequencing (HTS). Here, we develop metaBIT, an open-source computational pipeline automatizing routine microbial profiling of shotgun HTS data. Customizable by the user at different stringency levels, it performs robust taxonomy-based assignment and relative abundance calculation of microbial taxa, as well as cross-sample statistical analyses of microbial diversity distributions. We demonstrate the versatility of metaBIT within a range of published HTS data sets sampled from the environment (soil and seawater) and the human body (skin and gut), but also from archaeological specimens. We present the diversity of outputs provided by the pipeline for the visualization of microbial profiles (barplots, heatmaps) and for their characterization and comparison (diversity indices, hierarchical clustering and principal coordinates analyses). We show that metaBIT allows an automatic, fast and user-friendly profiling of the microbial DNA present in HTS shotgun data sets. The applications of metaBIT are vast, from monitoring of laboratory errors and contaminations, to the reconstruction of past and present microbiota, and the detection of candidate species, including pathogens.
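The relative abundance calculation and diversity indices mentioned in the abstract can be illustrated with a minimal Python sketch. This is not metaBIT's own code: the function names, the choice of the Shannon index and Bray-Curtis dissimilarity, and the read counts are all assumptions made for illustration, since the abstract does not say which indices or distances the pipeline uses.

```python
import math


def relative_abundance(counts):
    """Convert raw per-taxon read counts into proportions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads assigned to any taxon")
    return {taxon: n / total for taxon, n in counts.items()}


def shannon_index(abundances):
    """Shannon diversity H = -sum(p * ln p) over taxa with p > 0."""
    return -sum(p * math.log(p) for p in abundances.values() if p > 0)


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance profiles (0 = identical)."""
    taxa = set(a) | set(b)
    num = sum(abs(a.get(t, 0.0) - b.get(t, 0.0)) for t in taxa)
    den = sum(a.get(t, 0.0) + b.get(t, 0.0) for t in taxa)
    return num / den if den else 0.0


# Hypothetical read counts per genus for two samples (illustrative only).
gut_counts = {"Bacteroides": 60, "Prevotella": 30, "Escherichia": 10}
soil_counts = {"Streptomyces": 50, "Bacillus": 50}

gut = relative_abundance(gut_counts)
soil = relative_abundance(soil_counts)

print("Shannon (gut): ", shannon_index(gut))
print("Shannon (soil):", shannon_index(soil))
print("Bray-Curtis:   ", bray_curtis(gut, soil))
```

A matrix of pairwise dissimilarities computed this way is the kind of input that hierarchical clustering and principal coordinates analysis operate on.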
Pages: 1415-1427
Page count: 13