Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads

被引:26
|
作者
Davenport, Colin F. [1 ]
Neugebauer, Jens [1 ]
Beckmann, Nils [2 ]
Friedrich, Benedikt [2 ]
Kameri, Burim [2 ]
Kokott, Svea [1 ]
Paetow, Malte [2 ]
Siekmann, Bjoern [2 ]
Wieding-Drewes, Matthias [2 ]
Wienhoefer, Markus [2 ]
Wolf, Stefan [2 ]
Tuemmler, Burkhard [1 ]
Ahlers, Volker [2 ]
Sprengel, Frauke [2 ]
机构
[1] Hannover Med Sch, D-3000 Hannover, Lower Saxony, Germany
[2] Univ Appl Sci & Arts, Dept Comp Sci, Hannover, Lower Saxony, Germany
来源
PLOS ONE | 2012年 / 7卷 / 08期
基金
美国国家卫生研究院;
关键词
RIBOSOMAL-RNA; SEQUENCES; MICROBIOME; COMMUNITIES; ALIGNMENT; BACTERIA; TAXONOMY; ARCHAEA; SERVER; GENES;
D O I
10.1371/journal.pone.0041224
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A: Metagenomic studies use high-throughput sequence data to investigate microbial communities in situ. However, considerable challenges remain in the analysis of these data, particularly with regard to speed and reliable analysis of microbial species as opposed to higher level taxa such as phyla. We here present Genometa, a computationally undemanding graphical user interface program that enables identification of bacterial species and gene content from datasets generated by inexpensive high-throughput short read sequencing technologies. Our approach was first verified on two simulated metagenomic short read datasets, detecting 100% and 94% of the bacterial species included with few false positives or false negatives. Subsequent comparative benchmarking analysis against three popular metagenomic algorithms on an Illumina human gut dataset revealed Genometa to attribute the most reads to bacteria at species level (i.e. including all strains of that species) and demonstrate similar or better accuracy than the other programs. Lastly, speed was demonstrated to be many times that of BLAST due to the use of modern short read aligners. Our method is highly accurate if bacteria in the sample are represented by genomes in the reference sequence but cannot find species absent from the reference. This method is one of the most user-friendly and resource efficient approaches and is thus feasible for rapidly analysing millions of short reads on a personal computer.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Accurate Genome Relative Abundance Estimation Based on Shotgun Metagenomic Reads
    Xia, Li C.
    Cram, Jacob A.
    Chen, Ting
    Fuhrman, Jed A.
    Sun, Fengzhu
    PLOS ONE, 2011, 6 (12):
  • [2] Fast and Sensitive Classification of Short Metagenomic Reads with SKraken
    Qian, Jia
    Marchiori, Davide
    Comin, Matteo
    BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOSTEC 2017), 2018, 881 : 212 - 226
  • [3] Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
    Liu, Bo
    Gibbons, Theodore
    Ghodsi, Mohammad
    Treangen, Todd
    Pop, Mihai
    BMC GENOMICS, 2011, 12
  • [4] Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
    Liu, Bo
    Gibbons, Theodore
    Ghodsi, Mohammad
    Treangen, Todd
    Pop, Mihai
    GENOME BIOLOGY, 2011, 12 : 10 - 11
  • [5] Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
    Bo Liu
    Theodore Gibbons
    Mohammad Ghodsi
    Todd Treangen
    Mihai Pop
    Genome Biology, 12
  • [6] Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
    Bo Liu
    Theodore Gibbons
    Mohammad Ghodsi
    Todd Treangen
    Mihai Pop
    Genome Biology, 12 (Suppl 1)
  • [7] Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
    Bo Liu
    Theodore Gibbons
    Mohammad Ghodsi
    Todd Treangen
    Mihai Pop
    BMC Genomics, 12
  • [8] rNA: a fast and accurate short reads numerical aligner
    Vezzi, Francesco
    Del Fabbro, Cristian
    Tomescu, Alexandru I.
    Policriti, Alberto
    BIOINFORMATICS, 2012, 28 (01) : 123 - 124
  • [9] Woods: A fast and accurate functional annotator and classifier of genomic and metagenomic sequences
    Sharma, Ashok K.
    Gupta, Ankit
    Kumar, Sanjiv
    Dhakan, Darshan B.
    Sharma, Vineet K.
    GENOMICS, 2015, 106 (01) : 1 - 6
  • [10] A de novo metagenomic assembly program for shotgun DNA reads
    Lai, Binbin
    Ding, Ruogu
    Li, Yang
    Duan, Liping
    Zhu, Huaiqiu
    BIOINFORMATICS, 2012, 28 (11) : 1455 - 1462