Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data

被引:16
|
作者
Pereira-Flores, Emiliano [1 ,2 ]
Gloeckner, Frank Oliver [1 ,3 ]
Fernandez-Guerra, Antonio [1 ,2 ,4 ]
机构
[1] Max Planck Inst Marine Microbiol, Microbial Genom & Bioinformat Res Grp, Celsiusstr 1, D-28359 Bremen, Germany
[2] Jacobs Univ Bremen gGmbH, Dept Life Sci & Chem, Campus Ring 1, D-28759 Bremen, Germany
[3] Alfred Wegener Inst, Helmholtz Ctr Polar & Marine Res, Handelshafen 12, D-27570 Bremerhaven, Germany
[4] Univ Oxford, Oxford E Res Ctr, Oxford OX1 3QG, England
关键词
Microbial ecology; Metagenomics; Functional traits; Average genome size; 16S rRNA gene average copy number; MICROBES; BACTERIA; ECOLOGY; TOOLS; SEA;
D O I
10.1186/s12859-019-3031-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Metagenomics caused a quantum leap in microbial ecology. However, the inherent size and complexity of metagenomic data limit its interpretation. The quantification of metagenomic traits in metagenomic analysis workflows has the potential to improve the exploitation of metagenomic data. Metagenomic traits are organisms' characteristics linked to their performance. They are measured at the genomic level taking a random sample of individuals in a community. As such, these traits provide valuable information to uncover microorganisms' ecological patterns. The Average Genome Size (AGS) and the 16S rRNA gene Average Copy Number (ACN) are two highly informative metagenomic traits that reflect microorganisms' ecological strategies as well as the environmental conditions they inhabit. Results: Here, we present the ags.sh and acn.sh tools, which analytically derive the AGS and ACN metagenomic traits. These tools represent an advance on previous approaches to compute the AGS and ACN traits. Benchmarking shows that ags.sh is up to 11 times faster than state-of-the-art tools dedicated to the estimation AGS. Both ags.sh and acn.sh show comparable or higher accuracy than existing tools used to estimate these traits. To exemplify the applicability of both tools, we analyzed the 139 prokaryotic metagenomes of TARA Oceans and revealed the ecological strategies associated with different water layers. Conclusion: We took advantage of recent advances in gene annotation to develop the ags.sh and acn.sh tools to combine easy tool usage with fast and accurate performance. Our tools compute the AGS and ACN metagenomic traits on unassembled metagenomes and allow researchers to improve their metagenomic data analysis to gain deeper insights into microorganisms' ecology. The ags.sh and acn.sh tools are publicly available using Docker container technology at https://github.com/pereiramemo/AGS-and-ACN-tools.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Metagenomic 16S rRNA gene sequencing survey of Borrelia species in Irish samples of Ixodes ricinus ticks
    Lambert, John S.
    Cook, Michael John
    Healy, John Eoin
    Murtagh, Ross
    Avramovic, Gordana
    Lee, Sin Hang
    PLOS ONE, 2019, 14 (04):
  • [42] Identification of bacteria associated with underground parts of Crocus sativus by 16S rRNA gene targeted metagenomic approach
    Ambardar, Sheetal
    Sangwan, Naseer
    Manjula, A.
    Rajendhran, J.
    Gunasekaran, P.
    Lal, Rup
    Vakhlu, Jyoti
    WORLD JOURNAL OF MICROBIOLOGY & BIOTECHNOLOGY, 2014, 30 (10): : 2701 - 2709
  • [43] Metagenomic analysis of the canine oral cavity as revealed by high-throughput pyrosequencing of the 16S rRNA gene
    Sturgeon, Amy
    Stull, Jason W.
    Costa, Marcio C.
    Weese, J. Scott
    VETERINARY MICROBIOLOGY, 2013, 162 (2-4) : 891 - 898
  • [44] Investigation of postpartum dairy cows' uterine microbial diversity using metagenomic pyrosequencing of the 16S rRNA gene
    Machado, V. S.
    Oikonomou, G.
    Bicalho, M. L. S.
    Knauer, W. A.
    Gilbert, R.
    Bicalho, R. C.
    VETERINARY MICROBIOLOGY, 2012, 159 (3-4) : 460 - 469
  • [45] Evaluation of gastric microbiome and metagenomic function in patients with intestinal metaplasia using 16S rRNA gene sequencing
    Park, Chan Hyuk
    Lee, A-reum
    Lee, Yu-ra
    Eun, Chang Soo
    Lee, Sang Kil
    Han, Dong Soo
    HELICOBACTER, 2019, 24 (01)
  • [46] A NOVEL METHOD FOR ASSESSMENT OF 16S RRNA GENE COPY NUMBER IN BACTERIAL GENOMES BY PULSED-FIELD GEL ELECTROPHORESIS AND PCR AMPLIFICATION
    Zeng, Yonghui
    Jiao, Nianzhi
    JOURNAL OF RAPID METHODS AND AUTOMATION IN MICROBIOLOGY, 2009, 17 (03): : 274 - 279
  • [47] Characterization and predictive functional profiles on metagenomic 16S rRNA data of liver transplant recipients: A longitudinal study
    Sivaraj, Saranya
    Copeland, Julia K.
    Malik, Anshu
    Pasini, Elisa
    Angeli, Marc
    Azhie, Amirhossein
    Husain, Shahid
    Kumar, Deepali
    Allard, Johane
    Guttman, David S.
    Humar, Atul
    Bhat, Mamatha
    CLINICAL TRANSPLANTATION, 2022, 36 (02)
  • [48] 16S rRNA Gene Amplicon Sequencing Data for Pteris vittata Rhizosphere Soils
    Mu'azu, Aminu Salisu
    Haris, Hazzeman
    Zarkasi, Kamarul Zaman
    Lau, Nyok-Sean
    Ghazali, Amir Hamzah
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2023, 12 (03):
  • [49] Molecular data from the 16S rRNA gene for the phylogeny of Veneridae (Mollusca: Bivalvia)
    Canapa, A
    Schiaparelli, S
    Marota, I
    Barucca, M
    MARINE BIOLOGY, 2003, 142 (06) : 1125 - 1130
  • [50] SCRAPT: an iterative algorithm for clustering large 16S rRNA gene data sets
    Luan, Tu
    Muralidharan, Harihara Subrahmaniam
    Alshehri, Marwan
    Mittra, Ipsa
    Pop, Mihai
    NUCLEIC ACIDS RESEARCH, 2023, 51 (08) : e46