Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data

被引:16
|
作者
Pereira-Flores, Emiliano [1 ,2 ]
Gloeckner, Frank Oliver [1 ,3 ]
Fernandez-Guerra, Antonio [1 ,2 ,4 ]
机构
[1] Max Planck Inst Marine Microbiol, Microbial Genom & Bioinformat Res Grp, Celsiusstr 1, D-28359 Bremen, Germany
[2] Jacobs Univ Bremen gGmbH, Dept Life Sci & Chem, Campus Ring 1, D-28759 Bremen, Germany
[3] Alfred Wegener Inst, Helmholtz Ctr Polar & Marine Res, Handelshafen 12, D-27570 Bremerhaven, Germany
[4] Univ Oxford, Oxford E Res Ctr, Oxford OX1 3QG, England
关键词
Microbial ecology; Metagenomics; Functional traits; Average genome size; 16S rRNA gene average copy number; MICROBES; BACTERIA; ECOLOGY; TOOLS; SEA;
D O I
10.1186/s12859-019-3031-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Metagenomics caused a quantum leap in microbial ecology. However, the inherent size and complexity of metagenomic data limit its interpretation. The quantification of metagenomic traits in metagenomic analysis workflows has the potential to improve the exploitation of metagenomic data. Metagenomic traits are organisms' characteristics linked to their performance. They are measured at the genomic level taking a random sample of individuals in a community. As such, these traits provide valuable information to uncover microorganisms' ecological patterns. The Average Genome Size (AGS) and the 16S rRNA gene Average Copy Number (ACN) are two highly informative metagenomic traits that reflect microorganisms' ecological strategies as well as the environmental conditions they inhabit. Results: Here, we present the ags.sh and acn.sh tools, which analytically derive the AGS and ACN metagenomic traits. These tools represent an advance on previous approaches to compute the AGS and ACN traits. Benchmarking shows that ags.sh is up to 11 times faster than state-of-the-art tools dedicated to the estimation AGS. Both ags.sh and acn.sh show comparable or higher accuracy than existing tools used to estimate these traits. To exemplify the applicability of both tools, we analyzed the 139 prokaryotic metagenomes of TARA Oceans and revealed the ecological strategies associated with different water layers. Conclusion: We took advantage of recent advances in gene annotation to develop the ags.sh and acn.sh tools to combine easy tool usage with fast and accurate performance. Our tools compute the AGS and ACN metagenomic traits on unassembled metagenomes and allow researchers to improve their metagenomic data analysis to gain deeper insights into microorganisms' ecology. The ags.sh and acn.sh tools are publicly available using Docker container technology at https://github.com/pereiramemo/AGS-and-ACN-tools.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem
    Stilianos Louca
    Michael Doebeli
    Laura Wegener Parfrey
    Microbiome, 6
  • [22] Accounting for 16S rRNA copy number prediction uncertainty and its implications in bacterial diversity analyses
    Yingnan Gao
    Martin Wu
    ISME Communications, 3
  • [23] Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem
    Louca, Stilianos
    Doebeli, Michael
    Parfrey, Laura Wegener
    MICROBIOME, 2018, 6
  • [24] Accounting for 16S rRNA copy number prediction uncertainty and its implications in bacterial diversity analyses
    Gao, Yingnan
    Wu, Martin
    ISME COMMUNICATIONS, 2023, 3 (01):
  • [25] EFFECT OF GENOME SIZE AND RRN GENE COPY NUMBER ON PCR AMPLIFICATION OF 16S RIBOSOMAL-RNA GENES FROM A MIXTURE OF BACTERIAL SPECIES
    FARRELLY, V
    RAINEY, FA
    STACKEBRANDT, E
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 1995, 61 (07) : 2798 - 2801
  • [26] An Overview of Adenoid Microbiome Using 16S rRNA Gene Sequencing-Based Metagenomic Analysis
    Sokolovs-Karijs, Olegs
    Briviba, Monta
    Saksis, Rihards
    Sumeraga, Gunta
    Girotto, Francesca
    Erts, Renars
    Osite, Jana
    Krumina, Angelika
    MEDICINA-LITHUANIA, 2022, 58 (07):
  • [27] A 16S rRNA Gene and Draft Genome Database for the Murine Oral Bacterial Community
    Joseph, Susan
    Aduse-Opoku, Joseph
    Hashim, Ahmed
    Hanski, Eveliina
    Streich, Ricarda
    Knowles, Sarah C. L.
    Pedersen, Amy B.
    Wade, William G.
    Curtis, Michael A.
    MSYSTEMS, 2021, 6 (01)
  • [28] Accurate and fast identification of Campylobacter fetus in bulls by real-time PCR targeting a 16S rRNA gene sequence
    Delpiazzo, Rafael
    Barcellos, Maila
    Barros, Sofia
    Betancor, Laura
    Fraga, Martin
    Gil, Jorge
    Iraola, Gregorio
    Morsella, Claudia
    Paolicchi, Fernando
    Perez, Ruben
    Riet-Correa, Franklin
    Sanguinetti, Margarita
    Silva, Alfonso
    da Silva Silveira, Caroline
    Calleros, Lucia
    VETERINARY AND ANIMAL SCIENCE, 2021, 11
  • [29] Tax4Fun: predicting functional profiles from metagenomic 16S rRNA data
    Asshauer, Kathrin P.
    Wemheuer, Bernd
    Daniel, Rolf
    Meinicke, Peter
    BIOINFORMATICS, 2015, 31 (17) : 2882 - 2884
  • [30] 16S rRNA metagenomic data of microbial diversity of Pheidole decarinata Santschi (Hymenoptera: Formicidae) workers
    Ashigar, Mohammed Ahmed
    Majid, Abdul Hafiz Ab
    DATA IN BRIEF, 2020, 31