Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data

被引:16
|
作者
Pereira-Flores, Emiliano [1 ,2 ]
Gloeckner, Frank Oliver [1 ,3 ]
Fernandez-Guerra, Antonio [1 ,2 ,4 ]
机构
[1] Max Planck Inst Marine Microbiol, Microbial Genom & Bioinformat Res Grp, Celsiusstr 1, D-28359 Bremen, Germany
[2] Jacobs Univ Bremen gGmbH, Dept Life Sci & Chem, Campus Ring 1, D-28759 Bremen, Germany
[3] Alfred Wegener Inst, Helmholtz Ctr Polar & Marine Res, Handelshafen 12, D-27570 Bremerhaven, Germany
[4] Univ Oxford, Oxford E Res Ctr, Oxford OX1 3QG, England
关键词
Microbial ecology; Metagenomics; Functional traits; Average genome size; 16S rRNA gene average copy number; MICROBES; BACTERIA; ECOLOGY; TOOLS; SEA;
D O I
10.1186/s12859-019-3031-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Metagenomics caused a quantum leap in microbial ecology. However, the inherent size and complexity of metagenomic data limit its interpretation. The quantification of metagenomic traits in metagenomic analysis workflows has the potential to improve the exploitation of metagenomic data. Metagenomic traits are organisms' characteristics linked to their performance. They are measured at the genomic level taking a random sample of individuals in a community. As such, these traits provide valuable information to uncover microorganisms' ecological patterns. The Average Genome Size (AGS) and the 16S rRNA gene Average Copy Number (ACN) are two highly informative metagenomic traits that reflect microorganisms' ecological strategies as well as the environmental conditions they inhabit. Results: Here, we present the ags.sh and acn.sh tools, which analytically derive the AGS and ACN metagenomic traits. These tools represent an advance on previous approaches to compute the AGS and ACN traits. Benchmarking shows that ags.sh is up to 11 times faster than state-of-the-art tools dedicated to the estimation AGS. Both ags.sh and acn.sh show comparable or higher accuracy than existing tools used to estimate these traits. To exemplify the applicability of both tools, we analyzed the 139 prokaryotic metagenomes of TARA Oceans and revealed the ecological strategies associated with different water layers. Conclusion: We took advantage of recent advances in gene annotation to develop the ags.sh and acn.sh tools to combine easy tool usage with fast and accurate performance. Our tools compute the AGS and ACN metagenomic traits on unassembled metagenomes and allow researchers to improve their metagenomic data analysis to gain deeper insights into microorganisms' ecology. The ags.sh and acn.sh tools are publicly available using Docker container technology at https://github.com/pereiramemo/AGS-and-ACN-tools.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data
    Emiliano Pereira-Flores
    Frank Oliver Glöckner
    Antonio Fernandez-Guerra
    BMC Bioinformatics, 20
  • [2] Copy number of the 16S rRNA gene in Coxiella burnetii
    Afseth, G
    Mallavia, LP
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 1997, 13 (06) : 729 - 731
  • [3] Copy number of the 16S rRNA gene in Coxiella burnetii
    Guy Afseth
    Louis P. Mallavia
    European Journal of Epidemiology, 1997, 13 : 729 - 731
  • [4] Deep learning for predicting 16S rRNA gene copy number
    Miao, Jiazheng
    Chen, Tianlai
    Misir, Mustafa
    Lin, Yajuan
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [5] 16S Classifier: A Tool for Fast and Accurate Taxonomic Classification of 16S rRNA Hypervariable Regions in Metagenomic Datasets
    Chaudhary, Nikhil
    Sharma, Ashok K.
    Agarwal, Piyush
    Gupta, Ankit
    Sharma, Vineet K.
    PLOS ONE, 2015, 10 (02):
  • [6] Reconstructing 16S rRNA genes in metagenomic data
    Yuan, Cheng
    Lei, Jikai
    Cole, James
    Sun, Yanni
    BIOINFORMATICS, 2015, 31 (12) : 35 - 43
  • [7] Genome size of Eperythrozoon suis and hybridization with 16S rRNA gene
    Messick, JB
    Smith, G
    Berent, L
    Cooper, S
    CANADIAN JOURNAL OF MICROBIOLOGY, 2000, 46 (11) : 1082 - 1086
  • [8] Sequence and copy number of the Xanthomonas campestris pv campestris gene encoding 16S rRNA
    Lin, NT
    Tseng, YH
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1997, 235 (02) : 276 - 280
  • [9] Dependence of genome size and copy number of rRNA gene on cell volume in dinoflagellates
    Liu, Yuyang
    Hu, Zhangxi
    Deng, Yunyan
    Shang, Lixia
    Gobler, Christopher J.
    Tang, Ying Zhong
    HARMFUL ALGAE, 2021, 109
  • [10] 16S rRNA Gene Copy Number Normalization Does Not Provide More Reliable Conclusions in Metataxonomic Surveys
    Starke, Robert
    Pylro, Victor Satler
    Morais, Daniel Kumazawa
    MICROBIAL ECOLOGY, 2021, 81 (02) : 535 - 539