Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data

Cited by: 16
Authors
Pereira-Flores, Emiliano [1 ,2 ]
Gloeckner, Frank Oliver [1 ,3 ]
Fernandez-Guerra, Antonio [1 ,2 ,4 ]
Affiliations
[1] Max Planck Inst Marine Microbiol, Microbial Genom & Bioinformat Res Grp, Celsiusstr 1, D-28359 Bremen, Germany
[2] Jacobs Univ Bremen gGmbH, Dept Life Sci & Chem, Campus Ring 1, D-28759 Bremen, Germany
[3] Alfred Wegener Inst, Helmholtz Ctr Polar & Marine Res, Handelshafen 12, D-27570 Bremerhaven, Germany
[4] Univ Oxford, Oxford E Res Ctr, Oxford OX1 3QG, England
Keywords
Microbial ecology; Metagenomics; Functional traits; Average genome size; 16S rRNA gene average copy number; MICROBES; BACTERIA; ECOLOGY; TOOLS; SEA
DOI
10.1186/s12859-019-3031-y
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Background: Metagenomics caused a quantum leap in microbial ecology. However, the inherent size and complexity of metagenomic data limit its interpretation. The quantification of metagenomic traits in metagenomic analysis workflows has the potential to improve the exploitation of metagenomic data. Metagenomic traits are organisms' characteristics linked to their performance. They are measured at the genomic level, taking a random sample of individuals in a community. As such, these traits provide valuable information to uncover microorganisms' ecological patterns. The Average Genome Size (AGS) and the 16S rRNA gene Average Copy Number (ACN) are two highly informative metagenomic traits that reflect microorganisms' ecological strategies as well as the environmental conditions they inhabit.

Results: Here, we present the ags.sh and acn.sh tools, which analytically derive the AGS and ACN metagenomic traits. These tools represent an advance on previous approaches to compute the AGS and ACN traits. Benchmarking shows that ags.sh is up to 11 times faster than state-of-the-art tools dedicated to the estimation of AGS. Both ags.sh and acn.sh show comparable or higher accuracy than existing tools used to estimate these traits. To exemplify the applicability of both tools, we analyzed the 139 prokaryotic metagenomes of TARA Oceans and revealed the ecological strategies associated with different water layers.

Conclusion: We took advantage of recent advances in gene annotation to develop the ags.sh and acn.sh tools to combine easy tool usage with fast and accurate performance. Our tools compute the AGS and ACN metagenomic traits on unassembled metagenomes and allow researchers to improve their metagenomic data analysis to gain deeper insights into microorganisms' ecology. The ags.sh and acn.sh tools are publicly available using Docker container technology at https://github.com/pereiramemo/AGS-and-ACN-tools.
Pages: 13
Related papers
50 in total
  • [31] Comprehensive 16S rRNA and metagenomic data from the gut microbiome of aging and rejuvenation mouse models
    Shin, Jongoh
    Noh, Jung-Ran
    Choe, Donghui
    Lee, Namil
    Song, Yoseb
    Cho, Suhyung
    Kang, Eun-Jung
    Go, Min-Jeong
    Ha, Seok Kyun
    Kim, Jae-Hoon
    Kim, Yong-Hoon
    Kim, Kyoung-Shim
    Kim, Byoung-Chan
    Lee, Chul-Ho
    Cho, Byung-Kwan
    SCIENTIFIC DATA, 2022, 9 (01)
  • [32] Comprehensive 16S rRNA and metagenomic data from the gut microbiome of aging and rejuvenation mouse models
    Jongoh Shin
    Jung-Ran Noh
    Donghui Choe
    Namil Lee
    Yoseb Song
    Suhyung Cho
    Eun-Jung Kang
    Min-Jeong Go
    Seok Kyun Ha
    Jae-Hoon Kim
    Yong-Hoon Kim
    Kyoung-Shim Kim
    Byoung-Chan Kim
    Chul-Ho Lee
    Byung-Kwan Cho
    Scientific Data, 9
  • [33] Looking for Rhizobacterial Ecological Indicators in Agricultural Soils Using 16S rRNA metagenomic Amplicon Data
    Valverde, Jose R.
    Gullon, Sonia
    Perez Mellado, Rafael
    PLOS ONE, 2016, 11 (10)
  • [34] Turkey fecal microbial community structure and functional gene diversity revealed by 16S rRNA gene and metagenomic sequences
    Jingrang Lu
    Jorge Santo Domingo
    The Journal of Microbiology, 2008, 46 : 469 - 477
  • [35] Isolation and identification of Ktedonobacteria using 16S rRNA gene sequences data
    Rachmania, M. K.
    Ningsih, F.
    Sakai, Y.
    Yabe, S.
    Yokota, A.
    Sjamsuridzal, W.
    INTERNATIONAL SYMPOSIUM OF INNOVATIVE BIO-PRODUCTION INDONESIA ON BIOTECHNOLOGY AND BIOENGINEERING 2019, 2020, 439
  • [36] Turkey Fecal Microbial Community Structure and Functional Gene Diversity Revealed by 16S rRNA Gene and Metagenomic Sequences
    Lu, Jingrang
    Domingo, Jorge Santo
    JOURNAL OF MICROBIOLOGY, 2008, 46 (05) : 469 - 477
  • [37] COPY NUMBER OF THE 16S RIBOSOMAL-RNA GENE IN RICKETTSIA-PROWAZEKII
    PANG, HL
    WINKLER, HH
    JOURNAL OF BACTERIOLOGY, 1993, 175 (12) : 3893 - 3896
  • [38] Yersinia spp. Identification Using Copy Diversity in the Chromosomal 16S rRNA Gene Sequence
    Hao, Huijing
    Liang, Junrong
    Duan, Ran
    Chen, Yuhuang
    Liu, Chang
    Xiao, Yuchun
    Li, Xu
    Su, Mingming
    Jing, Huaiqi
    Wang, Xin
    PLOS ONE, 2016, 11 (01)
  • [39] Identification of bacteria associated with underground parts of Crocus sativus by 16S rRNA gene targeted metagenomic approach
    Sheetal Ambardar
    Naseer Sangwan
    A. Manjula
    J. Rajendhran
    P. Gunasekaran
    Rup Lal
    Jyoti Vakhlu
    World Journal of Microbiology and Biotechnology, 2014, 30 : 2701 - 2709
  • [40] Multicenter assessment of microbial community profiling using 16S rRNA gene sequencing and shotgun metagenomic sequencing
    Han, Dongsheng
    Gao, Peng
    Li, Rui
    Tan, Ping
    Xie, Jiehong
    Zhang, Rui
    Li, Jinming
    JOURNAL OF ADVANCED RESEARCH, 2020, 26 : 111 - 121