Short Read Lengths Recover Ecological Patterns in 16S rRNA Gene Amplicon Data

被引:0
|
作者
Jurburg, Stephanie D. [1 ,2 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Environm Microbiol, Leipzig, Germany
[2] German Ctr Integrat Biodivers Res iDiv, Leipzig, Germany
关键词
bacteria; bioinformatics; data reuse; metabarcoding; microbiome; BACTERIOPLANKTON COMMUNITIES; SEQUENCE-ANALYSIS; DIVERSITY; IDENTIFICATION;
D O I
10.1111/1755-0998.14102
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
16S rRNA gene metabarcoding, the study of amplicon sequences of the 16S rRNA gene from mixed environmental samples, is an increasingly popular and accessible method for assessing bacterial communities across a wide range of environments. As metabarcoding sequence data archives continue to grow, data reuse will likely become an important source of novel insights into the ecology of microbes. While recent work has demonstrated the benefits of longer read lengths for the study of microbial communities from 16S rRNA gene segments, no studies have explored the use of shorter (< 200 bp) read lengths in the context of data reuse. Nevertheless, this information is essential to improve the reuse and comparability of metabarcoding data across existing datasets. This study reanalyzed nine 16S rRNA datasets targeting aquatic, animal-associated and soil microbiomes, and evaluated how processing the sequence data across a range of read lengths affected the resulting taxonomic assignments, biodiversity metrics and differential (i.e., before-after treatment) analyses. Short read lengths successfully recovered ecological patterns and allowed for the use of more sequences. Limited increases in resolution were observed beyond 150 bp reads across environments. Furthermore, abundance-weighted diversity metrics (e.g., Inverse Simpson index, Morisita-Horn dissimilarities or weighted Unifrac distances) were more robust to variation in read lengths. Read lengths alone contributed to consistent increases in the total number of ASVs detected, highlighting the need to consider metabarcoding-derived diversity estimates within the context of the bioinformatics parameters selected. This study provides evidence-based guidelines for the processing of short reads.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Community analysis of picocyanobacteria in an oligotrophic lake by cloning 16S rRNA gene and 16S rRNA gene amplicon sequencing
    Fujimoto, Naoshi
    Mizuno, Keigo
    Yokoyama, Tomoki
    Ohnishi, Akihiro
    Suzuki, Masaharu
    Watanabe, Satoru
    Komatsu, Kenji
    Sakata, Yoichi
    Kishida, Naohiro
    Akiba, Michihiro
    Matsukura, Satoko
    JOURNAL OF GENERAL AND APPLIED MICROBIOLOGY, 2015, 61 (05): : 171 - 176
  • [2] 16S rRNA Gene Amplicon Sequencing Data for Pteris vittata Rhizosphere Soils
    Mu'azu, Aminu Salisu
    Haris, Hazzeman
    Zarkasi, Kamarul Zaman
    Lau, Nyok-Sean
    Ghazali, Amir Hamzah
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2023, 12 (03):
  • [3] 16S rRNA gene amplicon sequence data from sunflower endosphere bacterial community
    Babalola, Olubukola Oluranti
    Adeleke, Bartholomew Saanu
    Ayangbenro, Ayansina Segun
    DATA IN BRIEF, 2021, 39
  • [4] Looking for Rhizobacterial Ecological Indicators in Agricultural Soils Using 16S rRNA metagenomic Amplicon Data
    Valverde, Jose R.
    Gullon, Sonia
    Perez Mellado, Rafael
    PLOS ONE, 2016, 11 (10):
  • [5] 16S rRNA gene amplicon sequencing data from an Australian wastewater treatment plant
    Romanis, C. S.
    Timms, V. J.
    Crosbie, N. D.
    Neilan, B. A.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2024, 13 (06):
  • [6] METASEED: a novel approach to full-length 16S rRNA gene reconstruction from short read data
    Philip, Melcy
    Rudi, Knut
    Ormaasen, Ida
    Angell, Inga Leena
    Pettersen, Ragnhild
    Keeley, Nigel B.
    Snipen, Lars-Gustav
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [7] Triplicate PCR reactions for 16S rRNA gene amplicon sequencing are unnecessary
    Marotz, Clarisse
    Sharma, Anukriti
    Humphrey, Greg
    Gottel, Neil
    Daum, Christopher
    Gilbert, Jack A.
    Eloe-Fadrosh, Emiley
    Knight, Rob
    BIOTECHNIQUES, 2019, 67 (01) : 29 - 32
  • [8] 16S rRNA Gene Amplicon Sequencing Data of Bacterial Community of Freshwater Sponge Lubomirskia baicalensis
    Belikov, Sergei, I
    Petrushin, Ivan S.
    Chernogor, Lubov, I
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2022, 11 (02):
  • [9] 16S rRNA gene amplicon sequencing data from the gut microbiota of adolescent Afghan refugees
    Shahzad, Muhammad
    Saeedullah, Anum
    Khan, Muhammad Shabbir
    Ahmad, Habab Ali
    Iddrissu, Ishawu
    Andrews, Simon C.
    DATA IN BRIEF, 2024, 55
  • [10] Reprocessing 16S rRNA Gene Amplicon Sequencing Studies: (Meta)Data Issues, Robustness, and Reproducibility
    Kang, Xiongbin
    Deng, Dong Mei
    Crielaard, Wim
    Brandt, Bernd W.
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2021, 11