Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics

被引:23
|
作者
Haro-Moreno, Jose M. [1 ]
Lopez-Perez, Mario [1 ]
Rodriguez-Valera, Francisco [1 ,2 ]
机构
[1] Univ Miguel Hernandez, Div Microbiol, Evolutionary Genom Grp, Alicante, Spain
[2] Moscow Inst Phys & Technol, Res Ctr Mol Mech Aging & Age Related Dis, Dolgoprudnyi, Russia
关键词
metagenome; metagenome-assembled genomes (MAGs); long-read sequencing; PacBio CCS long-reads; polyketide synthase (PKS); CRISPR; MULTIPLE SEQUENCE ALIGNMENT; SINGLE-CELL; DIVERSITY; DATABASE; EVOLUTION; INSIGHTS; BACTERIA; REVEALS; IDENTIFICATION; EURYARCHAEOTA;
D O I
10.3389/fmicb.2021.708782
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Third-generation sequencing has penetrated little in metagenomics due to the high error rate and dependence for assembly on short-read designed bioinformatics. However, second-generation sequencing metagenomics (mostly Illumina) suffers from limitations, particularly in the assembly of microbes with high microdiversity and retrieval of the flexible (adaptive) fraction of prokaryotic genomes. Here, we have used a third-generation technique to study the metagenome of a well-known marine sample from the mixed epipelagic water column of the winter Mediterranean. We have compared PacBio Sequel II with the classical approach using Illumina Nextseq short reads followed by assembly to study the metagenome. Long reads allow for efficient direct retrieval of complete genes avoiding the bias of the assembly step. Besides, the application of long reads on metagenomic assembly allows for the reconstruction of much more complete metagenome-assembled genomes (MAGs), particularly from microbes with high microdiversity such as Pelagibacterales. The flexible genome of reconstructed MAGs was much more complete containing many adaptive genes (some with biotechnological potential). PacBio Sequel II CCS appears particularly suitable for cellular metagenomics due to its low error rate. For most applications of metagenomics, from community structure analysis to ecosystem functioning, long reads should be applied whenever possible. Specifically, for in silico screening of biotechnologically useful genes, or population genomics, long-read metagenomics appears presently as a very fruitful approach and can be analyzed from raw reads before a computationally demanding (and potentially artifactual) assembly step.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Insights into ecological roles of uncultivated bacteria in Katase hot spring sediment from long-read metagenomics
    Kato, Shingo
    Masuda, Sachiko
    Shibata, Arisa
    Shirasu, Ken
    Ohkuma, Moriya
    FRONTIERS IN MICROBIOLOGY, 2022, 13
  • [32] Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing
    Singleton, Caitlin M.
    Petriglieri, Francesca
    Kristensen, Jannie M.
    Kirkegaard, Rasmus H.
    Michaelsen, Thomas Y.
    Andersen, Martin H.
    Kondrotaite, Zivile
    Karst, Soren M.
    Dueholm, Morten S.
    Nielsen, Per H.
    Albertsen, Mads
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [33] Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing
    Caitlin M. Singleton
    Francesca Petriglieri
    Jannie M. Kristensen
    Rasmus H. Kirkegaard
    Thomas Y. Michaelsen
    Martin H. Andersen
    Zivile Kondrotaite
    Søren M. Karst
    Morten S. Dueholm
    Per H. Nielsen
    Mads Albertsen
    Nature Communications, 12
  • [34] Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data
    Chin, Chen-Shan
    Alexander, David H.
    Marks, Patrick
    Klammer, Aaron A.
    Drake, James
    Heiner, Cheryl
    Clum, Alicia
    Copeland, Alex
    Huddleston, John
    Eichler, Evan E.
    Turner, Stephen W.
    Korlach, Jonas
    NATURE METHODS, 2013, 10 (06) : 563 - +
  • [35] Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data
    Chin C.-S.
    Alexander D.H.
    Marks P.
    Klammer A.A.
    Drake J.
    Heiner C.
    Clum A.
    Copeland A.
    Huddleston J.
    Eichler E.E.
    Turner S.W.
    Korlach J.
    Nature Methods, 2013, 10 (6) : 563 - 569
  • [36] Complete nontuberculous mycobacteria whole genomes using an optimized DNA extraction protocol for long-read sequencing
    Jennifer M. Bouso
    Paul J. Planet
    BMC Genomics, 20
  • [37] Simultaneous profiling of chromatin accessibility and DNA methylation in complete plant genomes using long-read sequencing
    Leduque, Basile
    Edera, Alejandro
    Vitte, Clementine
    Quadrana, Leandro
    NUCLEIC ACIDS RESEARCH, 2024, 52 (11) : 6285 - 6297
  • [38] Assembly of Mitochondrial Genomes Using Nanopore Long-Read Technology in Three Sea Chubs (Teleostei: Kyphosidae)
    Baeza, J. Antonio
    Minish, Jeremiah J.
    Michael, Todd P.
    MOLECULAR ECOLOGY RESOURCES, 2025, 25 (01)
  • [39] A novel and diverse group of Candidatus Patescibacteria from bathypelagic Lake Baikal revealed through long-read metagenomics
    Jose M. Haro-Moreno
    Pedro J. Cabello-Yeves
    M. Pilar Garcillán-Barcia
    Alexandra Zakharenko
    Tamara I. Zemskaya
    Francisco Rodriguez-Valera
    Environmental Microbiome, 18
  • [40] A novel and diverse group of Candidatus Patescibacteria from bathypelagic Lake Baikal revealed through long-read metagenomics
    Haro-Moreno, Jose M.
    Cabello-Yeves, Pedro J.
    Garcillan-Barcia, M. Pilar
    Zakharenko, Alexandra
    Zemskaya, Tamara I.
    Rodriguez-Valera, Francisco
    ENVIRONMENTAL MICROBIOME, 2023, 18 (01)