Enhancing Long-Read-Based Strain-Aware Metagenome Assembly

Cited by: 6
Authors
Luo, Xiao [1, 2]
Kang, Xiongbin [1]
Schoenhuth, Alexander [1, 2]
Affiliations
[1] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
[2] Ctr Wiskunde & Informat, Life Sci & Hlth, Amsterdam, Netherlands
Funding
EU Horizon 2020;
Keywords
long reads; haplotype; strain; metagenome; genome assembly; SINGLE-CELL;
DOI
10.3389/fgene.2022.868280
Chinese Library Classification (CLC) number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Microbial communities are usually highly diverse and often involve multiple strains from the participating species due to the rapid evolution of microorganisms. In such a complex microecosystem, different strains may show different biological functions. While reconstruction of individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has largely remained unresolved so far. Next-generation sequencing has been routinely used in metagenome assembly but has struggled to generate strain-specific genome sequences because of its short read length. This explains why long-read sequencing technologies have recently provided unprecedented opportunities to carry out haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that either the MetaBooster or the MetaBooster-HiFi pipeline drastically outperforms the state-of-the-art de novo metagenome assemblers in terms of all relevant metagenome assembly criteria, including genome fraction, contig length, and error rates.
Pages: 8