Enhancing Long-Read-Based Strain-Aware Metagenome Assembly

Cited by: 6
Authors
Luo, Xiao [1,2]
Kang, Xiongbin [1]
Schoenhuth, Alexander [1,2]
Affiliations
[1] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
[2] Ctr Wiskunde & Informat, Life Sci & Hlth, Amsterdam, Netherlands
Funding
European Union Horizon 2020
Keywords
long reads; haplotype; strain; metagenome; genome assembly; SINGLE-CELL;
DOI
10.3389/fgene.2022.868280
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Microbial communities are usually highly diverse and, because microorganisms evolve rapidly, often contain multiple strains of the participating species. In such a complex microecosystem, different strains may have different biological functions. Although reconstructing individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has remained largely unresolved. Next-generation sequencing is routinely used in metagenome assembly, but its short read length makes it difficult to generate strain-specific genome sequences. Long-read sequencing technologies have therefore recently provided unprecedented opportunities for haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that both the MetaBooster and the MetaBooster-HiFi pipeline drastically outperform state-of-the-art de novo metagenome assemblers on all relevant metagenome assembly criteria, including genome fraction, contig length, and error rates.
Pages: 8