Enhancing Long-Read-Based Strain-Aware Metagenome Assembly

Cited by: 6
Authors
Luo, Xiao [1, 2]
Kang, Xiongbin [1]
Schoenhuth, Alexander [1, 2]
Affiliations
[1] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
[2] Ctr Wiskunde & Informat, Life Sci & Hlth, Amsterdam, Netherlands
Funding
European Union Horizon 2020
Keywords
long reads; haplotype; strain; metagenome; genome assembly; SINGLE-CELL;
DOI
10.3389/fgene.2022.868280
Chinese Library Classification number
Q3 [Genetics];
Discipline classification codes
071007 ; 090102 ;
Abstract
Microbial communities are usually highly diverse and, because microorganisms evolve rapidly, often contain multiple strains of the participating species. In such a complex microecosystem, different strains may show different biological functions. While reconstructing individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has remained largely unresolved. Next-generation sequencing has been routinely used in metagenome assembly but has struggled to generate strain-specific genome sequences because of the short read length. Long-read sequencing technologies have therefore recently provided unprecedented opportunities to carry out haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that either the MetaBooster or the MetaBooster-HiFi pipeline drastically outperforms state-of-the-art de novo metagenome assemblers in terms of all relevant metagenome assembly criteria, including genome fraction, contig length, and error rates.
Pages: 8