共 47 条
Enhancing Long-Read-Based Strain-Aware Metagenome Assembly
被引:6
|作者:
Luo, Xiao
[1
,2
]
Kang, Xiongbin
[1
]
Schoenhuth, Alexander
[1
,2
]
机构:
[1] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
[2] Ctr Wiskunde & Informat, Life Sci & Hlth, Amsterdam, Netherlands
基金:
欧盟地平线“2020”;
关键词:
long reads;
haplotype;
strain;
metagenome;
genome assembly;
SINGLE-CELL;
D O I:
10.3389/fgene.2022.868280
中图分类号:
Q3 [遗传学];
学科分类号:
071007 ;
090102 ;
摘要:
Microbial communities are usually highly diverse and often involve multiple strains from the participating species due to the rapid evolution of microorganisms. In such a complex microecosystem, different strains may show different biological functions. While reconstruction of individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has largely remained unresolved so far. Next-generation sequencing has been routinely used in metagenome assembly but there have been struggles to generate strain-specific genome sequences due to the short-read length. This explains why long-read sequencing technologies have recently provided unprecedented opportunities to carry out haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, as two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that either the MetaBooster or the MetaBooster-HiFi pipeline drastically outperforms the state-of-the-art de novo metagenome assemblers, in terms of all relevant metagenome assembly criteria, involving genome fraction, contig length, and error rates.
引用
收藏
页数:8
相关论文