Generation and application of pseudo-long reads for metagenome assembly

被引:0
|
作者
Sim, Mikang [1 ]
Lee, Jongin [1 ]
Wy, Suyeon [1 ]
Park, Nayoung [1 ]
Lee, Daehwan [1 ]
Kwon, Daehong [1 ]
kim, Jaebum [1 ]
机构
[1] Konkuk Univ, Dept Biomed Sci & Engn, 120 Neungdong Ro, Seoul 05029, South Korea
来源
GIGASCIENCE | 2022年 / 11卷
关键词
next-generation sequencing; metagenomic assembly; pseudo-long read;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Metagenomic assembly using high-throughput sequencing data is a powerful method to construct microbial genomes in environmental samples without cultivation. However, metagenomic assembly, especially when only short reads are available, is a complex and challenging task because mixed genomes of multiple microorganisms constitute the metagenome. Although long read sequencing technologies have been developed and have begun to be used for metagenomic assembly, many metagenomic studies have been performed based on short reads because the generation of long reads requires higher sequencing cost than short reads. Results In this study, we present a new method called PLR-GEN. It creates pseudo-long reads from metagenomic short reads based on given reference genome sequences by considering small sequence variations existing in individual genomes of the same or different species. When applied to a mock community data set in the Human Microbiome Project, PLR-GEN dramatically extended short reads in length of 101 bp to pseudo-long reads with N50 of 33 Kbp and 0.4% error rate. The use of these pseudo-long reads generated by PLR-GEN resulted in an obvious improvement of metagenomic assembly in terms of the number of sequences, assembly contiguity, and prediction of species and genes. Conclusions PLR-GEN can be used to generate artificial long read sequences without spending extra sequencing cost, thus aiding various studies using metagenomes.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Generation and application of pseudo-long reads for metagenome assembly
    Sim, Mikang
    Lee, Jongin
    Wy, Suyeon
    Park, Nayoung
    Lee, Daehwan
    Kwon, Daehong
    Kim, Jaebum
    GIGASCIENCE, 2022, 11
  • [2] Generation and application of pseudo-long reads for metagenome assembly
    Sim, Mikang
    Lee, Jongin
    Wy, Suyeon
    Park, Nayoung
    Lee, Daehwan
    Kwon, Daehong
    Kim, Jaebum
    GIGASCIENCE, 2022, 11
  • [3] Konnector v2.0: pseudo-long reads from paired-end sequencing data
    Vandervalk, Benjamin P.
    Yang, Chen
    Xue, Zhuyi
    Raghavan, Karthika
    Chu, Justin
    Mohamadi, Hamid
    Jackman, Shaun D.
    Chiu, Readman
    Warren, Rene L.
    Birol, Inanc
    BMC MEDICAL GENOMICS, 2015, 8
  • [4] Konnector v2.0: pseudo-long reads from paired-end sequencing data
    Benjamin P Vandervalk
    Chen Yang
    Zhuyi Xue
    Karthika Raghavan
    Justin Chu
    Hamid Mohamadi
    Shaun D Jackman
    Readman Chiu
    René L Warren
    Inanç Birol
    BMC Medical Genomics, 8
  • [5] High-quality metagenome assembly from long accurate reads with metaMDBG
    Benoit, Gaetan
    Raguideau, Sebastien
    James, Robert
    Phillippy, Adam M.
    Chikhi, Rayan
    Quince, Christopher
    NATURE BIOTECHNOLOGY, 2024, 42 (09) : 1378 - 1383
  • [6] New approaches for metagenome assembly with short reads
    Ayling, Martin
    Clark, Matthew D.
    Leggett, Richard M.
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (02) : 584 - 594
  • [7] Metagenome assembly of high-fidelity long reads with hifiasm-meta
    Xiaowen Feng
    Haoyu Cheng
    Daniel Portik
    Heng Li
    Nature Methods, 2022, 19 : 671 - 674
  • [8] Metagenome assembly of high-fidelity long reads with hifiasm-meta
    Feng, Xiaowen
    Cheng, Haoyu
    Portik, Daniel
    Li, Heng
    NATURE METHODS, 2022, 19 (06) : 671 - +
  • [9] Combined assembly of long and short sequencing reads improve the efficiency of exploring the soil metagenome
    Xu, Guoshun
    Zhang, Liwen
    Liu, Xiaoqing
    Guan, Feifei
    Xu, Yuquan
    Yue, Haitao
    Huang, Jin-Qun
    Chen, Jieyin
    Wu, Ningfeng
    Tian, Jian
    BMC GENOMICS, 2022, 23 (01)
  • [10] Combined assembly of long and short sequencing reads improve the efficiency of exploring the soil metagenome
    Guoshun Xu
    Liwen Zhang
    Xiaoqing Liu
    Feifei Guan
    Yuquan Xu
    Haitao Yue
    Jin-Qun Huang
    Jieyin Chen
    Ningfeng Wu
    Jian Tian
    BMC Genomics, 23