De novo diploid genome assembly using long noisy reads

Authors
Fan Nie
Peng Ni
Neng Huang
Jun Zhang
Zhenyu Wang
Chuanle Xiao
Feng Luo
Jianxin Wang
Affiliations
[1] Central South University, School of Computer Science and Engineering
[2] Xiangjiang Laboratory, National Center for Applied Mathematics in Hunan and Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education
[3] Xiangtan University, Hunan Provincial Key Lab on Bioinformatics
[4] Central South University, Institute of Nanfan & Seed Industry
[5] Guangdong Academy of Sciences, State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center
[6] Sun Yat-sen University #7 Jinsui Road, School of Computing
[7] Tianhe District
[8] Clemson University
Abstract
The high sequencing error rate has impeded the application of long noisy reads for diploid genome assembly, and most existing assemblers fail to generate high-quality phased assemblies from such reads. Here, we present PECAT, a Phased Error Correction and Assembly Tool, for reconstructing diploid genomes from long noisy reads. We design a haplotype-aware error correction method that retains heterozygous alleles while correcting sequencing errors. We combine a corrected-read SNP caller and a raw-read SNP caller to further improve the identification of inconsistent overlaps in the string graph, and we use a grouping method to assign reads to different haplotype groups. PECAT efficiently assembles diploid genomes using only Nanopore R9, PacBio CLR, or Nanopore R10 reads, and it generates more contiguous haplotype-specific contigs than other assemblers. In particular, PECAT achieves a nearly haplotype-resolved assembly of B. taurus (Bison×Simmental) using Nanopore R9 reads, and phase block NG50 values of 59.4/58.0 Mb for HG002 using Nanopore R10 reads.
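The abstract reports phase block NG50 without defining it. As a minimal sketch, assuming the standard definition (the largest length L such that blocks of length at least L together cover at least half of the reference genome size), the metric can be computed as below. The function name `ng50` and the toy block lengths are illustrative assumptions and are not taken from the paper or from PECAT.

```python
from typing import Iterable, Optional


def ng50(lengths: Iterable[int], genome_size: int) -> Optional[int]:
    """Return the NG50 of `lengths` relative to `genome_size`.

    Blocks are visited from longest to shortest; the result is the length of
    the block at which the running total first reaches half of the genome
    size. Returns None if the blocks cover less than half of the genome.
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    covered = 0
    for length in sorted(lengths, reverse=True):
        covered += length
        # Compare 2 * covered to genome_size to stay in integer arithmetic.
        if 2 * covered >= genome_size:
            return length
    return None


# Hypothetical phase block lengths (Mb) for a toy 100 Mb genome.
toy_blocks = [40, 25, 15, 10, 5]
print(ng50(toy_blocks, genome_size=100))
```

Unlike N50, which is measured against the total assembled length, NG50 is measured against a fixed genome size, so values from different assemblers on the same sample are directly comparable.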
Source
Nature Communications, 2024, 15 (01)