Hybrid-hybrid correction of errors in long reads with HERO

被引:3
|
作者
Kang, Xiongbin [1 ,2 ]
Xu, Jialu [1 ]
Luo, Xiao [1 ]
Schoenhuth, Alexander [2 ]
机构
[1] Hunan Univ, Coll Biol, Changsha, Peoples R China
[2] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
基金
欧洲研究理事会;
关键词
Correction of sequencing errors; Haplotype specific variation; Metagenome sequencing; Third-generation sequencing reads; Genome assembly; SINGLE-CELL; ACCURATE;
D O I
10.1186/s13059-023-03112-7
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Although generally superior, hybrid approaches for correcting errors in third-generation sequencing (TGS) reads, using next-generation sequencing (NGS) reads, mistake haplotype-specific variants for errors in polyploid and mixed samples. We suggest HERO, as the first "hybrid-hybrid" approach, to make use of both de Bruijn graphs and overlap graphs for optimal catering to the particular strengths of NGS and TGS reads. Extensive benchmarking experiments demonstrate that HERO improves indel and mismatch error rates by on average 65% (27 similar to 95%) and 20% (4 similar to 61%). Using HERO prior to genome assembly significantly improves the assemblies in the majority of the relevant categories.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] Hybrid-hybrid correction of errors in long reads with HERO
    Xiongbin Kang
    Jialu Xu
    Xiao Luo
    Alexander Schönhuth
    Genome Biology, 24
  • [2] A hybrid and scalable error correction algorithm for indel and substitution errors of long reads
    Arghya Kusum Das
    Sayan Goswami
    Kisung Lee
    Seung-Jong Park
    BMC Genomics, 20
  • [3] A hybrid and scalable error correction algorithm for indel and substitution errors of long reads
    Das, Arghya Kusum
    Goswami, Sayan
    Lee, Kisung
    Park, Seung-Jong
    BMC GENOMICS, 2019, 20 (Suppl 11)
  • [4] Jabba: hybrid error correction for long sequencing reads
    Giles Miclotte
    Mahdi Heydari
    Piet Demeester
    Stephane Rombauts
    Yves Van de Peer
    Pieter Audenaert
    Jan Fostier
    Algorithms for Molecular Biology, 11
  • [5] Jabba: hybrid error correction for long sequencing reads
    Miclotte, Giles
    Heydari, Mahdi
    Demeester, Piet
    Rombauts, Stephane
    Van de Peer, Yves
    Audenaert, Pieter
    Fostier, Jan
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2016, 11
  • [6] Efficient Hybrid De Novo Error Correction and Assembly for Long Reads
    Kchouk, Mehdi
    Elloumi, Mourad
    2016 27TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2016, : 88 - 92
  • [7] HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
    Choudhury, Olivia
    Chakrabarty, Ankush
    Emrich, Scott J.
    SCIENTIFIC REPORTS, 2018, 8
  • [8] HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
    Olivia Choudhury
    Ankush Chakrabarty
    Scott J. Emrich
    Scientific Reports, 8
  • [9] Identification and correction of substitution errors in Moleculo long reads
    Price, Jared
    Ward, Judson
    Udall, Joshua
    Snell, Quinn
    Clement, Mark
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2013,
  • [10] ShyLU: A Hybrid-Hybrid Solver for Multicore Platforms
    Rajamanickam, Sivasankaran
    Boman, Erik G.
    Heroux, Michael A.
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 631 - 643