Hybrid-hybrid correction of errors in long reads with HERO

Cited by: 3
Authors
Kang, Xiongbin [1, 2]
Xu, Jialu [1]
Luo, Xiao [1]
Schoenhuth, Alexander [2]
Affiliations
[1] Hunan Univ, Coll Biol, Changsha, Peoples R China
[2] Bielefeld Univ, Fac Technol, Genome Data Sci, Bielefeld, Germany
Funding
European Research Council
Keywords
Correction of sequencing errors; Haplotype specific variation; Metagenome sequencing; Third-generation sequencing reads; Genome assembly; SINGLE-CELL; ACCURATE;
DOI
10.1186/s13059-023-03112-7
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Hybrid approaches that correct errors in third-generation sequencing (TGS) reads using next-generation sequencing (NGS) reads are generally superior, but they mistake haplotype-specific variants for errors in polyploid and mixed samples. We propose HERO, the first "hybrid-hybrid" approach, which uses both de Bruijn graphs and overlap graphs to cater to the particular strengths of NGS and TGS reads. Extensive benchmarking experiments demonstrate that HERO improves indel and mismatch error rates by an average of 65% (27% to 95%) and 20% (4% to 61%), respectively. Using HERO prior to genome assembly significantly improves the assemblies in the majority of the relevant categories.
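The abstract names two graph structures, de Bruijn graphs and overlap graphs. As a rough illustration of how they differ, the following toy Python sketch builds both from a handful of reads. It is not HERO's implementation: the function names (`de_bruijn_edges`, `suffix_prefix_overlap`, `overlap_graph`) and the parameters `k` and `min_len` are hypothetical, and it uses exact string matching, whereas real tools must tolerate sequencing errors.

```python
from collections import defaultdict


def de_bruijn_edges(reads, k):
    """Toy de Bruijn graph: each k-mer adds an edge prefix(k-1) -> suffix(k-1)."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


def suffix_prefix_overlap(a, b, min_len):
    """Length of the longest suffix of a equal to a prefix of b.

    Returns 0 if no exact overlap of at least min_len (assumed >= 1) exists.
    """
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a[-n:] == b[:n]:
            return n
    return 0


def overlap_graph(reads, min_len):
    """Toy overlap graph: edge (i, j) -> overlap length between whole reads."""
    graph = {}
    for i, a in enumerate(reads):
        for j, b in enumerate(reads):
            if i != j:
                n = suffix_prefix_overlap(a, b, min_len)
                if n:
                    graph[(i, j)] = n
    return graph


if __name__ == "__main__":
    reads = ["ACGTAC", "TACGG"]
    print(de_bruijn_edges(reads, 3))
    print(overlap_graph(reads, 2))
```

In this sketch the de Bruijn graph breaks reads into short k-mers, which suits short, accurate reads, while the overlap graph keeps each read intact as a node, which preserves the long-range information carried by long reads.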
Pages: 39