Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

被引:6
|
作者
Wang, Anqi [1 ,2 ]
Au, Kin Fai [1 ,2 ,3 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Univ Iowa, Dept Internal Med, Iowa City, IA 52242 USA
[3] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
关键词
SINGLE-MOLECULE; NANOPORE; GENOME; ACCURACY;
D O I
10.1186/s13059-019-1885-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We also demonstrate that the original error rate of 19% is the limit for perfect correction, beyond which long reads are too error-prone to be corrected by these methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Efficient Hybrid De Novo Error Correction and Assembly for Long Reads
    Kchouk, Mehdi
    Elloumi, Mourad
    2016 27TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2016, : 88 - 92
  • [22] HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
    Choudhury, Olivia
    Chakrabarty, Ankush
    Emrich, Scott J.
    SCIENTIFIC REPORTS, 2018, 8
  • [23] HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
    Olivia Choudhury
    Ankush Chakrabarty
    Scott J. Emrich
    Scientific Reports, 8
  • [24] Prescription-based error concealment technique for video transmission on error-prone channels
    Lie, Wen-Nung
    Lin, Tom C. -I.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (04) : 310 - 321
  • [25] Smooth q-Gram, and Its Applications to Detection of Overlaps among Long, Error-Prone Sequencing Reads
    Zhang, Haoyu
    Zhang, Qin
    Tang, Haixu
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 267 - 276
  • [26] A hybrid and scalable error correction algorithm for indel and substitution errors of long reads
    Arghya Kusum Das
    Sayan Goswami
    Kisung Lee
    Seung-Jong Park
    BMC Genomics, 20
  • [27] A hybrid and scalable error correction algorithm for indel and substitution errors of long reads
    Das, Arghya Kusum
    Goswami, Sayan
    Lee, Kisung
    Park, Seung-Jong
    BMC GENOMICS, 2019, 20 (Suppl 11)
  • [28] Hybrid Error Correction approach and DeNovo Assembly for MinIon Sequencing Long Reads
    Kchouk, Mehdi
    Elloumi, Mourad
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 122 - 125
  • [29] Propensity Score-Based Estimators With Multiple Error-Prone Covariates
    Hong, Hwanhee
    Aaby, David A.
    Siddique, Juned
    Stuart, Elizabeth A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2019, 188 (01) : 222 - 230
  • [30] AN ERROR RESILIENCE TECHNIQUE BASED ON FMO AND ERROR PROPAGATION FOR H.264 VIDEO CODING IN ERROR-PRONE CHANNELS
    Vu, Tien Huu
    Aramvith, Supavadee
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 205 - 208