Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

被引:6
|
作者
Wang, Anqi [1 ,2 ]
Au, Kin Fai [1 ,2 ,3 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Univ Iowa, Dept Internal Med, Iowa City, IA 52242 USA
[3] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
关键词
SINGLE-MOLECULE; NANOPORE; GENOME; ACCURACY;
D O I
10.1186/s13059-019-1885-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We also demonstrate that the original error rate of 19% is the limit for perfect correction, beyond which long reads are too error-prone to be corrected by these methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Redundancy-Based Delivery Mechanism for Error-Prone Wireless Networks
    Yu, Yu-Ting
    Chao, Hsi-Lu
    2009 IEEE VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-5, 2009, : 3055 - 3059
  • [32] Object-based audio streaming over error-prone channels
    Marks, SK
    Gonzalez, R
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 261 - 264
  • [33] Graph-based minimum error entropy Kalman filtering
    Zhang, Kun
    Wang, Gang
    Zhou, Yuzheng
    He, Jiacheng
    Mao, Xuemei
    Peng, Bei
    SIGNAL PROCESSING, 2024, 222
  • [34] Context-Aware Adversarial Graph-Based Learning for Multilingual Grammatical Error Correction
    Kumar, Naresh
    Kumar, Parveen
    Tripath, Sushreeta
    Samal, Neelamani
    Gountia, Debasis
    Gatla, Praveen
    Singh, Teekam
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (12)
  • [35] A motion-based selective error protection method for scalable video over error-prone channel
    Wang, Yu
    Chau, Lap-Pui
    Yap, Kim-Hui
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 763 - 766
  • [36] TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads
    Xu, Mengyang
    Guo, Lidong
    Gu, Shengqiang
    Wang, Ou
    Zhang, Rui
    Peters, Brock A.
    Fan, Guangyi
    Liu, Xin
    Xu, Xun
    Deng, Li
    Zhang, Yongwei
    GIGASCIENCE, 2020, 9 (09):
  • [37] RankUp: Enhancing graph-based keyphrase extraction methods with error-feedback propagation
    Figueroa, Gerardo
    Chen, Po-Chi
    Chen, Yi-Shin
    COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 112 - 131
  • [38] Ratatosk: hybrid error correction of long reads enables accurate variant calling and assembly
    Guillaume Holley
    Doruk Beyter
    Helga Ingimundardottir
    Peter L. Møller
    Snædis Kristmundsdottir
    Hannes P. Eggertsson
    Bjarni V. Halldorsson
    Genome Biology, 22
  • [39] Ratatosk: hybrid error correction of long reads enables accurate variant calling and assembly
    Holley, Guillaume
    Beyter, Doruk
    Ingimundardottir, Helga
    Moller, Peter L.
    Kristmundsdottir, Snodis
    Eggertsson, Hannes P.
    Halldorsson, Bjarni, V
    GENOME BIOLOGY, 2021, 22 (01)
  • [40] Jabba: Hybrid Error Correction for Long Sequencing Reads Using Maximal Exact Matches
    Miclotte, Giles
    Heydari, Mahdi
    Demeester, Piet
    Audenaert, Pieter
    Fostier, Jan
    ALGORITHMS IN BIOINFORMATICS (WABI 2015), 2015, 9289 : 175 - 188