An Error Correction Algorithm for NGS Data

被引:2
|
作者
Kchouk, Mehdi [1 ]
Gibrat, Jean-Francois [2 ]
Elloumi, Mourad [1 ]
机构
[1] Univ Tunis, Natl Super Sch Engineers Tunis ENSIT, Lab Technol Informat & Commun & Elect Engn LaTICE, Tunis, Tunisia
[2] French Natl Inst Agr Res INRA, Res Unit Appl Math & Comp Sci Genomes Environm Ma, Jouy En Josas, France
关键词
NGS Data; Long Reads; Denovo Error Correction; Hashing; Alignment;
D O I
10.1109/DEXA.2017.33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Oxford Nanopore and Pacbio SMRT sequencing technologies has revolutionized the Next-Generation Sequencing (NGS) environment by producing long reads that exceed 60 kbp and helped to the completion of many biological projects. But, long reads are characterized by a high error rate which increases the difficulty of biological problems like the genome assembly problem. Error correction of long reads has become a challenge for bioinformaticians, which motivates the development of new approaches for error correction adapted to NGS technologies. In this paper, we present a new denovo self-error correction algorithm using only long reads. Our algorithm operates in two steps: First, we use a fast hashing method which allows to find alignments between the longest reads and other reads in a set of long reads. Next, we use the longest reads as seeds to obtain the final alignment of long reads by using a dynamic programming algorithm in a band of width w. Our error correction algorithm does not require high quality reads, in contrast to existing hybrid error correction ones.
引用
收藏
页码:84 / 87
页数:4
相关论文
共 50 条
  • [1] Efficient algorithms for error correction and compression of NGS data
    Saha, Subrata
    Rajasekaran, Sanguthevar
    [J]. 2014 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2014,
  • [2] Objective review of de novo stand-alone error correction methods for NGS data
    Alic, Andy S.
    Ruzafa, David
    Dopazo, Joaquin
    Blanquer, Ignacio
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2016, 6 (02) : 111 - 146
  • [3] Error correction and diversity analysis of population mixtures determined by NGS
    Wood, Graham R.
    Burroughs, Nigel J.
    Evans, David J.
    Ryabov, Eugene V.
    [J]. PEERJ, 2014, 2
  • [4] DATA ERROR CORRECTION
    MORANT, AJ
    [J]. ELECTRONICS & WIRELESS WORLD, 1987, 93 (1622): : 1259 - 1260
  • [5] Research on Data Caching System with Error Correction Algorithm in aerial camera
    Liao Jiawen
    Ren Long
    Wang Hua
    Xie Qingsheng
    Guo Huinan
    Wang Huawei
    Wang Zefeng
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2014, : 960 - 963
  • [6] Improved Algorithm for Error Correction
    Toghuj, Wael
    Alkhatib, Ghazi
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2011, 6 (01) : 1 - 12
  • [7] Quantifying sequencing error and effective sequencing depth of liquid biopsy NGS with UMI error correction
    Frank, Malene Stochkel
    Fuss, Janina
    Steiert, Tim Alexander
    Streleckiene, Greta
    Gehl, Julie
    Forster, Michael
    [J]. BIOTECHNIQUES, 2021, 70 (04) : 226 - 232
  • [8] A efficient algorithm for spectrum error correction
    Zhu, L.
    Xiong, Y.
    [J]. Zhendong Gongcheng Xuebao/Journal of Vibration Engineering, 2001, 14 (02): : 166 - 171
  • [9] Solving the input data in error correction of optical fabrication by finite Fourier coefficient algorithm
    Haobo Cheng
    Guozheng Sun
    Zhijing Feng
    [J]. Science in China Series E, 2006, 49 : 50 - 60
  • [10] Algorithm for solving the input data in the error correction of free-form curved lens
    Wu, Hongzhong
    Feng, Zhijing
    Zhao, Guangmu
    [J]. Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2002, 13 (01):