naiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing

被引:0
|
作者
Kao, Wei-Chun [1 ]
Song, Yuri S. [1 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
关键词
GENOME; MATRIX;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Immense amounts of raw instrument data (i.e., images of fluorescence) are currently being generated using ultra high-throughput sequencing platforms. An important computational challenge associated with this rapid advancement is to develop efficient algorithms that can extract accurate sequence information from raw data. To address this challenge, we recently introduced a novel model-based base-calling algorithm that is fully parametric and has several advantages over previously proposed methods. Our original algorithm, called BayesCall, significantly reduced the error rate, particularly in the later cycles of a sequencing run, and also produced useful base-specific quality scores with a high discrimination ability. Unfortunately, however, BayesCall is too computationally expensive to be of broad practical use. In this paper, we build on our previous model-based approach to devise an efficient base-calling algorithm that is orders of magnitude faster than BayesCall, while still maintaining a comparably high level of accuracy. Our new algorithm is called naiveBayesCall, and it utilizes approximation and optimization methods to achieve scalability. We describe the performance of naiveBayesCall and demonstrate how improved base-calling accuracy may facilitate de novo assembly when the coverage is low to moderate.
引用
收藏
页码:233 / 247
页数:15
相关论文
共 50 条
  • [1] naiveBayesCall: An Efficient Model-Based Base-Calling Algorithm for High-Throughput Sequencing
    Kao, Wei-Chun
    Song, Yun S.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2011, 18 (03) : 365 - 377
  • [2] BayesCall: A model-based base-calling algorithm for high-throughput short-read sequencing
    Kao, Wei-Chun
    Stevens, Kristian
    Song, Yun S.
    [J]. GENOME RESEARCH, 2009, 19 (10) : 1884 - 1895
  • [3] BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution
    Ye, Chengxi
    Hsiao, Chiaowen
    Bravo, Hector Corrada
    [J]. BIOINFORMATICS, 2014, 30 (09) : 1214 - 1219
  • [4] Model-Based Quality Assessment and Base-Calling for Second-Generation Sequencing Data
    Bravo, Hector Corrada
    Irizarry, Rafael A.
    [J]. BIOMETRICS, 2010, 66 (03) : 665 - 674
  • [5] Nanopore Sequencing using a Hidden Markov Model for Base-Calling
    Timp, Winston
    Comer, Jeffrey
    Aksimentiev, Aleksei
    [J]. BIOPHYSICAL JOURNAL, 2013, 104 (02) : 211A - 211A
  • [6] MODEL-BASED SEQUENTIAL BASE CALLING FOR ILLUMINA SEQUENCING
    Das, Shreepriya
    Vikalo, Haris
    Hassibi, Arjang
    [J]. 2010 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2010,
  • [7] Improved base-calling and quality scores for 454 sequencing based on a Hurdle Poisson model
    De Beuf, Kristof
    De Schrijver, Joachim
    Thas, Olivier
    Van Criekinge, Wim
    Irizarry, Rafael A.
    Clement, Lieven
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [8] Improved base-calling and quality scores for 454 sequencing based on a Hurdle Poisson model
    Kristof De Beuf
    Joachim De Schrijver
    Olivier Thas
    Wim Van Criekinge
    Rafael A Irizarry
    Lieven Clement
    [J]. BMC Bioinformatics, 13
  • [9] Alternative base-calling algorithm for DNA sequencing based on four-label multicolor detection
    Song, JM
    Yeung, ES
    [J]. ELECTROPHORESIS, 2000, 21 (04) : 807 - 815
  • [10] Base-Calling Algorithm with Vocabulary (BCV) Method for Analyzing Population Sequencing Chromatograms
    Fantin, Yuri S.
    Neverov, Alexey D.
    Favorov, Alexander V.
    Alvarez-Figueroa, Maria V.
    Braslavskaya, Svetlana I.
    Gordukova, Maria A.
    Karandashova, Inga V.
    Kuleshov, Konstantin V.
    Myznikova, Anna I.
    Polishchuk, Maya S.
    Reshetov, Denis A.
    Voiciehovskaya, Yana A.
    Mironov, Andrei A.
    Chulanov, Vladimir P.
    [J]. PLOS ONE, 2013, 8 (01):