cloudSPAdes: assembly of synthetic long reads using de Bruijn graphs

被引:18
|
作者
Tolstoganov, Ivan [1 ]
Bankevich, Anton [2 ]
Chen, Zhoutao [3 ]
Pevzner, Pavel A. [1 ,2 ]
机构
[1] St Petersburg State Univ, Inst Translat Biomed, Ctr Algorithm Biotechnol, St Petersburg, Russia
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] Universal Sequencing Technol Corp, Carlsbad, CA USA
基金
俄罗斯科学基金会;
关键词
DNA EXTRACTION; GENOME; ACCURATE;
D O I
10.1093/bioinformatics/btz349
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly. Results We describe the algorithmic challenge of the SLR assembly and present a cloudSPAdes algorithm for SLR assembly that is based on analyzing the de Bruijn graph of SLRs. We benchmarked cloudSPAdes across various barcoding technologies/applications and demonstrated that it improves on the state-of-the-art SLR assemblers in accuracy and speed. Availability and implementation Source code and installation manual for cloudSPAdes are available at https://github.com/ablab/spades/releases/tag/cloudspades-paper. Supplementary Information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:I61 / I70
页数:10
相关论文
共 50 条
  • [31] Generalized de Bruijn graphs
    F. M. Malyshev
    V. E. Tarakanov
    Mathematical Notes, 1997, 62 : 449 - 456
  • [32] Enhanced de Bruijn graphs
    Guzide, O
    Wagh, MD
    AMCS '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ALGORITHMIC MATHEMATICS AND COMPUTER SCIENCE, 2005, : 23 - 28
  • [33] On the Representation of de Bruijn Graphs
    Chikhi, Rayan
    Limasset, Antoine
    Jackman, Shaun
    Simpson, Jared T.
    Medvedev, Paul
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB2014, 2014, 8394 : 35 - 55
  • [34] On the Representation of De Bruijn Graphs
    Chikhi, Rayan
    Limasset, Antoine
    Jackman, Shaun
    Simpson, Jared T.
    Medvedev, Paul
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (05) : 336 - 352
  • [35] Shifted de Bruijn Graphs
    Freij, Ragnar
    CODING THEORY AND APPLICATIONS, 4TH INTERNATIONAL CASTLE MEETING, 2015, 3 : 195 - 202
  • [36] Manifold de Bruijn Graphs
    Lin, Yu
    Pevzner, Pavel A.
    ALGORITHMS IN BIOINFORMATICS, 2014, 8701 : 296 - 310
  • [37] Bijections in de Bruijn Graphs
    Rukavicka, Josef
    ARS COMBINATORIA, 2019, 143 : 215 - 226
  • [38] Consensus on de Bruijn graphs
    G. Yan
    Z. -Q. Fu
    G. Chen
    The European Physical Journal B, 2008, 63 : 515 - 520
  • [39] Generalized de Bruijn graphs
    Malyshev, Fedor M.
    DISCRETE MATHEMATICS AND APPLICATIONS, 2022, 32 (01): : 11 - 38
  • [40] Generalized de bruijn graphs
    Zdzistaw, Grodzki
    Wronski, Aleksandr
    Journal of Information Processing and Cybernetics, 1994, 30 (01):