Velvet: Algorithms for de novo short read assembly using de Bruijn graphs

被引:7220
|
作者
Zerbino, Daniel R. [1 ]
Birney, Ewan [1 ]
机构
[1] EMBL European Bioinformat Inst, Cambridge CB10 1SD, England
基金
英国医学研究理事会;
关键词
D O I
10.1101/gr.074492.107
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of similar to 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.
引用
收藏
页码:821 / 829
页数:9
相关论文
共 50 条
  • [1] Parallelized short read assembly of large genomes using de Bruijn graphs
    Yongchao Liu
    Bertil Schmidt
    Douglas L Maskell
    [J]. BMC Bioinformatics, 12
  • [2] Parallelized short read assembly of large genomes using de Bruijn graphs
    Liu, Yongchao
    Schmidt, Bertil
    Maskell, Douglas L.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [3] De novo assembly and genotyping of variants using colored de Bruijn graphs
    Zamin Iqbal
    Mario Caccamo
    Isaac Turner
    Paul Flicek
    Gil McVean
    [J]. Nature Genetics, 2012, 44 : 226 - 232
  • [4] De novo assembly and genotyping of variants using colored de Bruijn graphs
    Iqbal, Zamin
    Caccamo, Mario
    Turner, Isaac
    Flicek, Paul
    McVean, Gil
    [J]. NATURE GENETICS, 2012, 44 (02) : 226 - 232
  • [5] Combining De Bruijn Graphs, Overlap Graphs and Microassembly for De Novo Genome Assembly
    Sergushichev, A. A.
    Alexandrov, A. V.
    Kazakov, S. V.
    Tsarev, F. N.
    Shalyto, A. A.
    [J]. IZVESTIYA SARATOVSKOGO UNIVERSITETA NOVAYA SERIYA-MATEMATIKA MEKHANIKA INFORMATIKA, 2013, 13 (02): : 10 - 10
  • [6] Proposal of a New Method for de Novo DNA Sequence Assembly Using de Bruijn Graphs
    Couto, Adriano Donato
    Cerqueira, Fabio Ribeiro
    Ferreira, Ricardo dos Santos
    Oliveira, Alcione de Paiva
    [J]. INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 307 - 317
  • [7] Read mapping on de Bruijn graphs
    Limasset, Antoine
    Cazaux, Bastien
    Rivals, Eric
    Peterlongo, Pierre
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [8] Read mapping on de Bruijn graphs
    Antoine Limasset
    Bastien Cazaux
    Eric Rivals
    Pierre Peterlongo
    [J]. BMC Bioinformatics, 17
  • [9] K-mer Mapping and De Bruijn Graphs: the case for Velvet Fragment Assembly
    de Armas, Elvismary Molina
    Haeusler, Edward Hermann
    Lifschitz, Sergio
    de Holanda, Maristela Terto
    Cordeiro da Silva, Waldeyr Mendes
    Gomes Ferreira, Paulo Cavalcanti
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 882 - 889
  • [10] Overlap graphs and de Bruijn graphs: data structures for de novo genome assembly in the big data era
    Rizzi, Raffaella
    Beretta, Stefano
    Patterson, Murray
    Pirola, Yuri
    Previtali, Marco
    Della Vedova, Gianluca
    Bonizzoni, Paola
    [J]. QUANTITATIVE BIOLOGY, 2019, 7 (04) : 278 - 292