Succinct colored de Bruijn graphs

Cited by: 60
Authors
Muggli, Martin D. [1]
Bowe, Alexander [2]
Noyes, Noelle R. [3]
Morley, Paul S. [3]
Belk, Keith E. [4]
Raymond, Robert [1]
Gagie, Travis [5, 6]
Puglisi, Simon J. [7]
Boucher, Christina [1]
Affiliations
[1] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
[2] Natl Inst Informat, Dept Informat, Chiyoda Ku, Tokyo, Japan
[3] Colorado State Univ, Dept Clin Sci, Ft Collins, CO 80523 USA
[4] Colorado State Univ, Dept Anim Sci, Ft Collins, CO 80523 USA
[5] Diego Portales Univ, Sch Comp Sci & Telecommun, Santiago, Chile
[6] CEBIB, Santiago, Chile
[7] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
Funding
Academy of Finland
Keywords
GENOME; ASSEMBLER
DOI
10.1093/bioinformatics/btx067
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: In 2012, Iqbal et al. introduced the colored de Bruijn graph, a variant of the classic de Bruijn graph aimed at 'detecting and genotyping simple and complex genetic variants in an individual or population'. Because these graphs are intended for massive population-level data, they must be represented efficiently. Unfortunately, current succinct de Bruijn graph representations are not directly applicable to the colored de Bruijn graph, which requires additional information to be succinctly encoded, as well as support for non-standard traversal operations.
Results: Our data structure dramatically reduces the memory required to store and use the colored de Bruijn graph, at some cost in runtime, allowing it to be applied to much larger and more ambitious sequencing projects than was previously possible.
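To make the "additional information" and the "non-standard traversal operations" concrete, here is a minimal, uncompressed sketch of a colored de Bruijn graph. It is not the succinct data structure the paper describes; it is a plain dictionary-based illustration under several assumptions: nodes are k-mers, each sample is one color identified by its index, and the names `build_colored_dbg` and `successors` are hypothetical.

```python
from collections import defaultdict


def build_colored_dbg(samples, k):
    """Map each k-mer to the set of colors (sample indices) that contain it.

    Illustrative only: a plain dict, not a succinct representation.
    """
    colors = defaultdict(set)
    for color, reads in enumerate(samples):
        for read in reads:
            for i in range(len(read) - k + 1):
                colors[read[i:i + k]].add(color)
    return dict(colors)


def successors(graph, kmer, color=None):
    """Return k-mers that overlap `kmer` by k-1 characters.

    If `color` is given, keep only successors carrying that color,
    which is the kind of color-aware traversal a plain de Bruijn
    graph does not need to support.
    """
    result = []
    for base in "ACGT":
        candidate = kmer[1:] + base
        if candidate in graph and (color is None or color in graph[candidate]):
            result.append(candidate)
    return result


# Two toy samples that share a prefix and then diverge (a simple variant).
samples = [["ACGTA"], ["ACGTC"]]
graph = build_colored_dbg(samples, k=3)
shared = graph["ACG"]                            # k-mer present in both samples
branch_all = successors(graph, "CGT")            # both branches
branch_one = successors(graph, "CGT", color=1)   # only sample 1's branch
```

Storing a Python set per k-mer is exactly what becomes prohibitive at population scale, which is the memory cost the abstract says a succinct representation is meant to reduce.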
Pages: 3181-3187
Number of pages: 7