A Dynamic Hashing Approach to Build the de Bruijn Graph for Genome Assembly

被引:0
|
作者
Zhao, Kun [1 ]
Liu, Weiguo [1 ]
Voss, Gerrit [2 ]
Mueller-Wittig, Wolfgang [2 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China
[2] Nanyang Technol Univ, Fraunhofer IDMNTU, Singapore, Singapore
关键词
Dynamic hashing; De Bruijn graph; Genome assembly; SHORT DNA-SEQUENCES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The development of next-generation sequencing technologies has revolutionized the genome research and given rise to the explosive increase of DNA sequencing throughput. However, due to the continuing explosive growth of short-read database, these technologies face the challenges of short overlap and high throughput. The de Bruijn graph is particularly suitable for short-read assemblies, and its advantage is that the graph size will not be affected by the high redundancy of deep read coverage. With this character, the fragment assembly is cast as finding a path visiting every edge in the graph exactly once. In this paper, we present a new method to accelerate the genome assembly procedure. We have used a distributed dynamic hashing approach to construct the de Bruijn graph from short-read data. Evaluations using three paired-end datasets show that, our method outperforms previous parallel and distributed assemblers on a CPU cluster system.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Parallelized De Bruijn graph construction and simplification for genome assembly
    [J]. Cheng, J.-F. (jiefengcheng@gmail.com), 1600, Chinese Academy of Sciences (24):
  • [2] Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly
    Georganas, Evangelos
    Buluc, Aydin
    Chapman, Jarrod
    Oliker, Leonid
    Rokhsar, Daniel
    Yelick, Katherine
    [J]. SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 437 - 448
  • [3] HaVec: An Efficient de Bruijn Graph Construction Algorithm for Genome Assembly
    Limon, Mahfuzer Rahman
    Sharker, Ratul
    Biswas, Sajib
    Rahman, M. Sohel
    [J]. INTERNATIONAL JOURNAL OF GENOMICS, 2017, 2017
  • [4] A New Approach for De Bruijn Graph Construction in De Novo Genome Assembling
    de Armas, Elvismary Molina
    Castro, Liester Cruz
    Holanda, Maristela
    Lifschitz, Sergio
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1842 - 1849
  • [5] Parallelizing Big De Bruijn Graph Traversal for Genome Assembly on GPU Clusters
    Qiu, Shuang
    Feng, Zonghao
    Luo, Qiong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 466 - 470
  • [6] A Classification of de Bruijn Graph Approaches for De Novo Fragment Assembly
    de Armas, Elvismary Molina
    Holanda, Maristela
    de Oliveira, Daniel
    Almeida, Nalvo F.
    Lifschitz, Sergio
    [J]. ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2020, 2020, 12558 : 1 - 12
  • [7] How to apply de Bruijn graphs to genome assembly
    Phillip E C Compeau
    Pavel A Pevzner
    Glenn Tesler
    [J]. Nature Biotechnology, 2011, 29 : 987 - 991
  • [8] How to apply de Bruijn graphs to genome assembly
    Compeau, Phillip E. C.
    Pevzner, Pavel A.
    Tesler, Glenn
    [J]. NATURE BIOTECHNOLOGY, 2011, 29 (11) : 987 - 991
  • [9] Integration of string and de Bruijn graphs for genome assembly
    Huang, Yao-Ting
    Liao, Chen-Fu
    [J]. BIOINFORMATICS, 2016, 32 (09) : 1301 - 1307
  • [10] Scalable Genome Assembly through Parallel de Bruijn Graph Construction for Multiple k-mers
    Kanak Mahadik
    Christopher Wright
    Milind Kulkarni
    Saurabh Bagchi
    Somali Chaterji
    [J]. Scientific Reports, 9