A de novo Genome Assembler based on MapReduce and Bi-directed de Bruijn Graph

被引:0
|
作者
Zhang, Yuehua [1 ]
Xuan, Pengfei [1 ]
Wang, Yunsheng [1 ]
Srimani, Pradip K. [1 ]
Luo, Feng [1 ]
机构
[1] Clemson Univ, Sch Comp, Clemson, SC 29634 USA
关键词
next generation sequencing (NGS); assembly; MapReduce; bi-directed de Bruijn graph; ALGORITHMS; PARALLEL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The next generation sequencing (NGS) techniques have enabled biologists to generate large DNA sequences in a high-throughput and low-cost way. Assembly of NGS reads still face great challenges due to the short reads and enormous high volume. In this paper, we presented a new assembler, called GAMR, which is based on bi-directed de Bruijn graph and implemented using MapReduce framework. We designed distributed algorithm for each step in GAMR, making it scalable in assembling large-scale genomes. We evaluated GAMR using GAGE's data and compared it against other NGS assemblers. The results showed GAMR assembled contigs and scaffolds with better accuracy and longer N50 values.
引用
收藏
页码:65 / 71
页数:7
相关论文
共 50 条
  • [21] SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale
    Meng, Jintao
    Seo, Sangmin
    Balaji, Pavan
    Wei, Yanjie
    Wang, Bingqiang
    Feng, Shenzhong
    [J]. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 195 - 204
  • [22] TransLiG: a de novo transcriptome assembler that uses line graph iteration
    Juntao Liu
    Ting Yu
    Zengchao Mu
    Guojun Li
    [J]. Genome Biology, 20
  • [23] TransLiG: a de novo transcriptome assembler that uses line graph iteration
    Liu, Juntao
    Yu, Ting
    Mu, Zengchao
    Li, Guojun
    [J]. GENOME BIOLOGY, 2019, 20 (1)
  • [24] De-Bruijn graph with MapReduce framework towards metagenomic data classification
    Kamal M.S.
    Parvin S.
    Ashour A.S.
    Shi F.
    Dey N.
    [J]. International Journal of Information Technology, 2017, 9 (1) : 59 - 75
  • [25] Combining De Bruijn Graphs, Overlap Graphs and Microassembly for De Novo Genome Assembly
    Sergushichev, A. A.
    Alexandrov, A. V.
    Kazakov, S. V.
    Tsarev, F. N.
    Shalyto, A. A.
    [J]. IZVESTIYA SARATOVSKOGO UNIVERSITETA NOVAYA SERIYA-MATEMATIKA MEKHANIKA INFORMATIKA, 2013, 13 (02): : 10 - 10
  • [26] PadeNA: A PARALLEL DE NOVO ASSEMBLER
    Thareja, Gaurav
    Kumar, Vivek
    Zyskowski, Mike
    Mercer, Simon
    Davidson, Bob
    [J]. BIOINFORMATICS 2011, 2011, : 196 - +
  • [27] Parallelized De Bruijn graph construction and simplification for genome assembly
    [J]. Cheng, J.-F. (jiefengcheng@gmail.com), 1600, Chinese Academy of Sciences (24):
  • [29] Cutwidth of the De Bruijn graph
    Raspaud, A
    Sykora, O
    Vrto, I
    [J]. RAIRO-INFORMATIQUE THEORIQUE ET APPLICATIONS-THEORETICAL INFORMATICS AND APPLICATIONS, 1995, 29 (06): : 509 - 514
  • [30] De novo short read assembler
    不详
    [J]. NATURE METHODS, 2012, 9 (02) : 125 - 125