GAMS: Genome Assembly on Multi-GPU using String Graph

被引:0
|
作者
Jain, Gaurav [1 ]
Rathore, Lalchand [1 ]
Paul, Kolin [1 ]
机构
[1] IIT, Dept Comp Sci & Engn, Delhi, India
关键词
Denovo assembly; GPU; BWT; String graph; SEQUENCES; ERROR;
D O I
10.1109/HPCC-SmartCity-DSS.2016.108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Genome Sequence assembly is a very compute intensive problem in the field of bio-informatics. Many parallel algorithms has been proposed to accelerate this on multicores as well as clusters of machines. In recent times, the improved computation power of GPUs has enabled applications from various research fields to take advantage of the massive number of "cores" available in GPUs and multiple GPUs working together in parallel. In this paper we present the design and development of a Multi-GPU based assembler for sequence assembly using Nvidia's GPUs. We use the String Graph approach to circumvent the limited memory available in GPUs to build a parallel solution given that the string graph is a memory efficient data structure. Our assembler (GAMS) takes in a file of reads in fasta format produced by the current NGS technologies to build the string graph. Contigs are formed by grouping the regions of graph which can be unambiguously connected. In this paper, we also present parallel algorithms for string graph construction and graph simplification. We have benchmarked our assembler on five bacterial genomes and chr22 of the human genome. Our results show that the design on the Multi-GPU provides a 6-7x speedup over a state of the art parallel Velvet implementation. The quality of assembly produced is also significantly better.
引用
收藏
页码:348 / 355
页数:8
相关论文
共 50 条
  • [1] Multi-GPU Graph Analytics
    Pan, Yuechao
    Wang, Yangzihao
    Wu, Yuduo
    Yang, Carl
    Owens, John D.
    [J]. 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 479 - 490
  • [2] A Distributed Multi-GPU System for Fast Graph Processing
    Jia, Zhihao
    Kwon, Yongkee
    Shipman, Galen
    McCormick, Pat
    Erez, Mattan
    Aiken, Alex
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (03): : 297 - 310
  • [3] Benchmarking multi-GPU applications on modern multi-GPU integrated systems
    Bernaschi, Massimo
    Agostini, Elena
    Rossetti, Davide
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (14):
  • [4] Fast STA Graph Partitioning Framework for Multi-GPU Acceleration
    Guo, Guannan
    Huang, Tsung-Wei
    Wong, Martin
    [J]. 2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [5] Large-Scale Graph Processing on Multi-GPU Platforms
    Zhang, Heng
    Zhang, Libo
    Wu, yanjun
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2018, 55 (02): : 273 - 288
  • [6] Computation and Communication Aware Task Graph Scheduling on Multi-GPU Systems
    Wang, Yun-Ting
    Lee, Jia-Ying
    Lai, Bo-Cheng Charles
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 115 - 119
  • [7] Modelling Multi-GPU Systems
    Spampinato, Daniele G.
    Elster, Anne C.
    Natvig, Thorvald
    [J]. PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 562 - 569
  • [8] Multi-GPU Kinetic Solvers using MPI and CUDA
    Zabelok, Sergey
    Arslanbekov, Robert
    Kolobov, Vladimir
    [J]. PROCEEDINGS OF THE 29TH INTERNATIONAL SYMPOSIUM ON RAREFIED GAS DYNAMICS, 2014, 1628 : 539 - 546
  • [9] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
    Chen, Yi
    Chen, Su
    Jiang, Hai
    [J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
  • [10] Sphynx: A parallel multi-GPU graph partitioner for distributed-memory systems
    Acer, Seher
    Boman, Erik G.
    Glusa, Christian A.
    Rajamanickam, Sivasankaran
    [J]. PARALLEL COMPUTING, 2021, 106