GAMS: Genome Assembly on Multi-GPU using String Graph

被引:0
|
作者
Jain, Gaurav [1 ]
Rathore, Lalchand [1 ]
Paul, Kolin [1 ]
机构
[1] IIT, Dept Comp Sci & Engn, Delhi, India
关键词
Denovo assembly; GPU; BWT; String graph; SEQUENCES; ERROR;
D O I
10.1109/HPCC-SmartCity-DSS.2016.108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Genome Sequence assembly is a very compute intensive problem in the field of bio-informatics. Many parallel algorithms has been proposed to accelerate this on multicores as well as clusters of machines. In recent times, the improved computation power of GPUs has enabled applications from various research fields to take advantage of the massive number of "cores" available in GPUs and multiple GPUs working together in parallel. In this paper we present the design and development of a Multi-GPU based assembler for sequence assembly using Nvidia's GPUs. We use the String Graph approach to circumvent the limited memory available in GPUs to build a parallel solution given that the string graph is a memory efficient data structure. Our assembler (GAMS) takes in a file of reads in fasta format produced by the current NGS technologies to build the string graph. Contigs are formed by grouping the regions of graph which can be unambiguously connected. In this paper, we also present parallel algorithms for string graph construction and graph simplification. We have benchmarked our assembler on five bacterial genomes and chr22 of the human genome. Our results show that the design on the Multi-GPU provides a 6-7x speedup over a state of the art parallel Velvet implementation. The quality of assembly produced is also significantly better.
引用
收藏
页码:348 / 355
页数:8
相关论文
共 50 条
  • [21] Locality-aware Thread Block Design in Single and Multi-GPU Graph Processing
    Fan, Quan
    Chen, Zizhong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2021, : 148 - 151
  • [22] Understanding Scalability of Multi-GPU Systems
    Feng, Yuan
    Jeon, Hyeran
    [J]. 15TH WORKSHOP ON GENERAL PURPOSE PROCESSING USING GPU, GPGPU 2023, 2023, : 36 - 37
  • [23] Moim: A Multi-GPU MapReduce Framework
    Xie, Mengjun
    Kang, Kyoung-Don
    Basaran, Can
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1279 - 1286
  • [24] An introduction to multi-GPU programming for physicists
    M. Bernaschi
    M. Bisson
    M. Fatica
    E. Phillips
    [J]. The European Physical Journal Special Topics, 2012, 210 : 17 - 31
  • [25] Efficient Implementation of MrBayes on Multi-GPU
    Bao, Jie
    Xia, Hongju
    Zhou, Jianfu
    Liu, Xiaoguang
    Wang, Gang
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (06) : 1471 - 1479
  • [26] Multi-level Clustering on Metric Spaces Using a Multi-GPU Platform
    Barrientos, Ricardo J.
    Gomez, Jose I.
    Tenllado, Christian
    Prieto Matias, Manuel
    Zezula, Pavel
    [J]. EURO-PAR 2013 PARALLEL PROCESSING, 2013, 8097 : 216 - 228
  • [27] Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems
    Ernsting, Steffen
    Kuchen, Herbert
    [J]. APPLICATIONS, TOOLS AND TECHNIQUES ON THE ROAD TO EXASCALE COMPUTING, 2012, 22 : 509 - 518
  • [28] Topology-Aware GPU Selection on Multi-GPU Nodes
    Faraji, Iman
    Mirsadeghi, Seyed H.
    Afsahi, Ahmad
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 712 - 720
  • [29] Efficient Breadth First Search on Multi-GPU Systems Using GPU-Centric OpenSHMEM
    Potluri, Sreeram
    Goswami, Anshuman
    Venkata, Manjunath Gorentla
    Imam, Neena
    [J]. OPENSHMEM AND RELATED TECHNOLOGIES: BIG COMPUTE AND BIG DATA CONVERGENCE, OPENSHMEM 2017, 2018, 10679 : 82 - 96
  • [30] Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI
    Xu, Ao
    Li, Bo -Tao
    [J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2023, 201