Large-Scale Pairwise Alignments on GPU Clusters: Exploring the Implementation Space

被引:0
|
作者
Huan Truong
Da Li
Kittisak Sajjapongse
Gavin Conant
Michela Becchi
机构
[1] University of Missouri,MU Informatics Institute
[2] University of Missouri,Department of Electrical and Computer Engineering
[3] University of Missouri,Division of Animal Sciences
来源
关键词
Heterogeneous system; Sequence alignment; GPU;
D O I
暂无
中图分类号
学科分类号
摘要
Several problems in computational biology require the all-against-all pairwise comparisons of tens of thousands of individual biological sequences. Each such comparison can be performed with the well-known Needleman-Wunsch alignment algorithm. However, with the rapid growth of biological databases, performing all possible comparisons with this algorithm in serial becomes extremely time-consuming. The massive computational power of graphics processing units (GPUs) makes them an appealing choice for accelerating these computations. As such, CPU-GPU clusters can enable all-against-all comparisons on large datasets. In this work, we present four GPU implementations for large-scale pairwise sequence alignment: TiledDScan-mNW, DScan-mNW, RScan-mNW and LazyRScan-mNW. The proposed GPU kernels exhibit different parallelization patterns: we discuss how each parallelization strategy affects the memory accesses and the utilization of the underlying GPU hardware. We evaluate our implementations on a variety of low- and high-end GPUs with different compute capabilities. Our results show that all the proposed solutions outperform the existing open-source implementation from the Rodinia Benchmark Suite, and LazyRScan-mNW is the preferred solution for applications that require performing the trace-back operation only on a subset of the considered sequence pairs (for example, the pairs whose alignment score exceeds a predefined threshold). Finally, we discuss the integration of the proposed GPU kernels into a hybrid MPI-CUDA framework for deployment on CPU-GPU clusters. In particular, our proposed distributed design targets both homogeneous and heterogeneous clusters with nodes that differ amongst themselves in their hardware configuration.
引用
收藏
页码:131 / 149
页数:18
相关论文
共 50 条
  • [1] Large-Scale Pairwise Alignments on GPU Clusters: Exploring the Implementation Space
    Huan Truong
    Li, Da
    Sajjapongse, Kittisak
    Conant, Gavin
    Becchi, Michela
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 77 (1-2): : 131 - 149
  • [2] Large-Scale Pairwise Sequence Alignments on a Large-Scale GPU Cluster
    Savran, Ibrahim
    Gao, Yang
    Bakos, Jason D.
    [J]. IEEE DESIGN & TEST, 2014, 31 (01) : 51 - 61
  • [3] A Distributed CPU-GPU Framework for Pairwise Alignments on Large-Scale Sequence Datasets
    Li, Da
    Sajjapongse, Kittisak
    Huan Truong
    Conant, Gavin
    Becchi, Michela
    [J]. PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 329 - 338
  • [4] Automatic analysis of large-scale pairwise alignments of protein sequences
    Codani, JJ
    Comet, JP
    Aude, JC
    Glémet, E
    Wozniak, A
    Risler, JL
    Hénaut, A
    Slonimski, PP
    [J]. METHODS IN MICROBIOLOGY, VOL 28, 1999, 28 : 229 - 244
  • [5] Power Control for GPU Clusters in Processing Large-scale Streams
    Chen, Qingkui
    Wang, Haifeng
    Liu, Bocheng
    [J]. JOURNAL OF COMPUTERS, 2013, 8 (10) : 2489 - 2496
  • [6] Acceleration of Large-Scale FDTD Simulations on High Performance GPU Clusters
    Ong, C.
    Weldon, M.
    Cyca, D.
    Okoniewski, M.
    [J]. 2009 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM AND USNC/URSI NATIONAL RADIO SCIENCE MEETING, VOLS 1-6, 2009, : 545 - 548
  • [7] Epidemic simulation of a large-scale social contact network on GPU clusters
    Zou, Peng
    Lu, Ya-shuai
    Wu, Ling-da
    Chen, Li-li
    Yao, Yi-ping
    [J]. SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2013, 89 (10): : 1154 - 1172
  • [8] A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU–GPU clusters
    You Fu
    Wei Zhou
    [J]. The Journal of Supercomputing, 2022, 78 : 9017 - 9037
  • [9] MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters
    Weng, Qizhen
    Xiao, Wencong
    Yu, Yinghao
    Wang, Wei
    Wang, Cheng
    He, Jian
    Li, Yong
    Zhang, Liping
    Lin, Wei
    Ding, Yu
    [J]. PROCEEDINGS OF THE 19TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '22), 2022, : 945 - 960
  • [10] FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters
    Jiang, Wenqi
    He, Zhenhao
    Zhang, Shuai
    Zeng, Kai
    Feng, Liang
    Zhang, Jiansong
    Liu, Tongxuan
    Li, Yong
    Zhou, Jingren
    Zhang, Ce
    Alonso, Gustavo
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3097 - 3105