GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data

被引:0
|
作者
Haseeb, Muhammad [1 ]
Saeed, Fahad [1 ,2 ,3 ]
机构
[1] Florida Int Univ FIU, Knight Fdn Sch Comp & Informat Sci, Miami, FL 33199 USA
[2] Biomol Sci Inst BSI, Miami, FL 33199 USA
[3] Florida Int Univ, Herbert Wertheim Sch Med, Dept Human & Mol Genet, Miami, FL 33199 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
TANDEM; IDENTIFICATION; SEQUENCES; ULTRAFAST;
D O I
10.1038/s41598-023-43033-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptide search domain remains limited. Part of the reason is the use of sub-optimal algorithms in the existing GPU-accelerated methods resulting in significantly inefficient hardware utilization. In this paper, we design and implement a new-age CPU-GPU HPC framework, called GiCOPS, for efficient and complete GPU-acceleration of the modern database peptide search algorithms on supercomputers. Our experimentation shows that the GiCOPS exhibits between 1.2 to 5x speed improvement over its CPU-only predecessor, HiCOPS, and over 10x improvement over several existing GPU-based database search algorithms for sufficiently large experiment sizes. We further assess and optimize the performance of our framework using the Roofline Model and report near-optimal results for several metrics including computations per second, occupancy rate, memory workload, branch efficiency and shared memory performance. Finally, the CPU-GPU methods and optimizations proposed in our work for complex integer- and memory-bounded algorithmic pipelines can also be extended to accelerate the existing and future peptide identification algorithms. GiCOPS is now integrated with our umbrella HPC framework HiCOPS and is available at: https://github.com/pcdslab/gicops.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
    Muhammad Haseeb
    Fahad Saeed
    Scientific Reports, 13
  • [2] GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering
    Suzuki, Shuji
    Kakuta, Masanori
    Ishida, Takashi
    Akiyama, Yutaka
    PLOS ONE, 2016, 11 (08):
  • [3] Communication lower-bounds for distributed-memory computations for mass spectrometry based omics data
    Saeed, Fahad
    Haseeb, Muhammad
    Iyengar, S. S.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 161 : 37 - 47
  • [4] Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators
    Kurzak, Jakub
    Gates, Mark
    Charara, Ali
    YarKhan, Asim
    Yamazaki, Ichitaro
    Dongarra, Jack
    EURO-PAR 2019: PARALLEL PROCESSING, 2019, 11725 : 495 - 506
  • [5] Least Squares Solvers for Distributed-Memory Machines with GPU Accelerators
    Kurzak, Jakub
    Gates, Mark
    Charara, Ali
    Yarkhan, Asim
    Dongarra, Jack
    INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2019), 2019, : 117 - 126
  • [6] GPU-acceleration of the ELPA2 distributed eigensolver for dense symmetric and hermitian eigenproblems
    Yu, Victor Wen-zhe
    Moussa, Jonathan
    Kus, Pavel
    Marek, Andreas
    Messmer, Peter
    Yoon, Mina
    Lederer, Hermann
    Blum, Volker
    COMPUTER PHYSICS COMMUNICATIONS, 2021, 262
  • [7] GPU-acceleration on a low-latency binary-coalescence gravitational wave search pipeline
    Guo, Xiangyu
    Chu, Qi
    Chung, Shin Kee
    Du, Zhihui
    Wen, Linqing
    Gu, Yanqi
    COMPUTER PHYSICS COMMUNICATIONS, 2018, 231 : 62 - 71
  • [8] DATA AND TASK ALIGNMENT IN DISTRIBUTED-MEMORY ARCHITECTURES
    SINHAROY, B
    SZYMANSKI, BK
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 21 (01) : 61 - 74
  • [9] Peptide sequence tags for fast database search in mass-spectrometry
    Frank, A
    Tanner, S
    Pevzner, P
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2005, 3500 : 326 - 341
  • [10] Peptide sequence tags for fast database search in mass-spectrometry
    Frank, A
    Tanner, S
    Bafna, V
    Pevzner, P
    JOURNAL OF PROTEOME RESEARCH, 2005, 4 (04) : 1287 - 1295