GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data

被引：0

作者：

Haseeb, Muhammad ^{[1
]}

Saeed, Fahad ^{[1
,2
,3
]}

机构：

[1] Florida Int Univ FIU, Knight Fdn Sch Comp & Informat Sci, Miami, FL 33199 USA

[2] Biomol Sci Inst BSI, Miami, FL 33199 USA

[3] Florida Int Univ, Herbert Wertheim Sch Med, Dept Human & Mol Genet, Miami, FL 33199 USA

来源：

SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期

基金：

美国国家科学基金会; 美国国家卫生研究院;

关键词：

TANDEM; IDENTIFICATION; SEQUENCES; ULTRAFAST;

D O I：

10.1038/s41598-023-43033-w

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptide search domain remains limited. Part of the reason is the use of sub-optimal algorithms in the existing GPU-accelerated methods resulting in significantly inefficient hardware utilization. In this paper, we design and implement a new-age CPU-GPU HPC framework, called GiCOPS, for efficient and complete GPU-acceleration of the modern database peptide search algorithms on supercomputers. Our experimentation shows that the GiCOPS exhibits between 1.2 to 5x speed improvement over its CPU-only predecessor, HiCOPS, and over 10x improvement over several existing GPU-based database search algorithms for sufficiently large experiment sizes. We further assess and optimize the performance of our framework using the Roofline Model and report near-optimal results for several metrics including computations per second, occupancy rate, memory workload, branch efficiency and shared memory performance. Finally, the CPU-GPU methods and optimizations proposed in our work for complex integer- and memory-bounded algorithmic pipelines can also be extended to accelerate the existing and future peptide identification algorithms. GiCOPS is now integrated with our umbrella HPC framework HiCOPS and is available at: https://github.com/pcdslab/gicops.

引用

页数：14

共 50 条

[31] Nonblocking Data Structures for Distributed-Memory Machines: Stacks as an Example
Diep, Thanh-Dang
Furlinger, Karl
2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 9 - 17
[32] Distributed-memory multi-GPU block-sparse tensor contraction for electronic structure
Herault, Thomas
Robert, Yves
Bosilca, George
Harrison, Robert J.
Lewis, Cannada A.
Valeev, Edward F.
Dongarra, Jack J.
2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 537 - 546
[33] Protein database search of hybrid alignment algorithm based on GPU parallel acceleration
Zhou, Wei
Cai, Zhanxiu
Lian, Bo
Wang, Jincai
Ma, Jianping
JOURNAL OF SUPERCOMPUTING, 2017, 73 (10): : 4517 - 4534
[34] Protein database search of hybrid alignment algorithm based on GPU parallel acceleration
Wei Zhou
Zhanxiu Cai
Bo Lian
Jincai Wang
Jianping Ma
The Journal of Supercomputing, 2017, 73 : 4517 - 4534
[35] Efficient Breadth-First Search on Massively Parallel and Distributed-Memory Machines
Ueno K.
Suzumura T.
Maruyama N.
Fujisawa K.
Matsuoka S.
Data Science and Engineering, 2017, 2 (1) : 22 - 35
[36] Generic matrix multiplication for multi-GPU accelerated distributed-memory platforms over PARSEC
Herault, Thomas
Robert, Yves
Bosilca, George
Dongarra, Jack
PROCEEDINGS OF SCALA 2019: 2019 IEEE/ACM 10TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE-SCALE SYSTEMS (SCALA), 2019, : 33 - 41
[37] A general approach for supporting nonblocking data structures on distributed-memory systems
Thanh-Dang Diep
Phuong Hoai Ha
Fuerlinger, Karl
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 173 : 48 - 60
[38] Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-Memory
Dathathri, Roshan
Reddy, Chandan
Ramashekar, Thejas
Bondhugula, Uday
2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 375 - 386
[39] FILTER-BASED JOIN ALGORITHMS ON UNIPROCESSOR AND DISTRIBUTED-MEMORY MULTIPROCESSOR DATABASE MACHINES
QADAH, GZ
LECTURE NOTES IN COMPUTER SCIENCE, 1988, 303 : 388 - 413
[40] PROCESSOR TAGGED DESCRIPTORS - A DATA STRUCTURE FOR COMPILING FOR DISTRIBUTED-MEMORY MULTICOMPUTERS
SU, E
PALERMO, DJ
BANERJEE, P
PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 1994, 50 : 123 - 132

← 1 2 3 4 5 →