RabbitSAlign: Accelerating Short-Read Alignment for CPU-GPU Heterogeneous Platforms

被引:0
|
作者
Yan, Lifeng [1 ]
Yin, Zekun [1 ]
Li, Jinjin [1 ]
Yang, Yang [1 ]
Zhang, Tong [1 ]
Zhu, Fangjin [1 ]
Duan, Xiaohui [1 ]
Schmidt, Bertil [2 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Mainz, Germany
关键词
Next-generation sequencing; Read alignment; GPUs; High-performance bio-computing; GENOME;
D O I
10.1007/978-981-97-5131-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short-read alignment is a critical, yet time-consuming step in many next-generation sequencing data analysis pipelines. Most approaches follow the seed-and-extend strategy, where seeding usually involves a large number of random memory accesses, and extension of seeds relies on computationally expensive alignment algorithms, resulting in huge time consumption. Recently, Strobealign has reached state-of-the-art alignment speed while maintaining high accuracy through an innovative seeding strategy. Yet, there is still room for further optimization, especially on modern CPU-GPU heterogeneous platforms. In this paper, we present RabbitSAlign, a new GPU-accelerated short-read aligner based on Strobealign. By optimizing inefficient operations in the seeding process and utilizing GPUs to accelerate the extension process, RabbitSAlign doubles the processing speed on real biological datasets compared to Strobealign. It surpasses the performance of highly optimized BWA-MEM2 and NVIDIA Parabricks by a factor of at least four, while also being one-order-of-magnitude faster than the widely-utilized BWA-MEM and Bowtie2. Additionally, RabbitSAlign features highly competitive accuracy on both simulated and real biological data. Remarkably, it can process a 30x human genome sequencing dataset in merely 18 min. C++ sources are available at https://github.com/RabbitBio/RabbitSAlign.
引用
收藏
页码:83 / 94
页数:12
相关论文
共 50 条
  • [31] Parallelization with load balancing of the weather scheme WSM7 for heterogeneous CPU-GPU platforms
    Jakobs, Thomas
    Kloeckner, Oliver
    Ruenger, Gudula
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (10): : 14645 - 14665
  • [32] Performance Analysis of AES on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    CLOUD COMPUTING, BIG DATA & EMERGING TOPICS, JCC-BD&ET 2022, 2022, 1634 : 31 - 42
  • [33] Reducing CPU-GPU Interferences to Improve CPU Performance in Heterogeneous Architectures
    Wen H.
    Zhang W.
    Journal of Computing Science and Engineering, 2020, 16 (04) : 131 - 145
  • [34] Accelerating Progressive Set Similarity Join with the CPU-GPU Architecture
    Yu, Lining
    Nie, Tiezheng
    Shen, Derong
    Kou, Yue
    BIG DATA RESEARCH, 2021, 26
  • [35] Matrix inversion on CPU-GPU platforms with applications in control theory
    Benner, Peter
    Ezzatti, Pablo
    Quintana-Orti, Enrique S.
    Remon, Alfredo
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (08): : 1170 - 1182
  • [36] Towards Optimal Fast Matrix Multiplication on CPU-GPU Platforms
    Shao, Senhao
    Wang, Yizhuo
    Ji, Weixing
    Gao, Jianhua
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT 2021, 2022, 13148 : 223 - 236
  • [37] A Survey on Task Scheduling of CPU-GPU Heterogeneous Cluster
    ZHOU Yiheng
    ZENG Wei
    ZHENG Qingfang
    LIU Zhilong
    CHEN Jianping
    ZTE Communications, 2024, 22 (03) : 83 - 90
  • [38] Image Noise Removal on Heterogeneous CPU-GPU Configurations
    Sanchez, Maria G.
    Vidal, Vicente
    Arnal, Josep
    Vidal, Anna
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 2219 - 2229
  • [39] Performance Optimization for CPU-GPU Heterogeneous Parallel System
    Wang, Yanhua
    Qiao, Jianzhong
    Lin, Shukuan
    Zhao, Tinglei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1259 - 1266
  • [40] A Flexible Scheduling Framework for Heterogeneous CPU-GPU Clusters
    Sajjapongse, Kittisak
    Agarwal, Tejaswi
    Becchi, Michela
    2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014,