RabbitSAlign: Accelerating Short-Read Alignment for CPU-GPU Heterogeneous Platforms

被引:0
|
作者
Yan, Lifeng [1 ]
Yin, Zekun [1 ]
Li, Jinjin [1 ]
Yang, Yang [1 ]
Zhang, Tong [1 ]
Zhu, Fangjin [1 ]
Duan, Xiaohui [1 ]
Schmidt, Bertil [2 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Mainz, Germany
关键词
Next-generation sequencing; Read alignment; GPUs; High-performance bio-computing; GENOME;
D O I
10.1007/978-981-97-5131-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short-read alignment is a critical, yet time-consuming step in many next-generation sequencing data analysis pipelines. Most approaches follow the seed-and-extend strategy, where seeding usually involves a large number of random memory accesses, and extension of seeds relies on computationally expensive alignment algorithms, resulting in huge time consumption. Recently, Strobealign has reached state-of-the-art alignment speed while maintaining high accuracy through an innovative seeding strategy. Yet, there is still room for further optimization, especially on modern CPU-GPU heterogeneous platforms. In this paper, we present RabbitSAlign, a new GPU-accelerated short-read aligner based on Strobealign. By optimizing inefficient operations in the seeding process and utilizing GPUs to accelerate the extension process, RabbitSAlign doubles the processing speed on real biological datasets compared to Strobealign. It surpasses the performance of highly optimized BWA-MEM2 and NVIDIA Parabricks by a factor of at least four, while also being one-order-of-magnitude faster than the widely-utilized BWA-MEM and Bowtie2. Additionally, RabbitSAlign features highly competitive accuracy on both simulated and real biological data. Remarkably, it can process a 30x human genome sequencing dataset in merely 18 min. C++ sources are available at https://github.com/RabbitBio/RabbitSAlign.
引用
收藏
页码:83 / 94
页数:12
相关论文
共 50 条
  • [41] Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
    Yu, Yuanhang
    Wen, Dong
    Zhang, Ying
    Wang, Xiaoyang
    Zhang, Wenjie
    Lin, Xuemin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1871 - 1876
  • [42] Heterogeneous parallel_for Template for CPU-GPU Chips
    Navarro, Angeles
    Corbera, Francisco
    Rodriguez, Andres
    Vilches, Antonio
    Asenjo, Rafael
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (02) : 213 - 233
  • [43] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403
  • [44] Parallel Smith-Waterman Algorithm for Pairwise Sequence Alignment on CPU-GPU Heterogeneous Platform
    Dong, Yinghui
    Xia, Fei
    Jin, Guoqing
    8TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2014), 2014, : 16 - 24
  • [45] A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms
    Jin, Yu
    Jaja, Joseph F.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 825 - 834
  • [46] Improving CPU Performance through Dynamic GPU Access Throttling in CPU-GPU Heterogeneous Processors
    Rai, Siddharth
    Chaudhuri, Mainak
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 18 - 29
  • [47] Real-Time and Energy-Efficient Face Detection on CPU-GPU Heterogeneous Embedded Platforms
    Oh, Chanyoung
    Yi, Saehanseul
    Yi, Youngmin
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12): : 2878 - 2888
  • [48] A Hybrid B plus -tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms
    Shahvarani, Amirhesam
    Jacobsen, Hans-Arno
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1523 - 1538
  • [49] Efficient adaptive load balancing approach for compressive background subtraction algorithm on heterogeneous CPU-GPU platforms
    Mabrouk, Lhoussein
    Huet, Sylvain
    Houzet, Dominique
    Belkouch, Said
    Hamzaoui, Abdelkrim
    Zennayi, Yahya
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (05) : 1567 - 1583
  • [50] Performance optimization in DNA short-read alignment
    Wilton, Richard
    Szalay, Alexander S.
    BIOINFORMATICS, 2022, 38 (08) : 2081 - 2087