RabbitSAlign: Accelerating Short-Read Alignment for CPU-GPU Heterogeneous Platforms

被引:0
|
作者
Yan, Lifeng [1 ]
Yin, Zekun [1 ]
Li, Jinjin [1 ]
Yang, Yang [1 ]
Zhang, Tong [1 ]
Zhu, Fangjin [1 ]
Duan, Xiaohui [1 ]
Schmidt, Bertil [2 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Mainz, Germany
关键词
Next-generation sequencing; Read alignment; GPUs; High-performance bio-computing; GENOME;
D O I
10.1007/978-981-97-5131-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short-read alignment is a critical, yet time-consuming step in many next-generation sequencing data analysis pipelines. Most approaches follow the seed-and-extend strategy, where seeding usually involves a large number of random memory accesses, and extension of seeds relies on computationally expensive alignment algorithms, resulting in huge time consumption. Recently, Strobealign has reached state-of-the-art alignment speed while maintaining high accuracy through an innovative seeding strategy. Yet, there is still room for further optimization, especially on modern CPU-GPU heterogeneous platforms. In this paper, we present RabbitSAlign, a new GPU-accelerated short-read aligner based on Strobealign. By optimizing inefficient operations in the seeding process and utilizing GPUs to accelerate the extension process, RabbitSAlign doubles the processing speed on real biological datasets compared to Strobealign. It surpasses the performance of highly optimized BWA-MEM2 and NVIDIA Parabricks by a factor of at least four, while also being one-order-of-magnitude faster than the widely-utilized BWA-MEM and Bowtie2. Additionally, RabbitSAlign features highly competitive accuracy on both simulated and real biological data. Remarkably, it can process a 30x human genome sequencing dataset in merely 18 min. C++ sources are available at https://github.com/RabbitBio/RabbitSAlign.
引用
下载
收藏
页码:83 / 94
页数:12
相关论文
共 50 条
  • [21] Direction-Optimizing Breadth-First Search on CPU-GPU heterogeneous platforms
    Zou, Dan
    Dou, Yong
    Wang, Qiang
    Xu, Jinbo
    Li, Baofeng
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1064 - 1069
  • [22] Denial of Service in CPU-GPU Heterogeneous Architectures
    Wen, Hao
    Zhang, Wei
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [23] A Survey of CPU-GPU Heterogeneous Computing Techniques
    Mittal, Sparsh
    Vetter, Jeffrey S.
    ACM COMPUTING SURVEYS, 2015, 47 (04)
  • [24] Heterogeneous CPU-GPU Execution of Stencil Applications
    Siklosi, Balint
    Reguly, Istvan Z.
    Mudalige, Gihan R.
    PROCEEDINGS OF 2018 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC 2018), 2018, : 71 - 80
  • [25] gem5-gpu: A Heterogeneous CPU-GPU Simulator
    Power, Jason
    Hestness, Joel
    Orr, Marc S.
    Hill, Mark D.
    Wood, David A.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2015, 14 (01) : 34 - 36
  • [26] Accelerating Iterative Protein Sequence Alignment on a Heterogeneous GPU-CPU Platform
    Said, Mai
    Safar, Mona
    Taher, Whamed
    Wahba, Ayman
    2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), 2016, : 403 - 410
  • [27] Accelerating Distance Transform Image based Hand Detection using CPU-GPU Heterogeneous Computing
    Yi, Zhaohua
    Hu, Xiaoqi
    Kim, Eung Kyeu
    Kim, Kyung Ki
    Jang, Byunghyun
    JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2016, 16 (05) : 557 - 563
  • [28] GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing
    Liu, Haotian
    Tang, Bo
    Zhang, Jiashu
    Deng, Yangshen
    Yan, Xiao
    Zheng, Xinying
    Shen, Qiaomu
    Zeng, Dan
    Mao, Zunyao
    Zhang, Chaozu
    You, Zhengxin
    Wang, Zhihao
    Jiang, Runzhe
    Wang, Fang
    Yiu, Man Lung
    Li, Huan
    Han, Mingji
    Li, Qian
    Luo, Zhenghai
    PROCEEDINGS OF THE 13TH SYMPOSIUM ON CLOUD COMPUTING, SOCC 2022, 2022, : 158 - 172
  • [29] CUSHAW2-GPU: Empowering Faster Gapped Short-Read Alignment Using GPU Computing
    Liu, Yongchao
    Schmidt, Bertil
    IEEE DESIGN & TEST, 2014, 31 (01) : 31 - 39
  • [30] Accelerating Exact Inner Product Retrieval by CPU-GPU Systems
    Xiang, Long
    Tang, Bo
    Yang, Chuan
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1277 - 1280