Thread-cooperative, Bit-parallel Computation of Levenshtein Distance on GPU

Cited by: 24
Authors
Chacon, Alejandro [1]
Marco-Sola, Santiago [2]
Espinosa, Antonio [1]
Ribeca, Paolo [2]
Carlos Moure, Juan [1]
Affiliations
[1] UAB, Bellaterra 08193, Spain
[2] CNAG, Barcelona 08028, Spain
Keywords
SIMD; GPU; CUDA; Myers' algorithm; alignment; accurate; sequence; efficient; search
DOI
10.1145/2597652.2597677
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Approximate string matching is a central problem in computational biology, and the fast computation of string distance is one of its essential components. Myers' bit-parallel algorithm improves on the classical dynamic programming approach to Levenshtein distance computation and offers competitive performance on CPUs. The main challenge in designing an efficient GPU implementation is to expose enough SIMD parallelism while keeping a relatively small working set for each thread. In this work we implement and optimise a CUDA version of Myers' algorithm suitable for use as a building block for DNA sequence alignment. We achieve high efficiency by means of a cooperative parallelisation strategy for (1) very-long integer addition and shift operations, and (2) several simultaneous pattern matching tasks. In addition, we explore the performance impact of using features specific to the Kepler architecture. Our results show an overall performance on the order of tera cell updates per second using a single high-end Nvidia GPU, and speedups in excess of 20x with respect to a sixteen-core, non-vectorised CPU implementation.
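To make the abstract's "very-long integer addition and shift operations" concrete, here is a minimal sequential C++ sketch of a Myers-style bit-vector Levenshtein computation for patterns longer than one machine word. It is not the authors' CUDA code: it assumes Hyyrö's formulation of the recurrence, computes the global distance between two whole strings, and walks the carry chain word by word on a single CPU thread, whereas the paper describes a cooperative scheme across GPU threads. The function name `myers_levenshtein` and all variable names are hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Levenshtein distance via a Myers-style bit-vector recurrence (Hyyro's
// formulation, assumed here), with bit-vectors stored as W 64-bit words.
std::size_t myers_levenshtein(const std::string& pattern, const std::string& text) {
    const std::size_t m = pattern.size();
    if (m == 0) return text.size();
    const std::size_t W = (m + 63) / 64;  // words per bit-vector

    // peq[c * W + w]: bit i is set iff pattern[64 * w + i] == c.
    std::vector<std::uint64_t> peq(256 * W, 0);
    for (std::size_t i = 0; i < m; ++i)
        peq[static_cast<unsigned char>(pattern[i]) * W + i / 64] |=
            std::uint64_t{1} << (i % 64);

    // Vertical deltas: initially every row is +1 relative to the row above.
    std::vector<std::uint64_t> pv(W, ~std::uint64_t{0}), mv(W, 0);
    const std::size_t top_word = (m - 1) / 64;
    const std::uint64_t top_bit = std::uint64_t{1} << ((m - 1) % 64);
    std::size_t score = m;  // distance between the pattern and the empty text

    for (unsigned char c : text) {
        const std::uint64_t* eq = &peq[c * W];
        std::uint64_t add_carry = 0;  // carry of the long addition
        std::uint64_t ph_carry = 1;   // row 0 boundary: horizontal delta is +1
        std::uint64_t mh_carry = 0;
        for (std::size_t w = 0; w < W; ++w) {
            const std::uint64_t xv = eq[w] | mv[w];

            // Long addition (Eq & Pv) + Pv, one word at a time with carry.
            const std::uint64_t a = eq[w] & pv[w];
            const std::uint64_t s = a + pv[w];
            const std::uint64_t sum = s + add_carry;
            add_carry = static_cast<std::uint64_t>((s < a) || (sum < s));

            const std::uint64_t xh = (sum ^ pv[w]) | eq[w];
            std::uint64_t ph = mv[w] | ~(xh | pv[w]);
            std::uint64_t mh = pv[w] & xh;

            // The horizontal delta in the last pattern row updates the score.
            if (w == top_word) {
                if (ph & top_bit) ++score;
                else if (mh & top_bit) --score;
            }

            // Long left shift by one: the top bit of each word moves up.
            const std::uint64_t ph_out = ph >> 63;
            const std::uint64_t mh_out = mh >> 63;
            ph = (ph << 1) | ph_carry;
            mh = (mh << 1) | mh_carry;
            ph_carry = ph_out;
            mh_carry = mh_out;

            pv[w] = mh | ~(xv | ph);
            mv[w] = ph & xv;
        }
    }
    return score;
}
```

The three carries (`add_carry`, `ph_carry`, `mh_carry`) are the only values that flow from one word to the next. That serial dependency is what a cooperative implementation has to resolve when each word is held by a different thread.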
Pages: 103-112
Page count: 10
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 2036 - 2039