Parallelizing Exact and Approximate String Matching via Inclusive Scan on a GPU

Cited by: 18
Authors
Mitani, Yasuaki [1]
Ino, Fumihiko [2]
Hagihara, Kenichi [2]
Affiliations
[1] DWANGO Corp Ltd, Dev Head Off, Chuo Ku, 4-12-15 Ginza, Tokyo 1040061, Japan
[2] Osaka Univ, Grad Sch Informat Sci & Technol, 1-5 Yamadaoka, Suita, Osaka 5650871, Japan
Funding
Japan Society for the Promotion of Science
Keywords
String matching; bit-parallel algorithm; inclusive scan; shift-or algorithm; Wu-Manber algorithm; GPU; READ ALIGNMENT; ALGORITHM;
DOI
10.1109/TPDS.2016.2645222
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In this study, to substantially improve the runtimes of exact and approximate string matching algorithms, we propose a tribrid parallel method for bit-parallel algorithms such as the Shift-Or and Wu-Manber algorithms. Our underlying idea is to interpret bit-parallel algorithms as inclusive-scan operations, which allow these bit-parallel algorithms to run efficiently on a graphics processing unit (GPU); we achieve this speed-up here because inclusive-scan operations not only eliminate duplicate searches between threads but also realize a GPU-friendly memory access pattern that maximizes memory read/write throughput. To realize our ideas, we first define two binary operators and then present a proof regarding the associativity of these operators, which is necessary for the parallelization of the inclusive-scan operations. Finally, we integrate the inclusive-scan scheme into a previous segmentation-based scheme to maximize search throughput, identifying the best tradeoff point between synchronization cost and duplicate work. Through our experiments, we compared our proposed method with previous segmentation-based methods and indexing-based sequence aligners. For online string matching, our proposed method performed 6.7-16.7 times faster than previous methods, achieving a search throughput of up to 1.88 terabits per second (Tbps) on a GeForce GTX TITAN X GPU. We therefore conclude that our proposed method is quite effective for decreasing the runtimes of online string matching of short patterns.
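To make the central idea of the abstract concrete, here is a minimal sketch of how the Shift-Or update can be read as an inclusive scan. It is an illustration under stated assumptions, not the operators defined in the paper: each text character is encoded as a hypothetical pair `(shift, mask)` standing for the map `R -> (R << shift) | mask`, and `combine` composes two such maps. Because function composition is associative, the prefix results could in principle be computed by a parallel scan; `itertools.accumulate` is used here only as a sequential stand-in for that scan. The function name `shift_or_scan` is likewise an assumption made for this example.

```python
from itertools import accumulate

def shift_or_scan(text, pattern):
    """Exact matching via Shift-Or, written as an inclusive scan (sketch)."""
    m = len(pattern)
    full = (1 << m) - 1  # m low bits set

    # Shift-Or masks: bit j of masks[c] is 0 exactly when pattern[j] == c.
    masks = {}
    for j, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, full) & ~(1 << j)

    # Each character becomes the map R -> (R << 1) | mask, stored as (1, mask).
    elems = [(1, masks.get(ch, full)) for ch in text]

    def combine(a, b):
        # Compose "apply a, then b". The shift is capped at m because any
        # larger shift clears all m state bits anyway.
        s1, m1 = a
        s2, m2 = b
        return (min(s1 + s2, m), ((m1 << s2) | m2) & full)

    hits = []
    # Sequential stand-in for a parallel inclusive scan over `elems`.
    for i, (s, msk) in enumerate(accumulate(elems, combine)):
        # Apply the composed map to the initial all-ones state.
        state = ((full << s) | msk) & full
        if state & (1 << (m - 1)) == 0:  # top bit clear: match ends at i
            hits.append(i - m + 1)
    return hits
```

For example, `shift_or_scan("abracadabra", "abra")` returns `[0, 7]`. A GPU version would replace `accumulate` with a work-efficient parallel scan; that step is outside the scope of this sketch.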
Pages: 1989-2002
Page count: 14
Related papers
50 in total
  • [1] A Fast Approximate String Matching Algorithm on GPU
    Nunes, Lucas S. N.
    Bordim, J. L.
    Nakano, K.
    Ito, Y.
    [J]. PROCEEDINGS OF 2015 THIRD INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2015, : 188 - 192
  • [2] Average complexity of exact and approximate multiple string matching
    Navarro, G
    Fredriksson, K
    [J]. THEORETICAL COMPUTER SCIENCE, 2004, 321 (2-3) : 283 - 290
  • [3] Bit-parallel multiple approximate string matching based on GPU
    Xu, Kefu
    Cui, Wenke
    Hu, Yue
    Guo, Li
    [J]. FIRST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2013, 17 : 523 - 529
  • [4] A weak approach to suffix automata simulation for exact and approximate string matching
    Faro, Simone
    Scafiti, Stefano
    [J]. THEORETICAL COMPUTER SCIENCE, 2022, 933 : 88 - 103
  • [6] A Memory-Access-Efficient Implementation of the Approximate String Matching Algorithm on GPU
    Nunes, Lucas S. N.
    Bordim, J. L.
    Nakano, K.
    Ito, Y.
    [J]. 2016 FOURTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2016, : 483 - 489
  • [7] APPROXIMATE STRING MATCHING
    HALL, PAV
    DOWLING, GR
    [J]. COMPUTING SURVEYS, 1980, 12 (04) : 381 - 402
  • [8] Multipattern String Matching On A GPU
    Zha, Xinyan
    Sahni, Sartaj
    [J]. 2011 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2011,
  • [9] An Optimal Implementation of the Approximate String Matching on the Hierarchical Memory Machine, with Performance Evaluation on the GPU
    Man, Duhu
    Nakano, Koji
    Ito, Yasuaki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (12) : 3063 - 3071
  • [10] ALGORITHMS FOR APPROXIMATE STRING MATCHING
    UKKONEN, E
    [J]. INFORMATION AND CONTROL, 1985, 64 (1-3) : 100 - 118