A bit-parallel algorithm for searching multiple patterns with various lengths

被引:4
|
作者
Kusudo, Ko [1 ,2 ]
Ino, Fumihiko [1 ]
Hagihara, Kenichi [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
[2] Fujitsu Ltd, Minato Ku, Tokyo 1057123, Japan
基金
日本学术振兴会; 日本科学技术振兴机构;
关键词
String search; Bit-parallel algorithm; Acceleration; AVX; GRAPHICS;
D O I
10.1016/j.jpdc.2014.11.003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present an Advanced Vector Extensions (AVX) accelerated method for a bit-parallel algorithm that realizes fast string search for maximizing stable search throughput. An advantage of our method is that it accelerates string search by regularizing both control flow and data structures. This regularization facilitates the exploitation of the latest vector instruction set to achieve efficient parallel search of multiple patterns of different lengths. We use AVX instructions to increase search throughput per CPU core and employ OpenMP directives to realize data-parallel search of strings. As a result, we found that our data structure doubled search throughput as compared with a previous bit-parallel approach that used a data structure for patterns of the same length. We also found that our method achieved stable search throughput for arbitrary data if the pattern size is large, but small enough to fit into a word. Some experimental results are provided to understand the advantage and disadvantage of our method with a comparison to Aho-Corasick based methods. We believe that our method is useful for large genome texts with many partial matches. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:49 / 57
页数:9
相关论文
共 50 条
  • [1] An Efficient Bit-Parallel Multi-Patterns Word Searching Algorithm through Splitting the Text
    Yadav, Ishadutta
    Singh, Bharat
    Agarwal, Suneeta
    Prasad, Rajesh
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 406 - 410
  • [2] A bit-parallel tree matching algorithm for patterns with horizontal VLDC's
    Tsuji, Hisashi
    Ishino, Akira
    Takeda, Masayuki
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2005, 3772 : 388 - 398
  • [3] Bit-Parallel Multiple Pattern Matching
    Tuan Tu Tran
    Giraud, Mathieu
    Varre, Jean-Stephane
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT II, 2012, 7204 : 292 - 301
  • [4] A GPU-Based Bit-Parallel Multiple Pattern Matching Algorithm
    Hung, Che-Lun
    Wang, Hsiao-Hsi
    Hsu, Tzu-Hung
    Lin, Chun-Yuan
    [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1219 - 1222
  • [5] A space efficient bit-parallel algorithm for the multiple string matching problem
    Cantone, Domenico
    Faro, Simone
    [J]. INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2006, 17 (06) : 1235 - 1251
  • [6] Bit-Parallel Search Algorithms for Long Patterns
    Durian, Branislav
    Peltola, Hannu
    Salmela, Leena
    Tarhio, Jorma
    [J]. EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2010, 6049 : 129 - +
  • [7] A new exact bit-parallel algorithm for SAT
    Segundo, Pablo San
    Tapia, Cristobal
    Puente, Julio
    Rodriguez-Losada, Diego
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL 2, PROCEEDINGS, 2008, : 59 - 65
  • [8] A Bit-Parallel Algorithm for Sequential Pattern Matching with Wildcards
    Guo, Dan
    Hong, Xiao-Li
    Hu, Xue-Gang
    Gao, Jun
    Liu, Ying-Ling
    Wu, Gong-Qing
    Wu, Xindong
    [J]. CYBERNETICS AND SYSTEMS, 2011, 42 (06) : 382 - 401
  • [9] An exact bit-parallel algorithm for the maximum clique problem
    San Segundo, Pablo
    Rodriguez-Losada, Diego
    Jimenez, Agustin
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2011, 38 (02) : 571 - 581
  • [10] An Alternative Bit-Parallel Algorithm for Parameterized String Matching
    Prasad, Rajesh
    Agarwal, Suneeta
    [J]. INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 2148 - 2155