Speed-up of Aho-Corasick pattern matching machines by rearranging states

被引:7
|
作者
Nishimura, T [1 ]
Fukamachi, S [1 ]
Shinohara, T [1 ]
机构
[1] Kyushu Inst Technol, Dept Artificial Intelligence, Iizuka, Fukuoka 8208502, Japan
关键词
D O I
10.1109/SPIRE.2001.989753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes speed-up of string pattern matching by rearranging states in Aho-Corasick pattern matching machine, which is a kind of a finite automaton. We realized speed-up of string pattern matching using data compression. Although we obtain higher compression ratio using a finite state model, it doesn't lead speed-up of string pattern matching. Because the pattern matching machine becomes very large, when compression codes are complex. Random Access Memory (RAM) are scattered with states used frequently. Such states are close to the initial state of pattern matching machine. We rearrange states so as to collecting states used frequently for CPU cache efficiency. We renumber states in breadth-first order. In experiments, the elapsed time is reduced to about 55% in case of a compressed English text.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 50 条
  • [1] AUGMENTING THE AHO-CORASICK PATTERN-MATCHING MACHINE
    SRIDHAR, MA
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1990, 32 (3-4) : 149 - 153
  • [2] Pattern Matching in YARA: Improved Aho-Corasick Algorithm
    Regeciova, Dominika
    Kolar, Dusan
    Milkovic, Marek
    [J]. IEEE ACCESS, 2021, 9 : 62857 - 62866
  • [3] A RUN-TIME EFFICIENT REALIZATION OF AHO-CORASICK PATTERN-MATCHING MACHINES
    ARIKAWA, S
    SHINOHARA, T
    [J]. NEW GENERATION COMPUTING, 1984, 2 (02) : 171 - 186
  • [4] Efficient implementation of Aho-Corasick pattern matching automata using Unicode
    Nieminen, Janne
    Kilpelainen, Pekka
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (06): : 669 - 690
  • [5] Optimized Aho-Corasick String Matching Algorithm for Smart Phones
    Lu, Rui
    Pao, Derek
    [J]. 2016 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2016, : 342 - 343
  • [6] Efficient dictionary matching by Aho-Corasick automata of truncated patterns
    Zhang, Meng
    Fan, Jiashu
    Chen, Dequan
    [J]. INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2016, 7 (04) : 323 - 329
  • [7] Efficient multi-attribute pattern matching using the extended Aho-Corasick method
    Ando, K
    Okada, M
    Shishibori, M
    Aoe, J
    [J]. SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 3936 - 3941
  • [8] Multiple-pattern matching in LZW compressed files using Aho-Corasick algorithm
    Tao, T
    Mukherjee, A
    [J]. DCC 2005: Data Compression Conference, Proceedings, 2005, : 482 - 482
  • [9] Space-Time Tradeoff in the Aho-Corasick String Matching Algorithm
    Xu, Yisi
    Pao, Derek
    [J]. 2015 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2015, : 713 - 714
  • [10] Dictionary Matching: Review of the Aho-Corasick Algorithm and Vision for Large Dictionaries
    Qiao ZhanPeng
    Goto, Kento
    Ohshima, Takuya
    Tajima, Masahiro
    Motomichi, Toyama
    [J]. ICIST '18: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES, 2018,