SIMD IMPLEMENTATION OF THE AHO-CORASICK ALGORITHM USING INTEL AVX2

被引:0
|
作者
Lazhar, Ourlis [1 ]
Djamel, Bellala [2 ]
机构
[1] Univ Batna2, Fac Technol, Dept Ind Engn, Batna, Algeria
[2] Univ Batna2, Dept Comp Sci, Batna, Algeria
来源
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE | 2019年 / 20卷 / 03期
关键词
Pattern-matching; Aho-Corasick algorithm; Vectorization; Intel (R) Streaming SIMD Extensions 2.0 (AVX2);
D O I
10.12694/scpe.v20i3.1572
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Aho-Corasick (AC) algorithm is a multiple pattern exact string-matching algorithm proposed by Alfred V. Aho and Margaret J. Corasick. It is used to locate all occurrences of a finite set of patterns within an input text simultaneously. The AC algorithm is in the heart of many applications including digital forensics such as digital signatures demonstrating the authenticity of a digital message or document, full text search (utility programs such as grept, awk and sed of Unix systems), information retrieval (biological sequence analysis and gene identification), intrusion detection systems (IDS) in computer networks like SNORT, web filtering, spam filters, and anti-malware solutions (virus scanner). In this paper we present a vectorized version of the AC algorithm designed with the use of packed instructions based on the Intel (R) streaming SIMD (Single Instruction Multiple Data) extensions AVX2 (Advanced Vector Extensions 2.0). This paper shows that the vectorized AC algorithm reduces significantly the time matching process comparing to the implementation of the original AC algorithm.
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [31] Optimizing Dilithium Implementation with AVX2/-512
    Xu, Runqing
    He, Debiao
    Luo, Min
    Peng, Cong
    Zeng, Xiangyong
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (06)
  • [32] An Efficient Multicharacter Transition String-Matching Engine Based on the Aho-Corasick Algorithm
    Chen, Chien-Chi
    Wang, Sheng-De
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2013, 10 (04)
  • [33] A MULTI-CHARACTER TRANSITION STRING MATCHING ARCHITECTURE BASED ON AHO-CORASICK ALGORITHM
    Chen, Chien-Chi
    Wang, Sheng-De
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (12): : 8367 - 8386
  • [34] 使用Stencil评估Intel AVX2 Vgather指令
    林新华
    秦强
    李硕
    文敏华
    松岗聪
    计算机科学, 2017, 44 (01) : 20 - 24
  • [35] Fast Implementation of Curve25519 Using AVX2
    Faz-Hernandez, Armando
    Lopez, Julio
    PROGRESS IN CRYPTOLOGY - LATINCRYPT 2015, 2015, 9230 : 329 - 345
  • [36] A failureless pipelined Aho-Corasick algorithm for FPGA-based parallel string matching engine
    Kim, HyunJin
    Lecture Notes in Electrical Engineering, 2015, 339 : 157 - 164
  • [37] Fast Implementation of Simeck Family Block Ciphers Using AVX2
    Park, Taehwan
    Seo, Hwajeong
    Kim, Howon
    2018 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON18), 2018, : 208 - 211
  • [38] An Exploration of Using the Intel AVX2 Gather Load Instructions for Vectorised Image Processing
    Cree, Michael J.
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2018,
  • [39] All-pairs suffix/prefix in optimal time using Aho-Corasick space
    Loukides, Grigorios
    Pissis, Solon P.
    INFORMATION PROCESSING LETTERS, 2022, 178
  • [40] Efficient multi-attribute pattern matching using the extended Aho-Corasick method
    Ando, K
    Okada, M
    Shishibori, M
    Aoe, J
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 3936 - 3941