SIMD IMPLEMENTATION OF THE AHO-CORASICK ALGORITHM USING INTEL AVX2

被引:0
|
作者
Lazhar, Ourlis [1 ]
Djamel, Bellala [2 ]
机构
[1] Univ Batna2, Fac Technol, Dept Ind Engn, Batna, Algeria
[2] Univ Batna2, Dept Comp Sci, Batna, Algeria
来源
关键词
Pattern-matching; Aho-Corasick algorithm; Vectorization; Intel (R) Streaming SIMD Extensions 2.0 (AVX2);
D O I
10.12694/scpe.v20i3.1572
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Aho-Corasick (AC) algorithm is a multiple pattern exact string-matching algorithm proposed by Alfred V. Aho and Margaret J. Corasick. It is used to locate all occurrences of a finite set of patterns within an input text simultaneously. The AC algorithm is in the heart of many applications including digital forensics such as digital signatures demonstrating the authenticity of a digital message or document, full text search (utility programs such as grept, awk and sed of Unix systems), information retrieval (biological sequence analysis and gene identification), intrusion detection systems (IDS) in computer networks like SNORT, web filtering, spam filters, and anti-malware solutions (virus scanner). In this paper we present a vectorized version of the AC algorithm designed with the use of packed instructions based on the Intel (R) streaming SIMD (Single Instruction Multiple Data) extensions AVX2 (Advanced Vector Extensions 2.0). This paper shows that the vectorized AC algorithm reduces significantly the time matching process comparing to the implementation of the original AC algorithm.
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [1] Auto implementation of parallel hardware architecture for Aho-Corasick algorithm
    Najam-ul-Islam, M.
    Zahra, Fatima Tu
    Jafri, Atif Raza
    Shah, Roman
    ul Hassan, Masood
    Rashid, Muhammad
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2022, 26 (01) : 29 - 53
  • [2] Heterogeneous Parallelization of Aho-Corasick Algorithm
    Soroushnia, Shima
    Daneshtalab, Masoud
    Plosila, Juha
    Liljeberg, Pasi
    8TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS (PACBB 2014), 2014, 294 : 153 - 160
  • [3] Auto implementation of parallel hardware architecture for Aho-Corasick algorithm
    M. Najam-ul-Islam
    Fatima Tu Zahra
    Atif Raza Jafri
    Roman Shah
    Masood ul Hassan
    Muhammad Rashid
    Design Automation for Embedded Systems, 2022, 26 : 29 - 53
  • [4] Performance Optimization of Aho-Corasick Algorithm on a GPU
    Nhat-Phuong Tran
    Lee, Myungho
    Hong, Sugwon
    Bae, Jongwoo
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1143 - 1152
  • [5] Efficient implementation of Aho-Corasick pattern matching automata using Unicode
    Nieminen, Janne
    Kilpelainen, Pekka
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (06): : 669 - 690
  • [6] Memory Efficient Parallelization for Aho-Corasick Algorithm on a GPU
    Nhat-Phuong Tran
    Lee, Myungho
    Hong, Sugwon
    Shin, Minho
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 432 - 438
  • [7] On the Efficient Application of Aho-Corasick Algorithm in Process Mining
    Konchagin, Andrey M.
    Kalenkova, Anna A.
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2017, 2018, 10716 : 371 - 377
  • [8] Extension of Aho-Corasick Algorithm to Detect Injection Attacks
    Rejeb, Jale
    Srinivasan, Mahalakshmi
    ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 207 - 212
  • [9] Pattern Matching in YARA: Improved Aho-Corasick Algorithm
    Regeciova, Dominika
    Kolar, Dusan
    Milkovic, Marek
    IEEE ACCESS, 2021, 9 (09): : 62857 - 62866
  • [10] A File Undelete with Aho-Corasick Algorithm In File Recovery
    Sitompul, Opim Salim
    Handoko, Andrew
    Rahmat, Romi Fadillah
    2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 427 - 431