Multi-Pattern Matching Algorithm with Wildcards Based on Bit-Parallelism

被引:0
|
作者
Ahmed A.F.Saif [1 ]
HU Liang [1 ]
CHU Jianfeng [1 ]
机构
[1] College of Computer Science and Technology,Jilin University
关键词
multi-pattern; string matching; wildcard; bit-parallelism;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
081203 ; 0835 ;
摘要
Multi-pattern matching with wildcards is a problem of finding the occurrence of all patterns in a pattern set{p~1,---,p~k}in a given text t. If the percentage of wildcards in pattern set is not high,this problem can be solved using finite automata. We introduce a multi-pattern matching algorithm with a fixed number of wildcards to overcome the high percentage of the occurrence of wildcards in patterns. In our proposed method,patterns are matched as bit patterns using a sliding window approach. The window is a bit window that slides along the given text,matching against stored bit patterns. Matching process is executed using bit wise operations. The experimental results demonstrate that the percentage of wildcard occurrence does not affect the proposed algorithm’s performance and the proposed algorithm is more efficient than the algorithms based on the fast Fourier transform. The proposed algorithm is simple to implement and runs efficiently in O(n+d(n/σ)(m/w))time,where n is text length,d is symbol distribution over k patterns,m is pattern length,and σ is alphabet size.
引用
收藏
页码:178 / 184
页数:7
相关论文
共 50 条
  • [1] Multi-pattern matching with wildcards
    Zhang M.
    Zhang Y.
    Tang J.
    Bai X.
    [J]. Journal of Software, 2011, 6 (12 SPEC. ISSUE) : 2391 - 2398
  • [2] Multi-pattern Matching Algorithm with Wildcards Based on Euclidean Distance and Hash Function
    Saif, Ahmed Abdo Farhan
    Hu, Liang
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT I, 2016, 9786 : 334 - 344
  • [3] Faster string matching based on hashing and bit-parallelism
    Al-Ssulami, Abdulrakeeb M.
    Mathkour, Hassan
    [J]. INFORMATION PROCESSING LETTERS, 2017, 123 : 51 - 55
  • [4] HybridSA: GPU Acceleration of Multi-pattern Regex Matching using Bit Parallelism
    Le Glaunec, Alexis
    Kong, Lingkun
    Mamouras, Konstantinos
    [J]. Proceedings of the ACM on Programming Languages, 2024, 8 (OOPSLA2)
  • [5] Suffix array for multi-pattern matching with variable length wildcards
    Liu, Na
    Xie, Fei
    Wu, Xindong
    [J]. INTELLIGENT DATA ANALYSIS, 2021, 25 (02) : 283 - 303
  • [6] A Multi-pattern Matching Algorithm Based on WM Algorithm
    Yu, Genzhen
    Gao, Qinquan
    Meng, Fanlin
    Fu, Changhong
    Wu, Shunxiang
    [J]. PROCEEDING OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES, 2009, : 476 - 479
  • [7] A novel optimal multi-pattern matching method with wildcards for DNA sequence
    Wang, Xinlu
    Saif, Ahmed A. F.
    Liu, Dayou
    Zhu, Yungang
    Benediktsson, Jon Atli
    [J]. TECHNOLOGY AND HEALTH CARE, 2021, 29 : S115 - S124
  • [8] Increased bit-parallelism for approximate string matching
    Hyyrö, H
    Fredriksson, K
    Navarro, G
    [J]. EXPERIMENTAL AND EFFICIENT ALGORITHMS, 2004, 3059 : 285 - 298
  • [9] Multi-pattern Matching Algorithm Based on MapReduce and Hadoop
    Zhang, Wei
    Li, Baolu
    Li, Kun
    [J]. PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1856 - 1859
  • [10] Multi-dimensional packet classification based on bit-parallelism
    Wang Xue-Guang
    [J]. 2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 880 - 884