Linear and efficient string matching algorithms based on weak factor recognition

Cited by: 5
Authors
Cantone D. [1]
Faro S. [1]
Pavone A. [2]
Affiliations
[1] Department of Mathematics and Computer Science, University of Catania, Viale A.Doria n.6, Catania
[2] Department of Cognitive Science, University of Messina, Via Concezione n.6, Messina
Keywords
Design and analysis of algorithms; Experimental evaluation; String matching; Text processing; Weak factor
DOI
10.1145/3301295
Abstract
We present a simple and very efficient algorithm for string matching based on the combination of weak factor recognition and hashing. Despite its quadratic worst-case running time, our algorithm exhibits a sublinear behaviour. We also propose some practical improvements of our algorithm and a variant with a linear worst-case time complexity. Experimental results show that, in most cases, some of the variants of our algorithm obtain the best running times when compared, under various conditions, against the most effective algorithms present in the literature. For instance, in the case of small alphabets and long patterns, the gain in running time is up to 18%. This makes our proposed algorithm one of the most flexible solutions in practical cases. © 2019 Association for Computing Machinery.
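The abstract names the two ingredients, weak factor recognition and hashing, without showing how they combine. The following is a minimal illustrative sketch of that general idea, not the authors' published algorithm: every factor of the pattern is hashed into a small table, each text window is scanned right to left, and the window is skipped as soon as the hash of the scanned suffix is absent from the table. The function name `weak_factor_search`, the shift-and-add hash, and the `table_bits` parameter are assumptions made for illustration. Hash collisions can make a non-factor look like a factor (hence "weak" recognition), so a full window is always verified by direct comparison.

```python
def weak_factor_search(pattern, text, table_bits=16):
    """Return the start indices of all occurrences of pattern in text.

    Illustrative sketch only: the hash function and table size are
    assumptions, not taken from the paper.
    """
    m, n = len(pattern), len(text)
    if m == 0 or n < m:
        return []

    mask = (1 << table_bits) - 1
    seen = bytearray(1 << table_bits)

    # Preprocessing: mark the hash of every non-empty factor of the pattern.
    # Each factor is hashed from its last character backwards, which is the
    # same order in which the search phase reads a text window.
    for end in range(m, 0, -1):
        h = 0
        for start in range(end - 1, -1, -1):
            h = ((h << 1) + ord(pattern[start])) & mask
            seen[h] = 1

    matches = []
    pos = m - 1  # index of the last character of the current window
    while pos < n:
        start = pos - m + 1
        h = 0
        i = pos
        while i >= start:
            h = ((h << 1) + ord(text[i])) & mask
            if not seen[h]:
                break  # text[i..pos] is certainly not a factor of the pattern
            i -= 1
        if i < start:
            # The whole window passed the weak test; verify it exactly.
            if text[start:pos + 1] == pattern:
                matches.append(start)
            pos += 1
        else:
            # No occurrence can contain text[i..pos], so the next candidate
            # window must begin after position i.
            pos = i + m
    return matches


if __name__ == "__main__":
    print(weak_factor_search("aba", "ababa"))
```

In this sketch the verification step followed by a shift of one position is what allows the quadratic worst case mentioned in the abstract, while the early exit on an unseen hash is what lets most windows be skipped after reading only a few characters.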