Linear and efficient string matching algorithms based on weak factor recognition

被引:5
|
作者
Cantone D. [1 ]
Faro S. [1 ]
Pavone A. [2 ]
机构
[1] Department of Mathematics and Computer Science, University of Catania, Viale A.Doria n.6, Catania
[2] Department of Cognitive Science, University of Messina, Via Concezione n.6, Messina
来源
关键词
Design and analysis of algorithms; Experimental evaluation; String matching; Text processing; Weak factor;
D O I
10.1145/3301295
中图分类号
学科分类号
摘要
We present a simple and very efficient algorithm for string matching based on the combination of weak factor recognition and hashing. Despite its quadratic worst-case running time, our algorithm exhibits a sublinear behaviour. We also propose some practical improvements of our algorithm and a variant with a linear worst-case time complexity. Experimental results show that, in most cases, some of the variants of our algorithm obtain the best running times when compared, under various conditions, against the most effective algorithms present in the literature. For instance, in the case of small alphabets and long patterns, the gain in running time is up to 18%. This makes our proposed algorithm one of the most flexible solutions in practical cases. © 2019 Association for Computing Machinery.
引用
收藏
相关论文
共 50 条
  • [31] Maximum-Shift String Matching Algorithms
    Kadhim, Hakem Adil
    AbdulRashid, NurAini
    2014 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2014,
  • [32] A string matching based algorithm for performance evaluation of mathematical expression recognition
    Kumar, P. Pavan
    Agarwal, Arun
    Bhagvati, Chakravarthy
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2014, 39 (01): : 63 - 79
  • [33] Technology beats algorithms (in exact string matching)
    Tarhio, Jorma
    Holub, Jan
    Giaquinta, Emanuele
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (12): : 1877 - 1885
  • [34] Faster algorithms for string matching with k mismatches
    Amir, A
    Lewenstein, M
    Porat, E
    PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000, : 794 - 803
  • [35] A string matching based algorithm for performance evaluation of mathematical expression recognition
    P PAVAN KUMAR
    ARUN AGARWAL
    CHAKRAVARTHY BHAGVATI
    Sadhana, 2014, 39 : 63 - 79
  • [36] Efficient string matching on coded texts
    Breslauer, D
    Gasieniec, L
    COMBINATORIAL PATTERN MATCHING, 1995, 937 : 27 - 40
  • [37] Efficient string matching on packed texts
    Breslauer, D
    Gasieniec, L
    RAIRO-INFORMATIQUE THEORIQUE ET APPLICATIONS-THEORETICAL INFORMATICS AND APPLICATIONS, 1996, 30 (06): : 521 - 544
  • [38] Efficient Approximation Algorithms for String Kernel Based Sequence Classification
    Farhan, Muhammad
    Tariq, Juvaria
    Zaman, Arif
    Shabbir, Mudassir
    Khan, Imdad Ullah
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [39] Effects of Suffix Repetition Rates of a String on the Performance of String Matching Algorithms
    Wang, Yang
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 53 - 58
  • [40] Entropy-Based Approach in Selection Exact String-Matching Algorithms
    Markic, Ivan
    Stula, Maja
    Zoric, Marija
    Stipanicev, Darko
    ENTROPY, 2021, 23 (01) : 1 - 19