PAMA: A fast string matching algorithm

被引：0

作者：

Lu, SF ^{[1
]}

Cao, F ^{[1
]}

Lu, Y ^{[1
]}

机构：

[1] Wayne State Univ, Detroit, MI 48202 USA

来源：

INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE | 2006年 / 17卷 / 02期

关键词：

D O I：

10.1142/S0129054106003875

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

String matching is a fundamental operation in computer science, and its performance has great impact on many applications including database query, text processing, DNA and protein sequence analysis. In this paper, we propose a fast string matching algorithm, PAMA (PAttern MAtching). The shift rule used by PAMA not only subsumes both the bad character rule and the good suffix rule employed by the well-known Boyer-Moore algorithm, but also employs an additional key observation to enable faster shifting during the string matching process. Theoretically, we prove that from the same alignment, the next shift of PAMA will be at least as much as that of the Boyer-Moore algorithm. Experimentally, we show that PAMA indeed significantly outperforms the original Boyer-Moore algorithm in almost all cases, and outperforms other Boyer-Moore variants such as Tuned-BM, Turbo-BM and Horspool for long patterns (length >= 128) or for small alphabets (size < 8).

引用

下载

页码：357 / 378

页数：22

共 50 条

[21] Fast string matching for DNA sequences
Ryu, Cheol
Lecroq, Thierry
Park, Kunsoo
THEORETICAL COMPUTER SCIENCE, 2020, 812 (137-148) : 137 - 148
[22] Fast index for approximate string matching
Tsur, Dekel
JOURNAL OF DISCRETE ALGORITHMS, 2010, 8 (04) : 339 - 345
[23] Fast kernels for inexact string matching
Leslie, C
Kuang, R
LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 114 - 128
[24] Fast string matching for multiple searches
Fenwick, P
SOFTWARE-PRACTICE & EXPERIENCE, 2001, 31 (09): : 815 - 833
[25] FAST AND PRACTICAL APPROXIMATE STRING MATCHING
BAEZAYATES, RA
PERLEBERG, CH
LECTURE NOTES IN COMPUTER SCIENCE, 1992, 644 : 185 - 192
[26] FAST STRING-MATCHING WITH MISMATCHES
BAEZAYATES, RA
GONNET, GH
INFORMATION AND COMPUTATION, 1994, 108 (02) : 187 - 199
[27] Fast and practical approximate string matching
BaezaYates, RA
Perleberg, CH
INFORMATION PROCESSING LETTERS, 1996, 59 (01) : 21 - 27
[28] FAST APPROXIMATE STRING MATCHING.
Owolabi, O.
McGregor, D.R.
Software - Practice and Experience, 1988, 18 (04) : 387 - 393
[29] A very fast string matching algorithm for small alphabets and long patterns (Extended abstract)
Charras, C
Lecroq, T
Pehoushek, JD
COMBINATORIAL PATTERN MATCHING, 1998, 1448 : 55 - 64
[30] A fast bit-vector algorithm for approximate string matching based on dynamic programming
Myers, G
JOURNAL OF THE ACM, 1999, 46 (03) : 395 - 415

← 1 2 3 4 5 →