Multiple approximate string matching

被引:0
|
作者
BaezaYates, R [1 ]
Navarro, G [1 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present two new algorithms for on-line multiple approximate string matching. These are extensions of previous algorithms that search for a single pattern. The single-pattern version of the first one is based on the simulation with bits of a non-deterministic finite automaton built from the pattern and using the text as input. To search for multiple patterns, we superimpose their automata, using the result as a filter. The second algorithm partitions the pattern in sub-patterns that are searched with no errors, with a fast exact multipattern search algorithm. To handle multiple patterns, we search the sub-patterns of all of them together. The average running time achieved is in both cases O(n) for moderate error level, pattern length and number of patterns. They adapt (with higher costs) to the other cases. However, the algorithms differ in speed and thresholds of usefulness. We analyze theoretically when each algorithm should be used, and show experimentally that they are faster than previous solutions in a wide range of cases.
引用
收藏
页码:174 / 184
页数:11
相关论文
共 50 条
  • [1] Improved single and multiple approximate string matching
    Fredriksson, K
    Navarro, G
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2004, 3109 : 457 - 471
  • [2] Average complexity of exact and approximate multiple string matching
    Navarro, G
    Fredriksson, K
    [J]. THEORETICAL COMPUTER SCIENCE, 2004, 321 (2-3) : 283 - 290
  • [3] Average-optimal multiple approximate string matching
    Fredriksson, K
    Navarro, G
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 109 - 128
  • [4] New and faster filters for multiple approximate string matching
    BaezaYates, R
    Navarro, G
    [J]. RANDOM STRUCTURES & ALGORITHMS, 2002, 20 (01) : 23 - 49
  • [5] APPROXIMATE STRING MATCHING
    HALL, PAV
    DOWLING, GR
    [J]. COMPUTING SURVEYS, 1980, 12 (04) : 381 - 402
  • [6] ALGORITHMS FOR APPROXIMATE STRING MATCHING
    UKKONEN, E
    [J]. INFORMATION AND CONTROL, 1985, 64 (1-3): : 100 - 118
  • [7] A Preprocessing for Approximate String Matching
    Baba, Kensuke
    Nakatoh, Tetsuya
    Yamada, Yasuhiro
    Ikeda, Daisuke
    [J]. INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II, 2011, 252 : 610 - +
  • [8] Spatial Approximate String Matching
    Katsumata, Akifumi
    Miura, Takao
    [J]. 2009 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 123 - 128
  • [9] Faster approximate string matching
    BaezaYates, R
    Navarro, G
    [J]. ALGORITHMICA, 1999, 23 (02) : 127 - 158
  • [10] Approximate String Matching with SIMD
    Fiori, Fernando J.
    Pakalen, Waltteri
    Tarhio, Jorma
    [J]. COMPUTER JOURNAL, 2022, 65 (06): : 1472 - 1488