Multiple approximate string matching

被引:0
|
作者
BaezaYates, R [1 ]
Navarro, G [1 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present two new algorithms for on-line multiple approximate string matching. These are extensions of previous algorithms that search for a single pattern. The single-pattern version of the first one is based on the simulation with bits of a non-deterministic finite automaton built from the pattern and using the text as input. To search for multiple patterns, we superimpose their automata, using the result as a filter. The second algorithm partitions the pattern in sub-patterns that are searched with no errors, with a fast exact multipattern search algorithm. To handle multiple patterns, we search the sub-patterns of all of them together. The average running time achieved is in both cases O(n) for moderate error level, pattern length and number of patterns. They adapt (with higher costs) to the other cases. However, the algorithms differ in speed and thresholds of usefulness. We analyze theoretically when each algorithm should be used, and show experimentally that they are faster than previous solutions in a wide range of cases.
引用
收藏
页码:174 / 184
页数:11
相关论文
共 50 条
  • [31] Approximate string matching for music analysis
    Clifford, R
    Iliopoulos, C
    SOFT COMPUTING, 2004, 8 (09) : 597 - 603
  • [32] A parallel algorithm for approximate string matching
    Kaplan, K
    Burge, LL
    Garuba, M
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1844 - 1848
  • [33] Approximate string matching for music analysis
    R. Clifford
    C. Iliopoulos
    Soft Computing, 2004, 8 : 597 - 603
  • [34] AN APPROXIMATE STRING-MATCHING ALGORITHM
    KIM, JY
    SHAWETAYLOR, J
    THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) : 107 - 117
  • [35] Approximate String Matching with Reduced Alphabet
    Salmela, Leena
    Tarhio, Jorma
    ALGORITHMS AND APPLICATIONS: ESSAYS DEDICATED TO ESKO UKKONEN ON THE OCCASION OF HIS 60TH BIRTHDAY, 2010, 6060 : 210 - +
  • [36] Compressed Indexes for Approximate String Matching
    Chan, Ho-Leung
    Lam, Tak-Wah
    Sung, Wing-Kin
    Tam, Siu-Lung
    Wong, Swee-Seong
    ALGORITHMICA, 2010, 58 (02) : 263 - 281
  • [37] Compressed Indexes for Approximate String Matching
    Ho-Leung Chan
    Tak-Wah Lam
    Wing-Kin Sung
    Siu-Lung Tam
    Swee-Seong Wong
    Algorithmica, 2010, 58 : 263 - 281
  • [38] Approximate string matching with suffix automata
    Ukkonen, Wesko
    Wood, Derick
    Algorithmica (New York), 1993, 10 (05): : 353 - 364
  • [39] Compressed indexes for approximate string matching
    Chan, Ho-Leung
    Lam, Tak-Wah
    Sung, Wing-Kin
    Tam, Siu-Lung
    Wong, Swee-Seong
    ALGORITHMS - ESA 2006, PROCEEDINGS, 2006, 4168 : 208 - 219
  • [40] AN IMPROVED ALGORITHM FOR APPROXIMATE STRING MATCHING
    GALIL, Z
    PARK, K
    LECTURE NOTES IN COMPUTER SCIENCE, 1989, 372 : 394 - 404