Multiple approximate string matching

被引:0
|
作者
BaezaYates, R [1 ]
Navarro, G [1 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present two new algorithms for on-line multiple approximate string matching. These are extensions of previous algorithms that search for a single pattern. The single-pattern version of the first one is based on the simulation with bits of a non-deterministic finite automaton built from the pattern and using the text as input. To search for multiple patterns, we superimpose their automata, using the result as a filter. The second algorithm partitions the pattern in sub-patterns that are searched with no errors, with a fast exact multipattern search algorithm. To handle multiple patterns, we search the sub-patterns of all of them together. The average running time achieved is in both cases O(n) for moderate error level, pattern length and number of patterns. They adapt (with higher costs) to the other cases. However, the algorithms differ in speed and thresholds of usefulness. We analyze theoretically when each algorithm should be used, and show experimentally that they are faster than previous solutions in a wide range of cases.
引用
收藏
页码:174 / 184
页数:11
相关论文
共 50 条
  • [21] AN IMPROVED ALGORITHM FOR APPROXIMATE STRING MATCHING
    GALIL, Z
    PARK, K
    SIAM JOURNAL ON COMPUTING, 1990, 19 (06) : 989 - 999
  • [22] Faster Filters for Approximate String Matching
    Karkkainen, Juha
    Na, Joong Chae
    PROCEEDINGS OF THE NINTH WORKSHOP ON ALGORITHM ENGINEERING AND EXPERIMENTS AND THE FOURTH WORKSHOP ON ANALYTIC ALGORITHMICS AND COMBINATORICS, 2007, : 84 - 90
  • [23] On approximate string matching of unique oligonucleotides
    Hyyrö, H
    Vihinen, M
    Juhola, M
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 960 - 964
  • [24] A comparison of approximate string matching algorithms
    Jokinen, P
    Tarhio, J
    Ukkonen, E
    SOFTWARE-PRACTICE & EXPERIENCE, 1996, 26 (12): : 1439 - 1458
  • [25] The stringdist Package for Approximate String Matching
    van der Loo, Mark P. J.
    R JOURNAL, 2014, 6 (01): : 111 - 122
  • [26] Fast index for approximate string matching
    Tsur, Dekel
    JOURNAL OF DISCRETE ALGORITHMS, 2010, 8 (04) : 339 - 345
  • [27] A metric index for approximate string matching
    Navarro, G
    Chávez, E
    THEORETICAL COMPUTER SCIENCE, 2006, 352 (1-3) : 266 - 279
  • [28] Approximate string matching in DNA sequences
    Cheng, LL
    Cheung, DW
    Yiu, SM
    EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2003, : 303 - 310
  • [29] A randomized algorithm for approximate string matching
    Atallah, MJ
    Chyzak, F
    Dumas, P
    ALGORITHMICA, 2001, 29 (03) : 468 - 486
  • [30] Approximate String Matching by Fuzzy Automata
    Snasel, Vaclav
    Keprt, Ales
    Abraham, Ajith
    Hassanien, Aboul Ella
    MAN-MACHINE INTERACTIONS, 2009, 59 : 281 - +