Fast approximate matching using suffix trees

被引:0
|
作者
Cobbs, AL
机构
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Let T be a text of length n and P a pattern of length m, both strings over a fixed finite alphabet Sigma. We wish to find all approximate occurrences of P in T having weighted edit distance at most k from P: this is the approximate substring matching problem. We focus on the case in which T is fixed and preprocessed in linear time, while P and k vary over consecutive searches. We give an O(mq + t(occ)) time and O(q) space algorithm, where q less than or equal to n depends on the problem instance, and t(occ) is the size of the output. The running time is proportional to the amount of matching, in the worst case as fast as standard dynamic programming. The algorithm uses the suffix tree representation of the text. The best previous algorithm requires O(mq log q + t(occ)) time and O(mq) space.
引用
收藏
页码:41 / 54
页数:14
相关论文
共 50 条
  • [1] Multidimensional matching and fast search in suffix trees
    Cole, R
    Lewenstein, M
    [J]. PROCEEDINGS OF THE FOURTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2003, : 851 - 852
  • [2] Approximate word sequence matching over sparse suffix trees
    Risvik, KM
    [J]. COMBINATORIAL PATTERN MATCHING, 1998, 1448 : 65 - 79
  • [3] The Gapped Suffix Array: A New Index Structure for Fast Approximate Matching
    Crochemore, Maxime
    Tischler, German
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, 2010, 6393 : 359 - 364
  • [4] Approximate string matching using compressed suffix arrays
    Huynh, TND
    Hon, WK
    Lam, TW
    Sung, WK
    [J]. THEORETICAL COMPUTER SCIENCE, 2006, 352 (1-3) : 240 - 249
  • [5] Approximate string matching using compressed suffix arrays
    Huynh, TND
    Hon, WK
    Lam, TW
    Sung, WK
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2004, 3109 : 434 - 444
  • [6] Fast algorithms for approximate Frechet matching queries in geometric trees
    Gudmundsson, Joachim
    Smid, Michiel
    [J]. COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2015, 48 (06): : 479 - 494
  • [7] Improved Approximate String Matching Using Compressed Suffix Data Structures
    Tak-Wah Lam
    Wing-Kin Sung
    Swee-Seong Wong
    [J]. Algorithmica, 2008, 51 : 298 - 314
  • [8] Improved approximate string matching using compressed suffix data structures
    Lam, Tak-Wah
    Sung, Wing-Kin
    Wong, Swee-Seong
    [J]. ALGORITHMICA, 2008, 51 (03) : 298 - 314
  • [9] Improved approximate string matching using compressed suffix data structures
    Lam, TW
    Sung, WK
    Wong, SS
    [J]. ALGORITHMS AND COMPUTATION, 2005, 3827 : 339 - 348
  • [10] APPROXIMATE STRING-MATCHING WITH SUFFIX AUTOMATA
    UKKONEN, E
    WOOD, D
    [J]. ALGORITHMICA, 1993, 10 (05) : 353 - 364