Approximate Circular Pattern Matching Under Edit Distance

被引:0
|
作者
Charalampopoulos, Panagiotis [1 ]
Pissis, Solon P. [2 ,3 ]
Radoszewski, Jakub [4 ]
Rytter, Wojciech [4 ]
Walen, Tomasz [4 ]
Zuba, Wiktor [2 ]
机构
[1] Birkbeck Univ London, London, England
[2] CWI, Amsterdam, Netherlands
[3] Vrije Univ, Amsterdam, Netherlands
[4] Univ Warsaw, Warsaw, Poland
关键词
circular pattern matching; approximate pattern matching; edit distance; FASTER; TIME;
D O I
10.4230/LIPIcs.STACS.2024.24
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the k-Edit Circular Pattern Matching (k-Edit CPM) problem, we are given a length-n text T, a length-m pattern P, and a positive integer threshold k, and we are to report all starting positions of the substrings of T that are at edit distance at most k from some cyclic rotation of P. In the decision version of the problem, we are to check if any such substring exists. Very recently, Charalampopoulos et al. [ESA 2022] presented O(nk(2))-time and O(nk log(3) k) -time solutions for the reporting and decision versions of k-Edit CPM, respectively. Here, we show that the reporting and decision versions of k-Edit CPM can be solved in (n (n + m) k(6)) time and O(n+ (n + m) k(5) log(3) k) time, respectively, thus obtaining the first algorithms with a complexity of the type O(n + (n1m)poly(k)) for this problem. Notably, our algorithms run in O(n) time when m = Omega(k6) and are superior to the previous respective solutions when m = omega(k(4)). We provide a meta-algorithm that yields efficient algorithms in several other interesting settings, such as when the strings are given in a compressed form (as straight-line programs), when the strings are dynamic, or when we have a quantum computer. We obtain our solutions by exploiting the structure of approximate circular occurrences of P in T, when T is relatively short w.r.t. P. Roughly speaking, either the starting positions of approximate occurrences of rotations of P form Omega(k(4)) intervals that can be computed efficiently, or some rotation of P is almost periodic (is at a small edit distance from a string with small period). Dealing with the almost periodic case is the most technically demanding part of this work; we tackle it using properties of locked fragments (originating from [Cole and Hariharan, SICOMP 2002]).
引用
收藏
页数:22
相关论文
共 50 条
  • [21] A Filter-Based Approach for Approximate Circular Pattern Matching
    Azim, Md. Aashikur Rahman
    Iliopoulos, Costas S.
    Rahman, M. Sohel
    Samiruzzaman, Mohammad
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2015), 2015, 9096 : 24 - 35
  • [22] Block edit models for approximate string matching
    Lopresti, D
    Tomkins, A
    [J]. THEORETICAL COMPUTER SCIENCE, 1997, 181 (01) : 159 - 179
  • [23] Block edit models for approximate string matching
    Panasonic Technologies, Inc, Princeton, United States
    [J]. Theor Comput Sci, 1 (159-179):
  • [24] Reducing approximate Longest Common Subsequence to approximate Edit Distance
    Rubinstein, Aviad
    Song, Zhao
    [J]. PROCEEDINGS OF THE THIRTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA'20), 2020, : 1591 - 1600
  • [25] Reducing approximate Longest Common Subsequence to approximate Edit Distance
    Rubinstein, Aviad
    Song, Zhao
    [J]. PROCEEDINGS OF THE 2020 ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2020, : 1591 - 1600
  • [26] Approximate Graph Edit Distance in Quadratic Time
    Riesen, Kaspar
    Ferrer, Miquel
    Bunke, Horst
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 483 - 494
  • [27] Approximate geometric pattern matching under rigid motions
    Goodrich, MT
    Mitchell, JSB
    Orletsky, MW
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (04) : 371 - 379
  • [28] The String Edit Distance Matching Problem With Moves
    Cormode, Graham
    Muthukrishnan, S.
    [J]. ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (01)
  • [29] The string edit distance matching problem with moves
    Cormode, G
    Muthukrishnan, S
    [J]. PROCEEDINGS OF THE THIRTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2002, : 667 - 676
  • [30] Efficient relational matching with local edit distance
    Myers, R
    Wilson, RC
    Hancock, ER
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1711 - 1714