Approximate Circular Pattern Matching Under Edit Distance

被引:0
|
作者
Charalampopoulos, Panagiotis [1 ]
Pissis, Solon P. [2 ,3 ]
Radoszewski, Jakub [4 ]
Rytter, Wojciech [4 ]
Walen, Tomasz [4 ]
Zuba, Wiktor [2 ]
机构
[1] Birkbeck Univ London, London, England
[2] CWI, Amsterdam, Netherlands
[3] Vrije Univ, Amsterdam, Netherlands
[4] Univ Warsaw, Warsaw, Poland
关键词
circular pattern matching; approximate pattern matching; edit distance; FASTER; TIME;
D O I
10.4230/LIPIcs.STACS.2024.24
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the k-Edit Circular Pattern Matching (k-Edit CPM) problem, we are given a length-n text T, a length-m pattern P, and a positive integer threshold k, and we are to report all starting positions of the substrings of T that are at edit distance at most k from some cyclic rotation of P. In the decision version of the problem, we are to check if any such substring exists. Very recently, Charalampopoulos et al. [ESA 2022] presented O(nk(2))-time and O(nk log(3) k) -time solutions for the reporting and decision versions of k-Edit CPM, respectively. Here, we show that the reporting and decision versions of k-Edit CPM can be solved in (n (n + m) k(6)) time and O(n+ (n + m) k(5) log(3) k) time, respectively, thus obtaining the first algorithms with a complexity of the type O(n + (n1m)poly(k)) for this problem. Notably, our algorithms run in O(n) time when m = Omega(k6) and are superior to the previous respective solutions when m = omega(k(4)). We provide a meta-algorithm that yields efficient algorithms in several other interesting settings, such as when the strings are given in a compressed form (as straight-line programs), when the strings are dynamic, or when we have a quantum computer. We obtain our solutions by exploiting the structure of approximate circular occurrences of P in T, when T is relatively short w.r.t. P. Roughly speaking, either the starting positions of approximate occurrences of rotations of P form Omega(k(4)) intervals that can be computed efficiently, or some rotation of P is almost periodic (is at a small edit distance from a string with small period). Dealing with the almost periodic case is the most technically demanding part of this work; we tackle it using properties of locked fragments (originating from [Cole and Hariharan, SICOMP 2002]).
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Restricted transposition invariant approximate string matching under edit distance
    Hyyro, Heikki
    [J]. String Processing and Information Retrieval, Proceedings, 2005, 3772 : 256 - 266
  • [2] An Edit-Distance Model for the Approximate Matching of Timed Strings
    Dobrisek, Simon
    Zibert, Janez
    Pavesic, Nikola
    Mihelic, France
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) : 736 - 741
  • [3] Online Pattern Matching for String Edit Distance with Moves
    Takabatake, Yoshimasa
    Tabei, Yasuo
    Sakamoto, Hiroshi
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 203 - 214
  • [4] Using edit distance in point-pattern matching
    Mäkinen, V
    [J]. EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2001, : 153 - 161
  • [5] Frequent Pattern Mining based on Approximate Edit Distance Matrix
    Guo, Dan
    Yuan, Ermao
    Hu, Xuegang
    [J]. 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 179 - 188
  • [6] EDAM: Edit Distance tolerant Approximate Matching content addressable memory
    Hanhan, Robert
    Garzon, Esteban
    Jahshan, Zuher
    Teman, Adam
    Lanuzza, Marco
    Yavits, Leonid
    [J]. PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22), 2022, : 495 - 507
  • [7] Approximate graph edit distance computation by means of bipartite graph matching
    Riesen, Kaspar
    Bunke, Horst
    [J]. IMAGE AND VISION COMPUTING, 2009, 27 (07) : 950 - 959
  • [8] Understanding Cloud Data Using Approximate String Matching and Edit Distance
    Jupin, Joseph
    Shi, Justin Y.
    Obradovic, Zoran
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1234 - 1243
  • [9] Approximate Graph Edit Distance Guided by Bipartite Matching of Bags of Walks
    Gauezere, Benoit
    Bougleux, Sebastien
    Riesen, Kaspar
    Brun, Luc
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 73 - 82
  • [10] Faster Pattern Matching under Edit Distance A Reduction to Dynamic Puzzle Matching and the Seaweed Monoid of Permutation Matrices
    Charalampopoulos, Panagiotis
    Kociumaka, Tomasz
    Wellnitz, Philip
    [J]. 2022 IEEE 63RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2022, : 698 - 707