Exact And Approximate Pattern Matching In The Streaming Model

被引:39
|
作者
Porat, Benny [1 ]
Porat, Ely [1 ]
机构
[1] Bar Ilan Univ, IL-52100 Ramat Gan, Israel
关键词
K-MISMATCHES; ALGORITHMS;
D O I
10.1109/FOCS.2009.11
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a fully online randomized algorithm for the classical pattern matching problem that uses merely O(log m) space(1), breaking the O(m) barrier that held for this problem for a long time. Our method can be used as a tool in many practical applications, including monitoring Internet traffic and firewall applications. In our online model we first receive the pattern P of size m and preprocess it. After the preprocessing phase, the characters of the text T of size n arrive one at a time in an online fashion. For each index of the text input we indicate whether the pattern matches the text at that location index or not. Clearly, for index i, an indication can only be given once all characters from index i till index i + m - 1 have arrived. Our goal is to provide such answers while using minimal space, and while spending as little time as possible on each character (time and space which are in O(poly log(n))). We present an algorithm whereby both false positive and false negative answers are allowed with probability of at most 1/n(3). Thus, overall, the correct answer for all positions is returned with a probability of 1/n(2). The time which our algorithm spends on each input character is bounded by O(log m), and the space complexity is O(log m) words. We also present a solution in the same model for the pattern matching with k mismatches problem. In this problem, a match means allowing up to k symbol mismatches between the pattern and the subtext beginning at index i. We provide an algorithm in which the time spent on each character is bounded by O(k(2) poly(log m)), and the space complexity is O(k(3) poly(log m)) words.
引用
收藏
页码:315 / 323
页数:9
相关论文
共 50 条
  • [41] Reconfigurable approximate pattern matching architectures for nanotechnology
    Annampedu, Viswanath
    Wagh, Meghanad D.
    MICROELECTRONICS JOURNAL, 2007, 38 (03) : 430 - 438
  • [42] A linear size index for approximate pattern matching
    Chan, Ho-Leung
    Lam, Tak-Wah
    Sung, Wing-Kin
    Tam, Siu-Lung
    Wong, Swee-Seong
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2006, 4009 : 49 - 59
  • [43] Approximate pattern matching and transitive closure logics
    Lemström, K
    Hella, L
    THEORETICAL COMPUTER SCIENCE, 2003, 299 (1-3) : 387 - 412
  • [44] Efficient Algorithm for δ - Approximate Jumbled Pattern Matching
    Castellanos, Ivan
    Pinzon, Yoan
    PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2015, 2015, : 47 - 56
  • [45] NetDAP: (δ, γ) −approximate pattern matching with length constraints
    Youxi Wu
    Jinquan Fan
    Yan Li
    Lei Guo
    Xindong Wu
    Applied Intelligence, 2020, 50 : 4094 - 4116
  • [46] A fast and exact single pattern matching algorithm
    Fan, Hongbo
    Yao, Nianmin
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (08): : 1341 - 1348
  • [47] Combinatorial and experimental methods for approximate point pattern matching
    Gavrilov, M
    Indyk, P
    Motwani, R
    Venkatasubramanian, S
    ALGORITHMICA, 2004, 38 (01) : 59 - 90
  • [48] Combinatorial and Experimental Methods for Approximate Point Pattern Matching
    Martin Gavrilov
    Piotr Indyk
    Rajeev Motwani
    Suresh Venkatasubramanian
    Algorithmica , 2004, 38 : 59 - 90
  • [49] Approximate geometric pattern matching under rigid motions
    Goodrich, MT
    Mitchell, JSB
    Orletsky, MW
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (04) : 371 - 379
  • [50] Approximate Circular Pattern Matching Under Edit Distance
    Charalampopoulos, Panagiotis
    Pissis, Solon P.
    Radoszewski, Jakub
    Rytter, Wojciech
    Walen, Tomasz
    Zuba, Wiktor
    41ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE, STACS 2024, 2024, 289