String matching in hypertext

被引:0
|
作者
Park, K
Kim, DK
机构
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we consider the string matching problem in hypertext, which is a nonlinear structure of text. We model the hypertext as a directed graph G = (V, E), where each node v is an element of V has text T-v associated with it and each link (v, w) is an element of E connects the end of text T-v to the start of text T-w. We define the string matching problem in hypertext as follows: Given a graph G modeling a hypertext and a pattern P, find all occurrences of the pattern in graph G. The pattern length is m and the sum of the lengths of all texts T-v in G is N. The main difficulty in the hypertext string matching problem is that the pattern may occur across links. There is a linear time algorithm for the case when graph G is a tree. In this paper we present a linear O(N + \E\) time algorithm when n(v) length (T-v) is larger than or equal to m for all v, and a more involved algorithm that takes O(N + \E\m) time when there exist some nodes v with n(v) < m. To obtain the results, we combine the notion of witnesses and duels with the suffix tree, which enables us to eliminate possible occurrences of any substring of the pattern.
引用
收藏
页码:318 / 329
页数:12
相关论文
共 50 条
  • [1] Pattern matching in hypertext
    Amir, A
    Lewenstein, M
    Lewenstein, N
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2000, 35 (01): : 82 - 99
  • [2] Pattern matching in hypertext
    Amir, A
    Lewenstein, M
    Lewenstein, N
    ALGORITHMS AND DATA STRUCTURES, 1997, 1272 : 160 - 173
  • [3] Improved approximate pattern matching on hypertext
    Navarro, G
    LATIN '98: THEORETICAL INFORMATICS, 1998, 1380 : 352 - 357
  • [4] Improved approximate pattern matching on hypertext
    Navarro, G
    THEORETICAL COMPUTER SCIENCE, 2000, 237 (1-2) : 455 - 463
  • [5] String matching with lookahead
    Peltola, Hannu
    Tarhio, Jorma
    DISCRETE APPLIED MATHEMATICS, 2014, 163 : 352 - 360
  • [6] APPROXIMATE STRING MATCHING
    HALL, PAV
    DOWLING, GR
    COMPUTING SURVEYS, 1980, 12 (04) : 381 - 402
  • [7] String matching on the internet
    Brönnimann, H
    Memon, N
    Shanmugasundaram, K
    COMBINATORIAL AND ALGORITHMIC ASPECTS OF NETWORKING, 2005, 3405 : 75 - 89
  • [8] String Range Matching
    Karkkainen, Juha
    Kempa, Dominik
    Puglisi, Simon J.
    COMBINATORIAL PATTERN MATCHING, CPM 2014, 2014, 8486 : 232 - 241
  • [9] Parallel String Matching
    Pfaffe, Philip
    Tillmann, Martin
    Lutteropp, Sarah
    Scheirle, Bernhard
    Zerr, Kevin
    EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 187 - 198
  • [10] GENERALIZED STRING MATCHING
    ABRAHAMSON, K
    SIAM JOURNAL ON COMPUTING, 1987, 16 (06) : 1039 - 1051