String matching in hypertext

Authors
Park, K
Kim, DK
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In this paper we consider the string matching problem in hypertext, which is a nonlinear text structure. We model the hypertext as a directed graph G = (V, E), where each node v ∈ V has a text T_v associated with it and each link (v, w) ∈ E connects the end of text T_v to the start of text T_w. We define the string matching problem in hypertext as follows: given a graph G modeling a hypertext and a pattern P, find all occurrences of the pattern in G. The pattern length is m, and the sum of the lengths of all texts T_v in G is N. The main difficulty in hypertext string matching is that the pattern may occur across links. A linear-time algorithm is known for the case when G is a tree. In this paper we present a linear O(N + |E|) time algorithm for the case when n(v) = length(T_v) is greater than or equal to m for all v, and a more involved algorithm that takes O(N + |E|m) time when there exist some nodes v with n(v) < m. To obtain these results, we combine the notion of witnesses and duels with the suffix tree, which enables us to eliminate possible occurrences of any substring of the pattern.
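To make the problem statement concrete, the following is a minimal brute-force sketch of matching a pattern across links. It is not the authors' algorithm: it uses no witnesses, duels, or suffix trees, and it can take exponential time on heavily branching graphs. The function name `find_occurrences`, the dictionary-based graph representation, and the convention of reporting an occurrence by its starting (node, offset) pair are all assumptions made for illustration. It also assumes every node text is non-empty, which bounds the recursion depth by the pattern length even when the graph has cycles.

```python
def find_occurrences(texts, edges, pattern):
    """Brute-force hypertext matching (illustrative only).

    texts:   dict mapping node -> non-empty string (the text T_v)
    edges:   dict mapping node -> list of successor nodes (links v -> w)
    pattern: non-empty string P
    Returns a sorted list of (node, offset) pairs at which P can be read
    along at least one path in the graph.
    """
    if not pattern:
        raise ValueError("pattern must be non-empty")
    if any(not t for t in texts.values()):
        raise ValueError("this sketch assumes every node text is non-empty")

    def matches_from(node, offset, k):
        # Can pattern[k:] be read starting at texts[node][offset:]?
        text = texts[node]
        available = len(text) - offset
        needed = len(pattern) - k
        step = min(available, needed)
        if text[offset:offset + step] != pattern[k:k + step]:
            return False
        if step == needed:
            return True  # the rest of the pattern fits inside this node
        # The node's text is exhausted: follow each outgoing link,
        # continuing at the start of the successor's text.
        return any(matches_from(w, 0, k + step) for w in edges.get(node, ()))

    return sorted(
        (v, i)
        for v, text in texts.items()
        for i in range(len(text))
        if matches_from(v, i, 0)
    )


# Hypothetical three-node hypertext with a cycle between "a" and "b".
texts = {"a": "abc", "b": "de", "c": "dx"}
edges = {"a": ["b", "c"], "b": ["a"]}
print(find_occurrences(texts, edges, "cde"))
```

In this example the pattern "cde" starts at the last character of node "a" and finishes in node "b", which is exactly the cross-link case the abstract identifies as the main difficulty.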
Pages: 318-329
Page count: 12