Efficient filtering of XML documents with XPath expressions

被引:0
|
作者
C.-Y. Chan
P. Felber
M. Garofalakis
R. Rastogi
机构
[1] Bell Laboratories,
[2] Lucent Technologies,undefined
[3] 600 Mountain Ave.,undefined
[4] NJ 07974,undefined
[5] USA; {cychan,undefined
[6] pascal,undefined
[7] minos,undefined
[8] rastogi}@research.bell-labs.com ,undefined
来源
The VLDB Journal | 2002年 / 11卷
关键词
Keywords: Data dissemination - Document filtering - Index structure - XML - XPath;
D O I
暂无
中图分类号
学科分类号
摘要
The publish/subscribe paradigm is a popular model for allowing publishers (i.e., data generators) to selectively disseminate data to a large number of widely dispersed subscribers (i.e., data consumers) who have registered their interest in specific information items. Early publish/subscribe systems have typically relied on simple subscription mechanisms, such as keyword or ”bag of words” matching, or simple comparison predicates on attribute values. The emergence of XML as a standard for information exchange on the Internet has led to an increased interest in using more expressive subscription mechanisms (e.g., based on XPath expressions) that exploit both the structure and the content of published XML documents. Given the increased complexity of these new data-filtering mechanisms, the problem of effectively identifying the subscription profiles that match an incoming XML document poses a difficult and important research challenge. In this paper, we propose a novel index structure, termed XTrie, that supports the efficient filtering of XML documents based on XPath expressions. Our XTrie index structure offers several novel features that, we believe, make it especially attractive for large-scale publish/subscribe systems. First, XTrie is designed to support effective filtering based on complex XPath expressions (as opposed to simple, single-path specifications). Second, our XTrie structure and algorithms are designed to support both ordered and unordered matching of XML data. Third, by indexing on sequences of elements organized in a trie structure and using a sophisticated matching algorithm, XTrie is able to both reduce the number of unnecessary index probes as well as avoid redundant matchings, thereby providing extremely efficient filtering. Our experimental results over a wide range of XML document and XPath expression workloads demonstrate that our XTrie index structure outperforms earlier approaches by wide margins.
引用
收藏
页码:354 / 379
页数:25
相关论文
共 50 条
  • [1] Efficient filtering of XML documents with XPath expressions
    Chan, CY
    Felber, P
    Garofalakis, M
    Rastogi, R
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 235 - 244
  • [2] Efficient filtering of XML documents with XPath expressions
    Chan, CY
    Felber, P
    Garofalakis, M
    Rastogi, R
    [J]. VLDB JOURNAL, 2002, 11 (04): : 354 - 379
  • [3] Efficient Filtering of XML Documents with XPath Expressions Containing Ancestor Axis
    Ning, Bo
    Liu, Chengfei
    Wang, Guoren
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 551 - +
  • [4] Filtering XPath expressions for XML access control
    Jeon, JM
    Chung, YD
    Kim, MH
    Lee, YJ
    [J]. COMPUTERS & SECURITY, 2004, 23 (07) : 591 - 605
  • [5] Filtering contents by efficient evaluation of XPath expressions
    Yuriyama, M
    Nakamura, H
    [J]. 2003 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2003, : 148 - 155
  • [6] XML filtering with XPath expressions containing parent and ancestor axes
    Ning, Bo
    Liu, Chengfei
    [J]. INFORMATION SCIENCES, 2012, 210 : 41 - 54
  • [7] XXS: Efficient XPath Evaluation on Compressed XML Documents
    Brisaboa, Nieves R.
    Cerdeira-Pena, Ana
    Navarro, Gonzalo
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2014, 32 (03)
  • [8] Efficient Eager XPath Filtering over XML Streams
    Hagio, Kazuhito
    Ohgami, Takashi
    Bannai, Hideo
    Takeda, Masayuki
    [J]. PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2011, 2011, : 30 - 44
  • [9] YFilter: Efficient and scalable filtering of XML documents
    Diao, YL
    Fischer, P
    Franklin, MJ
    To, R
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 341 - 342
  • [10] Filtering XML Streams by XPath and Keywords
    Bou, Savong
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. 16TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS 2014), 2014, : 410 - 419