Navigation- vs. index-based XML multi-query processing

被引:26
|
作者
Bruno, N [1 ]
Gravano, L [1 ]
Koudas, N [1 ]
Srivastava, D [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
关键词
D O I
10.1109/ICDE.2003.1260788
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. In this paper we introduce a new index-based technique, Index-Filter, to answer multiple XML path queries. Index-Filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze Index-Filter and compare it against Y-Filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster We present experimental results over real and synthetic XML documents that validate our claims.
引用
收藏
页码:139 / 150
页数:12
相关论文
共 50 条
  • [1] Multi-query processing of XML data streams on multicore
    Kim, Soo-Hyung
    Lee, Kyong-Ha
    Lee, Yoon-Joon
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2339 - 2368
  • [2] Multi-query processing of XML data streams on multicore
    Soo-Hyung Kim
    Kyong-Ha Lee
    Yoon-Joon Lee
    [J]. The Journal of Supercomputing, 2017, 73 : 2339 - 2368
  • [3] Index-Based Batch Query Processing Revisited
    Mackenzie, Joel
    Moffat, Alistair
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 86 - 100
  • [4] Multi-Query Stream Processing on FPGAs
    Sadoghi, Mohammad
    Javed, Rija
    Tarafdar, Naif
    Singh, Harsh
    Palaniappan, Rohan
    Jacobsen, Hans-Arno
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1229 - 1232
  • [5] Multi-query Optimization for Distributed Similarity Query Processing
    Zhuang, Yi
    Li, Qing
    Chen, Lei
    [J]. 28TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2008, : 639 - +
  • [6] Index-based query processing on distributed multidimensional data
    Tsatsanifos, George
    Sacharidis, Dimitris
    Sellis, Timos
    [J]. GEOINFORMATICA, 2013, 17 (03) : 489 - 519
  • [7] Index-based query processing on distributed multidimensional data
    George Tsatsanifos
    Dimitris Sacharidis
    Timos Sellis
    [J]. GeoInformatica, 2013, 17 : 489 - 519
  • [8] A Distributed Engine for Multi-query Processing Based on Predicates with Spark
    Zhang, Bin
    Sun, Ximin
    Bi, Liwei
    Zhao, Changhao
    Chen, Xin
    Li, Xin
    Sun, Lei
    [J]. WEB AND BIG DATA, 2021, 1505 : 27 - 36
  • [9] An efficient co-operative framework for multi-query processing over compressed XML data
    He, Juzhen
    Ng, Wilfred
    Wang, Xiaoling
    Zhou, Aoying
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 218 - 232
  • [10] XTREAM: An efficient multi-query evaluation on streaming XML data
    Min, Jun-Ki
    Park, Myung-Jae
    Chung, Chin-Wan
    [J]. INFORMATION SCIENCES, 2007, 177 (17) : 3519 - 3538