XXS: Efficient XPath Evaluation on Compressed XML Documents

被引:0
|
作者
Brisaboa, Nieves R. [1 ]
Cerdeira-Pena, Ana [1 ]
Navarro, Gonzalo [2 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Fac Informat, La Coruna 15071, Spain
[2] Univ Chile, Dept Comp Sci, Santiago, Chile
关键词
Algorithms; Performance; Semistructured data; XML; XPath; compression; self-index; NATURAL-LANGUAGE TEXT; REPRESENTATION;
D O I
10.1145/2629554
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The eXtensible Markup Language (XML) is acknowledged as the de facto standard for semistructured data representation and data exchange on the Web and many other scenarios. A well-known shortcoming of XML is its verbosity, which increases manipulation, transmission, and processing costs. Various structure-blind and structure-conscious compression techniques can be applied to XML, and some are even access-friendly, meaning that the documents can be efficiently accessed in compressed form. Direct access is necessary to implement the query languages XPath and XQuery, which are the standard ones to exploit the expressiveness of XML. While a good deal of theoretical and practical proposals exist to solve XPath/XQuery operations on XML, only a few ones are well integrated with a compression format that supports the required access operations on the XML data. In this work we go one step further and design a compression format for XML collections that boosts the performance of XPath queries on the data. This is done by designing compressed representations of the XML data that support some complex operations apart from just accessing the data, and those are exploited to solve key components of the XPath queries. Our system, called XXS, is aimed at XML collections containing natural language text, which are compressed to within 35%-50% of their original size while supporting a large subset of XPath operations in time competitive with, and many times outperforming, the best state-of-the-art systems that work on uncompressed representations.
引用
收藏
页数:37
相关论文
共 50 条
  • [1] Efficient filtering of XML documents with XPath expressions
    Chan, CY
    Felber, P
    Garofalakis, M
    Rastogi, R
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 235 - 244
  • [2] Efficient filtering of XML documents with XPath expressions
    Chan, CY
    Felber, P
    Garofalakis, M
    Rastogi, R
    [J]. VLDB JOURNAL, 2002, 11 (04): : 354 - 379
  • [3] Efficient filtering of XML documents with XPath expressions
    C.-Y. Chan
    P. Felber
    M. Garofalakis
    R. Rastogi
    [J]. The VLDB Journal, 2002, 11 : 354 - 379
  • [4] Efficient Filtering of XML Documents with XPath Expressions Containing Ancestor Axis
    Ning, Bo
    Liu, Chengfei
    Wang, Guoren
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 551 - +
  • [5] Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework
    Hricov, Radoslav
    Senk, Adam
    Kroha, Petr
    Valenta, Michal
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION, 2017, 716 : 28 - 41
  • [6] Efficient Processing XPath Queries by Compressed XML Query Tree based on Structural Index
    Zhang, Haiwei
    Hu, Xiangyu
    Zhang, Ying
    Wen, Yanlong
    Yuan, Xiaojie
    [J]. MECHATRONICS AND INTELLIGENT MATERIALS, PTS 1 AND 2, 2011, 211-212 : 726 - 730
  • [7] XPath plus : A Tool for Linked XML Documents Navigation
    da Silva, Paulo Caetano
    Times, Valeria Cesario
    [J]. DATABASE AND XML TECHNOLOGIES, PROCEEDINGS, 2009, 5679 : 67 - 74
  • [8] Automata for positive core XPath queries on compressed documents
    Fila, Barbara
    Anantharaman, Siva
    [J]. LOGIC FOR PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND REASONING, PROCEEDINGS, 2006, 4246 : 467 - +
  • [9] A methodology for coupling fragments of XPath with structural indexes for XML documents
    Fletcher, George H. L.
    Van Gucht, Dirk
    Wu, Yuqing
    Gyssens, Marc
    Brenes, Sofia
    Paredaens, Jan
    [J]. DATABASE PROGRAMMING LANGUAGES, 2007, 4797 : 48 - +
  • [10] Indexing XML documents for XPath query processing in external memory
    Chen, Qun
    Lim, Andrew
    Ong, Kian Win
    Tang, Jiqing
    [J]. DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 681 - 699