Cost-based holistic twig joins

被引:3
|
作者
Baca, Radim [1 ]
Lukas, Petr [1 ]
Kratky, Michal [1 ]
机构
[1] Tech Univ Ostrava, VSB, Fac Elect Engn & Comp Sci, Dept Comp Sci, Ostrava, Czech Republic
关键词
XML; Query processing; Cost-based approaches; Holistic algorithms; Access path; XSKETCH SYNOPSES; STRUCTURAL JOINS; QUERY; BENCHMARK;
D O I
10.1016/j.is.2015.03.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An evaluation of XML queries such as XQuery or XPath expressions represents a challenging task due to its complexity. Many algorithms have been introduced to cope with this problem. Some of them, called binary joins, evaluate separated parts of a query and subsequently merge intermediate results, while the others, called holistic twig joins, evaluate a query as a whole. Moreover, these algorithms also differ in what index data structure they use to handle XML data. There exist cost-based approaches utilizing binary joins and various index data structures; however, they share a limitation. The limitation is that they cannot perform a join between query nodes not having a direct XPath relationship. Such a join can be advantageous especially if their joint selectivity is high. Since holistic joins work with all query nodes they overcome this limitation. In this article, we introduce such a holistic twig join called CostTwigJoin. To the best of our knowledge, CostTwigjoin is the first holistic join capable of combining various index data structures during an evaluation of an XML query. Usage of the holistic join has yet another advantage for cost-based approaches: an optimizer does not have to resolve the order of binary joins; therefore, the search space is reduced. In this article, we perform thorough experiments on hundreds of queries to evaluate our approach and demonstrate its advantages. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:21 / 33
页数:13
相关论文
共 50 条
  • [1] C-Tree Indexing for Holistic Twig Joins
    Ning, Bo
    Liu, Chengfei
    Wang, Guoren
    [J]. ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2009, 5446 : 76 - +
  • [2] Cost-based query optimization for multi reachability joins
    Cheng, Jiefeng
    Yu, Jeffrey Xu
    Ding, Bolin
    [J]. ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 18 - +
  • [3] Using Structural Joins and Holistic Twig Joins for Native XML Query Optimization
    Weiner, Andreas M.
    Haerder, Theo
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, 5739 : 149 - 163
  • [4] Towards Cost-based Optimizations of Twig Content-based Queries
    Kratky, Michal
    Baca, Radim
    [J]. DATESO 2008 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 8TH ANNUAL INTERNATIONAL WORKSHOP, 2008, 330 : 71 - 73
  • [5] Parallel holistic twig joins on a multi-core system
    Machdi, Imam
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2010, 6 (02) : 149 - +
  • [6] XML data partitioning schemes for parallel holistic twig joins
    Machdi, Imam
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2009, 5 (02) : 151 - +
  • [7] Scaling Up Set Similarity Joins Using a Cost-Based Distributed-Parallel Framework
    Fier, Fabian
    Freytag, Johann-Christoph
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2021, 2021, 13058 : 17 - 31
  • [8] Efficient holistic twig joins in leaf-to-root combining with root-to-leaf way
    Li, Guoliang
    Feng, Jianhua
    Zhang, Yong
    Zhou, Lizhu
    [J]. ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 834 - +
  • [9] Fast Optimal Twig Joins
    Grimsmo, Nils
    Bjorklund, Truls A.
    Hetland, Magnus Lie
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 894 - 905
  • [10] Elements of cost-based tolerancing
    Youngworth, RN
    Stone, BD
    [J]. OPTICAL REVIEW, 2001, 8 (04) : 276 - 280