Distributed query evaluation on semistructured data

被引:40
|
作者
Suciu, D
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[2] AT&T Corp, Shannon Labs, New York, NY 10013 USA
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2002年 / 27卷 / 01期
关键词
algorithm; languages; theory; distributed evaluation; nested queries; parallel complexity; regular expressions; semistructured data;
D O I
10.1145/507234.507235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semistructured data is modeled as a rooted, labeled graph. The simplest kinds of queries on such data are those which traverse paths described by regular path expressions. More complex queries combine several regular path expressions, with complex data restructuring, and with sub-queries. This article addresses the problem of efficient query evaluation on distributed, semistructured databases. In our setting, the nodes of the database are distributed over a fixed number of sites, and the edges are classified into local (with both ends in the same site) and cross edges (with ends in two distinct sites). Efficient evaluation in this context means that the number of communication steps is fixed (independent on the data or the query), and that the total amount of data sent depends only on the number of cross links and of the size of the query's result. We give such algorithms in three different settings. First, for the simple case of queries consisting of a single regular expression; second, for all queries in a calculus for graphs based on structural recursion which in addition to regular path expressions can perform nontrivial restructuring of the graph; and third, for a class of queries we call select-where queries that combine pattern matching and regular path expressions with data restructuring and subqueries. This article also includes a discussion on how these methods can be used to derive efficient view maintenance algorithms.
引用
收藏
页码:1 / 62
页数:62
相关论文
共 50 条
  • [21] Towards a declarative query and transformation language for XML and semistructured data: Simulation unification
    Bry, F
    Schaffert, S
    LOGICS PROGRAMMING, PROCEEDINGS, 2002, 2401 : 255 - 270
  • [22] Minimizing data transfers in distributed query processing: A comparative study and evaluation
    Morrissey, JM
    Bealor, WT
    COMPUTER JOURNAL, 1996, 39 (08): : 675 - 687
  • [23] TopX:: efficient and versatile top-k query processing for semistructured data
    Theobald, Martin
    Bast, Holger
    Majumdar, Debapriyo
    Schenkel, Ralf
    Weikum, Gerhard
    VLDB JOURNAL, 2008, 17 (01): : 81 - 115
  • [24] A Query Engine for Distributed Query Processing on Linked Data
    Magalhaes, Regis Pires
    Monteiro, Jose Maria
    Vidal, Vania M. P.
    de Macedo, Jose A. F.
    Maia, Macedo
    Porto, Fabio
    Casanova, Marco A.
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 185 - 192
  • [25] Distributed Query Processing and Data Sharing
    Roy, Ahana
    Olmsted, Aspen
    2017 12TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2017, : 221 - 224
  • [26] Authorization enforcement in distributed query evaluation
    di Vimercati, Sabrina
    Foresti, Sara
    Jajodia, Sushil
    Paraboschi, Stefano
    Samarati, Pierangela
    JOURNAL OF COMPUTER SECURITY, 2011, 19 (04) : 751 - 794
  • [27] Efficient query evaluation techniques over large amount of distributed linked data
    Kalogeros, Eleftherios
    Gergatsoulis, Manolis
    Damigos, Matthew
    Nomikos, Christos
    INFORMATION SYSTEMS, 2023, 115
  • [28] Evaluation of Distributed Query-Based Monitoring over Data Distribution Service
    Bur, Marton
    Varro, Daniel
    2019 IEEE 5TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2019, : 674 - 679
  • [29] Accelerating Partial Evaluation in Distributed SPARQL Query Evaluation
    Peng, Peng
    Zou, Lei
    Guan, Runyu
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 112 - 123
  • [30] DataGuides: Enabling query formulation and optimization in semistructured databases
    Goldman, R
    Widom, J
    PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, 1997, : 436 - 445