Distributed query evaluation on semistructured data

被引:40
|
作者
Suciu, D
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[2] AT&T Corp, Shannon Labs, New York, NY 10013 USA
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2002年 / 27卷 / 01期
关键词
algorithm; languages; theory; distributed evaluation; nested queries; parallel complexity; regular expressions; semistructured data;
D O I
10.1145/507234.507235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semistructured data is modeled as a rooted, labeled graph. The simplest kinds of queries on such data are those which traverse paths described by regular path expressions. More complex queries combine several regular path expressions, with complex data restructuring, and with sub-queries. This article addresses the problem of efficient query evaluation on distributed, semistructured databases. In our setting, the nodes of the database are distributed over a fixed number of sites, and the edges are classified into local (with both ends in the same site) and cross edges (with ends in two distinct sites). Efficient evaluation in this context means that the number of communication steps is fixed (independent on the data or the query), and that the total amount of data sent depends only on the number of cross links and of the size of the query's result. We give such algorithms in three different settings. First, for the simple case of queries consisting of a single regular expression; second, for all queries in a calculus for graphs based on structural recursion which in addition to regular path expressions can perform nontrivial restructuring of the graph; and third, for a class of queries we call select-where queries that combine pattern matching and regular path expressions with data restructuring and subqueries. This article also includes a discussion on how these methods can be used to derive efficient view maintenance algorithms.
引用
收藏
页码:1 / 62
页数:62
相关论文
共 50 条
  • [1] Query rewriting for semistructured data
    Papakonstantinou, Y
    Vassalos, V
    SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 455 - 466
  • [2] The Lorel query language for semistructured data
    Serge Abiteboul
    Dallan Quass
    Jason McHugh
    Jennifer Widom
    Janet L. Wiener
    International Journal on Digital Libraries, 1997, 1 (1) : 68 - 88
  • [3] Query generation for retrieving data from distributed semistructured documents using a metadata interface
    Choe, Guija
    Nam, Young-Kwang
    Goguen, Joseph
    Wang, Guilian
    COMPUTER LANGUAGES SYSTEMS & STRUCTURES, 2009, 35 (04) : 422 - 434
  • [4] Query Decomposition Strategy for Integration of Semistructured Data
    Handoko
    Getta, J. R.
    16TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS 2014), 2014, : 459 - 463
  • [5] Dynamic Query Scheduling for Online Integration of Semistructured Data
    Handoko
    Getta, Janusz R.
    IEEE 39TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC 2015), VOL 3, 2015, : 375 - 380
  • [6] Distributed Query Evaluation over Encrypted Data
    di Vimercati, Sabrina De Capitani
    Foresti, Sara
    Jajodia, Sushil
    Livraga, Giovanni
    Paraboschi, Stefano
    Samarati, Pierangela
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXV, 2021, 12840 : 96 - 114
  • [7] Schema extracting and query processing for semistructured data in COMMIX
    Wang, T.J.
    Tang, S.W.
    Yang, D.Q.
    Liu, Y.F.
    Tong, Y.H.
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (SUPPL.): : 230 - 236
  • [8] View-based query answering and query containment over semistructured data
    Calvanese, D
    De Giacomo, G
    Lenzerini, M
    Vardi, MY
    DATABASE PROGRAMMING LANGUAGES, 2002, 2397 : 40 - 61
  • [9] Implementing a query language for context-dependent semistructured data
    Stavrakas, Y
    Pristouris, K
    Efandis, A
    Sellis, T
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2004, 3255 : 173 - 188
  • [10] Query Processing using Dynamic Relational Structure for Semistructured Data
    Alom, B. M. Monjurul
    Henskens, Frans
    Hannaford, Michael
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (04): : 104 - 113