Optimizing large join queries in mediation systems

被引:0
|
作者
Yerneni, R [1 ]
Li, C [1 ]
Ullman, J [1 ]
Garcia-Molina, H [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
来源
DATABASE THEORY - ICDT'99 | 1999年 / 1540卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In data integration systems, queries posed to a mediator need to be translated into a sequence of queries to the underlying data sources. In a heterogeneous environment, with sources of diverse and limited query capabilities, not all the translations are feasible. In this paper, we study the problem of finding feasible and efficient query plans for mediator systems. We consider conjunctive queries on mediators and model the source capabilities through attribute-binding adornments. We use a simple cast model that focuses on the major costs in mediation systems, those involved with sending queries to sources and getting answers back. Under this metric, we develop two algorithms for source query sequencing - one based on a simple greedy strategy and another based on a partitioning scheme. The first algorithm produces optimal plans in some scenarios, and we show a linear bound on its worst case performance when it misses optimal plans. The second algorithm generates optimal plans in more scenarios, while having no bound on the margin by which it misses the optimal plans. We also report on the results of the experiments that study the performance of the two algorithms.
引用
收藏
页码:348 / 364
页数:17
相关论文
共 50 条
  • [21] Optimizing entity join queries when data transmission cost dominates
    Tsai, PSM
    Chen, ALP
    [J]. DATA & KNOWLEDGE ENGINEERING, 1997, 22 (03) : 283 - 308
  • [22] Optimizing Declarative Graph Queries at Large Scale
    Zhang, Qizhen
    Acharya, Akash
    Chen, Hongzhi
    Arora, Simran
    Chen, Ang
    Liu, Vincent
    Loo, Boon Thau
    [J]. SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1411 - 1428
  • [23] Presenting a New Method for Optimizing Join Queries Processing in Heterogeneous Distributed Databases
    Zafarani, Elnaz
    Derakhshi, Mohammad Reza Feizi
    Asil, Hasan
    Asil, Amir
    [J]. THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 379 - 382
  • [24] OPTIMIZATION OF LARGE JOIN QUERIES - COMBINING HEURISTICS AND COMBINATORIAL TECHNIQUES
    SWAMI, A
    [J]. PROCEEDINGS OF THE 1989 ACM SIGMOD INTERNATIONAL CONFERENCE ON THE MANAGEMENT OF DATA, 1989, 18 : 367 - 376
  • [25] SRJA:A Research on Optimizing Top-k Join Queries Based on Spark
    Ren, Hui
    Fu, Haidong
    Xu, Fangfang
    Gu, Jinguang
    Zhao, Di
    [J]. PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 1000 - 1005
  • [26] EFFICIENT OPTIMIZATION OF LARGE JOIN QUERIES USING TABU SEARCH
    MATYSIAK, M
    [J]. INFORMATION SCIENCES, 1995, 83 (1-2) : 77 - 88
  • [27] Optimizing XML Twig Queries in Relational Systems
    Maghaydah, Moad
    Orgun, Mehmet A.
    Khazali, Imad
    [J]. PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 123 - 129
  • [28] Expressing and optimizing sequence queries in database systems
    Sadri, R
    Zaniolo, C
    Zarkesh, A
    Adibi, J
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2004, 29 (02): : 282 - 318
  • [29] In-memory parallelization of join queries over large ontological hierarchies
    Bilidas, Dimitris
    Koubarakis, Manolis
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2021, 39 (03) : 545 - 582
  • [30] In-memory parallelization of join queries over large ontological hierarchies
    Dimitris Bilidas
    Manolis Koubarakis
    [J]. Distributed and Parallel Databases, 2021, 39 : 545 - 582