Join synopses for approximate query answering

被引:0
|
作者
Acharya, S [1 ]
Gibbons, PB [1 ]
Poosala, V [1 ]
Ramaswamy, S [1 ]
机构
[1] AT&T Bell Labs, Informat Sci Res Ctr, Murray Hill, NJ 07974 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we demonstrate the difficulty of providing good approximate answers for join-queries using only statistics (in particular, samples) from the base relations. We propose join synopses as an effective solution for this problem and show how precomputing just one join synopsis for each relation suffices to significantly improve the quality of approximate answers for arbitrary queries with foreign key joins. We present optimal strategies for allocating the available space among the various join synopses when the query work load is known and identify heuristics for the common case when the work load is not known. We also present efficient algorithms for incrementally maintaining join synopses in the presence of updates to the base relations. Our extensive set of experiments on the TPC-D benchmark database show the effectiveness of join synopses and various other techniques proposed in this paper.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [1] TuG Synopses for Approximate Query Answering
    Spiegel, Joshua
    Polyzotis, Neoklis
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (01):
  • [2] Analytical synopses for approximate query answering in OLAP environments
    Cuzzocrea, A
    Matrangolo, U
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, 3180 : 359 - 370
  • [3] Query Similarity for Approximate Query Answering
    Kantere, Verena
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT II, 2016, 9828 : 355 - 367
  • [4] Synopses for Efficient and Reliable Approximate Query Processing
    Liang, Xi
    ProQuest Dissertations and Theses Global, 2022,
  • [5] Benchmarking Approximate Consistent Query Answering
    Calautti, Marco
    Console, Marco
    Pieris, Andreas
    PODS '21: PROCEEDINGS OF THE 40TH SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2021, : 233 - 246
  • [6] The aqua approximate query answering system
    Acharya, S
    Gibbons, PB
    Poosala, V
    Ramaswamy, S
    SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 574 - 576
  • [7] Approximate query answering by model averaging
    Pavlov, D
    Smyth, P
    PROCEEDINGS OF THE THIRD SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2003, : 142 - 153
  • [8] Approximate query answering in numerical databases
    Hachem, N
    Bao, CY
    Taylor, S
    EIGHTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE SYSTEMS, PROCEEDINGS, 1996, : 63 - 73
  • [9] Benchmark for Approximate Query Answering Systems
    Di Tria, Francesco
    Lefons, Ezio
    Tangorra, Filippo
    JOURNAL OF DATABASE MANAGEMENT, 2015, 26 (01) : 1 - 29
  • [10] An Evolutionary Perspective on Approximate RDF Query Answering
    Gueret, Christophe
    Oren, Eyal
    Schlobach, Stefan
    Schut, Martijn
    SCALABLE UNCERTAINTY MANAGEMENT, SUM 2008, 2008, 5291 : 215 - 228