An adaptive query execution system for data integration

被引:0
|
作者
Ives, ZG [1 ]
Florescu, D [1 ]
Friedman, M [1 ]
Levy, A [1 ]
Weld, DS [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query processing in data integration occurs over network-bound, autonomous data sources. This requires extensions to traditional optimization and execution techniques for three reasons: there is an absence of quality statistics about the data, data transfer rates are unpredictable and bursty, and slow or unavailable data sources can often be replaced by overlapping or mirrored sources. This paper presents the Tukwila data integration system, designed to support adaptivity at its core using a two-pronged approach. Interleaved planning and execution with partial optimization allows Tukwila to quickly recover from decisions based on inaccurate estimates. During execution, Tukwila uses adaptive query operators such as the double pipelined hash join, which produces answers quickly, and the dynamic collector, which robustly and efficiently computes unions across overlapping data sources. We demonstrate that the Tukwila architecture extends previous innovations in adaptive execution (such as query scrambling, mid-execution re-optimization, and choose nodes), and we present experimental evidence that our techniques result in behavior desirable for a data integration system.
引用
收藏
页码:299 / 310
页数:12
相关论文
共 50 条
  • [1] Self-monitoring query execution for adaptive query processing
    Gounaris, A
    Paton, NW
    Fernandes, AAA
    Sakellariou, R
    [J]. DATA & KNOWLEDGE ENGINEERING, 2004, 51 (03) : 325 - 348
  • [2] Adaptive Concurrent Query Execution Framework for an Analytical In-Memory Database System
    Deshmukh, Harshad
    Memisoglu, Hakan
    Patel, Jignesh M.
    [J]. 2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 23 - 30
  • [3] A Query Simulation System To Illustrate Database Query Execution
    Allenstein, Brett
    Yost, Andrew
    Wagner, Paul
    Morrison, Joline
    [J]. SIGCSE'08: PROCEEDINGS OF THE 39TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2008, : 493 - 497
  • [4] On query execution over encrypted data
    Baby, Tinu
    Cherukuri, Aswani Kumar
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2015, 8 (02) : 321 - 331
  • [5] The researches on the query language of heterogeneous data integration system
    Li, GY
    Huang, H
    Zhang, J
    Xie, YW
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 9 - 12
  • [6] Service-oriented execution model supporting data sharing and adaptive query processing
    Yongwei Wu
    Jia Liu
    Gang Chen
    Qiming Fang
    Guangwen Yang
    [J]. Cluster Computing, 2010, 13 : 127 - 140
  • [7] Service-oriented execution model supporting data sharing and adaptive query processing
    Wu, Yongwei
    Liu, Jia
    Chen, Gang
    Fang, Qiming
    Yang, Guangwen
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2010, 13 (02): : 127 - 140
  • [8] An adaptive query processing mechanism in data stream system
    Song, Baoyan
    Zhang, Lijie
    Yu, Ge
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3113 - 3118
  • [9] Utility-driven adaptive query workload execution
    Paton, Norman W.
    de Aragao, Marcelo A. T.
    Fernandes, Alvaro A. A. A.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (07): : 1070 - 1079
  • [10] ADAPTIVE SELECTION OF QUERY EXECUTION STRATEGIES BY LEARNING AUTOMATA
    TOMPA, FWM
    ICAZA, JI
    [J]. INFORMATION SCIENCES, 1990, 50 (03) : 219 - 240