Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques

被引:67
|
作者
Li, Jiexing [1 ]
Konig, Arnd Christian [2 ]
Narasayya, Vivek [2 ]
Chaudhuri, Surajit [2 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, 1210 West Dayton St, Madison, WI 53706 USA
[2] Microsoft Res, Redmond, WA 98052 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2012年 / 5卷 / 11期
关键词
D O I
10.14778/2350229.2350269
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a database system such as admission control, query scheduling and costing during query optimization. Recent work has explored the use of statistical techniques for resource estimation in place of the manually constructed cost models used in query optimization. Such techniques, which require as training data examples of resource usage in queries, offer the promise of superior estimation accuracy since they can account for factors such as hardware characteristics of the system or bias in cardinality estimates. However, the proposed approaches lack robustness in that they do not generalize well to queries that are different from the training examples, resulting in significant estimation errors. Our approach aims to address this problem by combining knowledge of database query processing with statistical models. We model resource-usage at the level of individual operators, with different models and features for each operator type, and explicitly model the asymptotic behavior of each operator. This results in significantly better estimation accuracy and the ability to estimate resource usage of arbitrary plans, even when they are very different from the training instances. We validate our approach using various large scale real-life and benchmark workloads on Microsoft SQL Server.
引用
收藏
页码:1555 / 1566
页数:12
相关论文
共 50 条
  • [1] Robust heuristics for scalable optimization of complex SQL queries
    Das, Gopal Chandra
    Haritsa, Jayant R.
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1256 - +
  • [2] Inferring SQL Queries Using Interactivity
    Ahkouk, Karamwh
    Machkour, Mustapha
    antari, Jilali
    [J]. 3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,
  • [3] Optimizer Time Estimation for SQL Queries Extended Abstract
    Ramesh, Bhashyam
    Jaiprakash, C.
    Sankaran, Naveen
    Yasaswi, Jitendra
    [J]. 30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [4] SQL Database Queries to Determine Treatment Planning Resource Usage
    Fox, C.
    Gladstone, D.
    [J]. MEDICAL PHYSICS, 2015, 42 (06) : 3573 - 3573
  • [5] Robust Cardinality: a novel approach for cardinality prediction in SQL queries
    B. S. Praciano, Francisco D.
    Amora, Paulo R. P.
    Abreu, Italo C.
    Pereira, Francisco L. F.
    Machado, Javam C.
    [J]. Journal of the Brazilian Computer Society, 2021, 27 (01)
  • [6] Identifying SQL Queries Similarity Using SVM
    Zahir, Jihad
    El Qadi, Abderrahim
    Bellatreche, Ladjel
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 657 - 666
  • [7] Using Containers to Execute SQL Queries in a Cloud
    Holland, David
    Zhang, Weining
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING COMPANION (UCC COMPANION), 2018, : 26 - 27
  • [8] Optimising SQL Queries Using Genetic Improvement
    Callan, James
    Petke, Justyna
    [J]. 2021 IEEE/ACM INTERNATIONAL WORKSHOP ON GENETIC IMPROVEMENT (GI 2021), 2021, : 9 - 10
  • [9] Inferring Gene Networks using Robust Statistical Techniques
    Nadadoor, Venkat R.
    Ben-Zvi, Amos
    Shah, Sirish L.
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [10] Statistical Matching Techniques Using the Robust Regression Model
    Jhun, Myoungshic
    Jung, Ji Song
    Park, Hye Jin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (06) : 981 - 996