Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques

Cited by: 67
Authors
Li, Jiexing [1]
Konig, Arnd Christian [2]
Narasayya, Vivek [2]
Chaudhuri, Surajit [2]
Affiliations
[1] Univ Wisconsin, Dept Comp Sci, 1210 West Dayton St, Madison, WI 53706 USA
[2] Microsoft Res, Redmond, WA 98052 USA
Source
Proceedings of the VLDB Endowment | 2012 / Vol. 5 / No. 11
DOI
10.14778/2350229.2350269
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a database system, such as admission control, query scheduling, and costing during query optimization. Recent work has explored the use of statistical techniques for resource estimation in place of the manually constructed cost models used in query optimization. Such techniques, which require examples of resource usage in queries as training data, offer the promise of superior estimation accuracy since they can account for factors such as hardware characteristics of the system or bias in cardinality estimates. However, the proposed approaches lack robustness in that they do not generalize well to queries that are different from the training examples, resulting in significant estimation errors. Our approach aims to address this problem by combining knowledge of database query processing with statistical models. We model resource usage at the level of individual operators, with different models and features for each operator type, and explicitly model the asymptotic behavior of each operator. This results in significantly better estimation accuracy and the ability to estimate resource usage of arbitrary plans, even when they are very different from the training instances. We validate our approach using various large-scale real-life and benchmark workloads on Microsoft SQL Server.
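The operator-level idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the operator names, the scaling functions in `SCALING`, the single-coefficient model, and the helpers `fit_operator_model` and `estimate_plan` are all assumptions made for illustration. Each operator type gets its own model, and an assumed asymptotic cost shape (linear for a scan, n log n for a sort) is built in, so that a coefficient fitted on small inputs can still be applied to much larger ones.

```python
import math

# Hypothetical scaling functions: the assumed asymptotic cost shape
# for each operator type (illustrative, not taken from the paper).
SCALING = {
    "scan": lambda n: float(n),
    "sort": lambda n: n * math.log2(n) if n > 1 else float(n),
}


def fit_operator_model(op_type, samples):
    """Fit one coefficient c so that usage ~= c * scale(input_rows).

    samples is a list of (input_rows, observed_usage) pairs.
    Uses least squares through the origin: c = sum(s * y) / sum(s * s).
    """
    scale = SCALING[op_type]
    numerator = sum(scale(n) * y for n, y in samples)
    denominator = sum(scale(n) ** 2 for n, _ in samples)
    return numerator / denominator


def estimate_plan(plan, coeffs):
    """Estimate total usage of a plan as the sum of per-operator estimates.

    plan is a list of (op_type, input_rows) pairs.
    """
    return sum(coeffs[op] * SCALING[op](n) for op, n in plan)


# Synthetic training observations on small inputs (made-up numbers).
training = {
    "scan": [(10, 20.0), (100, 200.0)],
    "sort": [(4, 4.0), (16, 32.0)],
}
coeffs = {op: fit_operator_model(op, obs) for op, obs in training.items()}

# Estimate a plan whose inputs are far larger than any training example.
total = estimate_plan([("scan", 1000), ("sort", 1024)], coeffs)
```

Because the scaling function carries the growth behavior, the fitted coefficient only has to capture a system-dependent constant. A fuller model would presumably use many features per operator rather than a single coefficient.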
Pages: 1555-1566
Page count: 12