Automated Verification of Query Equivalence Using Satisfiability Modulo Theories

被引:15
|
作者
Zhou, Qi [1 ]
Arulraj, Joy [1 ]
Navathe, Shamkant [1 ]
Harris, William [2 ]
Xu, Dong [3 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Galois Inc, Portland, ME USA
[3] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 11期
基金
美国国家科学基金会;
关键词
D O I
10.14778/3342263.3342267
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Database-as-a-service offerings enable users to quickly create and deploy complex data processing pipelines. In practice, these pipelines often exhibit significant overlap of computation due to redundant execution of certain sub-queries. It is challenging for developers and database administrators to manually detect overlap across queries since they may be distributed across teams, organization roles, and geographic locations. Thus, we require automated cloud-scale tools for identifying equivalent queries to minimize computation overlap. State-of-the-art algebraic approaches to automated verification of query equivalence suffer from two limitations. First, they are unable to model the semantics of widely-used SQL features, such as complex query predicates and three-valued logic. Second, they have a computationally intensive verification procedure. These limitations restrict their efficacy and efficiency in cloud-scale database-as-a-service offerings. This paper makes the case for an alternate approach to determining query equivalence based on symbolic representation. The key idea is to effectively transform a wide range of SQL queries into first order logic formulae and then use satisfiability modulo theories to efficiently verify their equivalence. We have implemented this symbolic representation-based approach in EQUITAS. Our evaluation shows that EQUITAS proves the semantic equivalence of a larger set of query pairs compared to algebraic approaches and reduces the verification time by 27 x . We also demonstrate that on a set of 17,461 real-world SQL queries, it automatically identifies redundant execution across 11% of the queries. Our symbolic-representation based technique is currently deployed on Alibaba's MaxCompute database-as-a-service platform.
引用
收藏
页码:1276 / 1288
页数:13
相关论文
共 50 条
  • [1] Solving quantified verification conditions using satisfiability modulo theories
    Ge, Yeting
    Barrett, Clark
    Tinelli, Cesare
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2009, 55 (1-2) : 101 - 122
  • [2] Solving quantified verification conditions using satisfiability modulo theories
    Yeting Ge
    Clark Barrett
    Cesare Tinelli
    Annals of Mathematics and Artificial Intelligence, 2009, 55 : 101 - 122
  • [3] Using Satisfiability Modulo Theories for Inductive Verification of Lustre Programs
    Franzen, Anders
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2006, 144 (01) : 19 - 33
  • [4] Solving quantified verification conditions using satisfiability modulo theories
    Ge, Yeting
    Barrett, Clark
    Tinelli, Cesare
    AUTOMATED DEDUCTION - CADE-21, PROCEEDINGS, 2007, 4603 : 167 - +
  • [5] Tender System Verification with Satisfiability Modulo Theories
    Davila, Rene
    Aldeco-Perez, Rocio
    Barcenas, Everardo
    2021 9TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION (CONISOFT 2021), 2021, : 69 - 78
  • [6] Accelerating temporal verification of Simulink diagrams using satisfiability modulo theories
    Petr Bauch
    Vojtěch Havel
    Jiří Barnat
    Software Quality Journal, 2016, 24 : 37 - 63
  • [7] Practical Approach in Verification of Security Systems Using Satisfiability Modulo Theories
    Zbrzezny, Agnieszka M.
    Szymoniak, Sabina
    Kurkowski, Miroslaw
    LOGIC JOURNAL OF THE IGPL, 2022, 30 (02) : 289 - 300
  • [8] Accelerating temporal verification of Simulink diagrams using satisfiability modulo theories
    Bauch, Petr
    Havel, Vojtech
    Barnat, Jiri
    SOFTWARE QUALITY JOURNAL, 2016, 24 (01) : 37 - 63
  • [9] Satisfiability modulo theories
    Barrett, Clark
    Sebastiani, Roberto
    Seshia, Sanjit A.
    Tinelli, Cesare
    Frontiers in Artificial Intelligence and Applications, 2009, 185 (01) : 825 - 885
  • [10] From propositional satisfiability to satisfiability modulo theories
    Sheini, Hossein M.
    Sakallah, Karem A.
    THEORY AND APPLICATIONS OF SATISFIABILITY TESTING - SAT 2006, PROCEEDINGS, 2006, 4121 : 1 - 9