Automated Verification of Query Equivalence Using Satisfiability Modulo Theories

被引:15
|
作者
Zhou, Qi [1 ]
Arulraj, Joy [1 ]
Navathe, Shamkant [1 ]
Harris, William [2 ]
Xu, Dong [3 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Galois Inc, Portland, ME USA
[3] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 11期
基金
美国国家科学基金会;
关键词
D O I
10.14778/3342263.3342267
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Database-as-a-service offerings enable users to quickly create and deploy complex data processing pipelines. In practice, these pipelines often exhibit significant overlap of computation due to redundant execution of certain sub-queries. It is challenging for developers and database administrators to manually detect overlap across queries since they may be distributed across teams, organization roles, and geographic locations. Thus, we require automated cloud-scale tools for identifying equivalent queries to minimize computation overlap. State-of-the-art algebraic approaches to automated verification of query equivalence suffer from two limitations. First, they are unable to model the semantics of widely-used SQL features, such as complex query predicates and three-valued logic. Second, they have a computationally intensive verification procedure. These limitations restrict their efficacy and efficiency in cloud-scale database-as-a-service offerings. This paper makes the case for an alternate approach to determining query equivalence based on symbolic representation. The key idea is to effectively transform a wide range of SQL queries into first order logic formulae and then use satisfiability modulo theories to efficiently verify their equivalence. We have implemented this symbolic representation-based approach in EQUITAS. Our evaluation shows that EQUITAS proves the semantic equivalence of a larger set of query pairs compared to algebraic approaches and reduces the verification time by 27 x . We also demonstrate that on a set of 17,461 real-world SQL queries, it automatically identifies redundant execution across 11% of the queries. Our symbolic-representation based technique is currently deployed on Alibaba's MaxCompute database-as-a-service platform.
引用
收藏
页码:1276 / 1288
页数:13
相关论文
共 50 条
  • [21] Combined satisfiability modulo parametric theories
    Krstic, Sava
    Goel, Amit
    Grundy, Jim
    Tinelli, Cesare
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2007, 4424 : 602 - +
  • [22] Satisfiability Modulo Theories: Introduction and Applications
    De Moura, Leonardo
    Bjorner, Nikolaj
    COMMUNICATIONS OF THE ACM, 2011, 54 (09) : 69 - 77
  • [23] An Instantiation Scheme for Satisfiability Modulo Theories
    Mnacho Echenim
    Nicolas Peltier
    Journal of Automated Reasoning, 2012, 48 : 293 - 362
  • [24] A progressive simplifier for satisfiability modulo theories
    Sheini, Hossein M.
    Sakallah, Karem A.
    THEORY AND APPLICATIONS OF SATISFIABILITY TESTING - SAT 2006, PROCEEDINGS, 2006, 4121 : 184 - 197
  • [25] Personnel Scheduling as Satisfiability Modulo Theories
    Erkinger, Christoph
    Musliu, Nysret
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 614 - 621
  • [26] Analog Layout Placement Retargeting using Satisfiability Modulo Theories
    Mohamed, Aya
    Dessouky, Mohamed
    Saif, Sherif M.
    2017 14TH INTERNATIONAL CONFERENCE ON SYNTHESIS, MODELING, ANALYSIS AND SIMULATION METHODS AND APPLICATIONS TO CIRCUIT DESIGN (SMACD), 2017,
  • [27] Using Incomplete Satisfiability Modulo Theories to Determine Robotic Tasks
    Witsch, Andreas
    Skubch, Hendrik
    Niemczyk, Stefan
    Geihs, Kurt
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 4784 - 4789
  • [28] Using Satisfiability Modulo Theories to Analyze Abstract State Machines
    Veanes, Margus
    Saabas, Ando
    ABSTRACT STATE MACHINES, B AND Z, PROCEEDINGS, 2008, 5238 : 355 - 355
  • [29] Verification modulo theories
    Cimatti, Alessandro
    Griggio, Alberto
    Mover, Sergio
    Roveri, Marco
    Tonetta, Stefano
    FORMAL METHODS IN SYSTEM DESIGN, 2022, 60 (03) : 452 - 481
  • [30] Verification modulo theories
    Alessandro Cimatti
    Alberto Griggio
    Sergio Mover
    Marco Roveri
    Stefano Tonetta
    Formal Methods in System Design, 2022, 60 (3) : 452 - 481