Automated Verification of Query Equivalence Using Satisfiability Modulo Theories

被引:15
|
作者
Zhou, Qi [1 ]
Arulraj, Joy [1 ]
Navathe, Shamkant [1 ]
Harris, William [2 ]
Xu, Dong [3 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Galois Inc, Portland, ME USA
[3] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 11期
基金
美国国家科学基金会;
关键词
D O I
10.14778/3342263.3342267
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Database-as-a-service offerings enable users to quickly create and deploy complex data processing pipelines. In practice, these pipelines often exhibit significant overlap of computation due to redundant execution of certain sub-queries. It is challenging for developers and database administrators to manually detect overlap across queries since they may be distributed across teams, organization roles, and geographic locations. Thus, we require automated cloud-scale tools for identifying equivalent queries to minimize computation overlap. State-of-the-art algebraic approaches to automated verification of query equivalence suffer from two limitations. First, they are unable to model the semantics of widely-used SQL features, such as complex query predicates and three-valued logic. Second, they have a computationally intensive verification procedure. These limitations restrict their efficacy and efficiency in cloud-scale database-as-a-service offerings. This paper makes the case for an alternate approach to determining query equivalence based on symbolic representation. The key idea is to effectively transform a wide range of SQL queries into first order logic formulae and then use satisfiability modulo theories to efficiently verify their equivalence. We have implemented this symbolic representation-based approach in EQUITAS. Our evaluation shows that EQUITAS proves the semantic equivalence of a larger set of query pairs compared to algebraic approaches and reduces the verification time by 27 x . We also demonstrate that on a set of 17,461 real-world SQL queries, it automatically identifies redundant execution across 11% of the queries. Our symbolic-representation based technique is currently deployed on Alibaba's MaxCompute database-as-a-service platform.
引用
收藏
页码:1276 / 1288
页数:13
相关论文
共 50 条
  • [41] Automating Elevator Design with Satisfiability Modulo Theories
    Demarchi, Stefano
    Menapace, Marco
    Tacchella, Armando
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 26 - 33
  • [42] Satisfiability Modulo Theories: A Beginner's Tutorial
    Barrett, Clark
    Tinelli, Cesare
    Barbosa, Haniel
    Niemetz, Aina
    Preiner, Mathias
    Reynolds, Andrew
    Zohar, Yoni
    FORMAL METHODS, PT II, FM 2024, 2025, 14934 : 571 - 596
  • [43] Preface to special issue on satisfiability modulo theories
    Alberto Griggio
    Philipp Rümmer
    Formal Methods in System Design, 2017, 51 : 431 - 432
  • [44] A Platform for Placement of Analog Integrated Circuits Using Satisfiability Modulo Theories
    Saif, Sherif M.
    Dessouky, Mohamed
    El-Kharashi, M. Watheq
    Abbas, Hazem
    Nassar, Salwa
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2016, 25 (05)
  • [45] Pareto Front Analog Layout Placement using Satisfiability Modulo Theories
    Saif, Sherif M.
    Dessouky, Mohamed
    El-Kharashi, M. Watheq
    Abbas, Hazem
    Nassar, Salwa
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1411 - 1416
  • [46] Rule-Compliant Trajectory Repairing using Satisfiability Modulo Theories
    Lin, Yuanfei
    Althoff, Matthias
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 449 - 456
  • [47] Consistency Verification of a Rule-Based Smart Home Reasoning System with Satisfiability Modulo Theories
    Mekuria, Dagmawi Neway
    Sernani, Paolo
    Falcionelli, Nicola
    Dragoni, Aldo Franco
    PROCEEDINGS OF THE 2020 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS (IE), 2020, : 52 - 59
  • [48] Towards Automated Strategies in Satisfiability Modulo Theory
    Galvez Ramirez, Nicolas
    Hamadi, Youssef
    Monfroy, Eric
    Saubion, Frederic
    GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 : 230 - 245
  • [49] Satisfiability Modulo Custom Theories in Z3
    Bjorner, Nikolaj
    Eisenhofer, Clemens
    Kovacs, Laura
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, VMCAI 2023, 2023, 13881 : 91 - 105
  • [50] Exploiting Satisfiability Modulo Theories for Analog Layout Automation
    Saif, Sherif M.
    Dessouky, Mohamed
    Nassar, Salwa
    Abbas, Hazem
    El-Kharashi, M. Watheq
    Abdulaziz, Mohammad
    2014 9TH INTERNATIONAL DESIGN & TEST SYMPOSIUM (IDT), 2014, : 73 - 78