Data generation for testing and grading SQL queries

被引:28
|
作者
Chandra, Bikash [1 ]
Chawda, Bhupesh [1 ]
Kar, Biplab [1 ]
Reddy, K. V. Maheshwara [1 ]
Shah, Shetal [1 ]
Sudarshan, S. [1 ]
机构
[1] Indian Inst Technol, Mumbai, Maharashtra, India
来源
VLDB JOURNAL | 2015年 / 24卷 / 06期
关键词
Mutation testing; Test data generation; SQL query grading;
D O I
10.1007/s00778-015-0395-0
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Correctness of SQL queries is usually tested by executing the queries on one or more datasets. Erroneous queries are often the results of small changes or mutations of the correct query. A mutation Q of a query Q is killed by a dataset D if Q(D) Q(D). Earlier work on the XData system showed how to generate datasets that kill all mutations in a class of mutations that included join type and comparison operation mutations. In this paper, we extend the XData data generation techniques to handle a wider variety of SQL queries and a much larger class of mutations. We have also built a system for grading SQL queries using the datasets generated by XData. We present a study of the effectiveness of the datasets generated by the extended XData approach, using a variety of queries including queries submitted by students as part of a database course. We show that the XData datasets outperform predefined datasets as well as manual grading done earlier by teaching assistants, while also avoiding the drudgery of manual correction. Thus, we believe that our techniques will be of great value to database course instructors and TAs, particularly to those of MOOCs. It will also be valuable to database application developers and testers for testing SQL queries.
引用
收藏
页码:731 / 755
页数:25
相关论文
共 50 条
  • [21] Sensitivity Analysis of SQL Queries
    Laud, Peeter
    Pettai, Martin
    Randmets, Jaak
    [J]. PLAS'18: PROCEEDINGS OF THE 13TH WORKSHOP ON PROGRAMMING LANGUAGES AND ANALYSIS FOR SECURITY, 2018, : 2 - 12
  • [22] MAKE BULLETPROOF SQL QUERIES
    LINTHICUM, DS
    [J]. BYTE, 1995, 20 (02): : 111 - 113
  • [23] SQL queries with CASE expressions
    Gryz, Jarek
    Wang, Qiong
    Qian, Xiaoyan
    Zuzarte, Calisto
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 351 - +
  • [24] FORMAL SEMANTICS OF SQL QUERIES
    NEGRI, M
    PELAGATTI, G
    SBATTELLA, L
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1991, 16 (03): : 513 - 534
  • [25] Proving the safety of SQL queries
    Brass, S
    Goldberg, C
    [J]. QSIC 2005: FIFTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, PROCEEDINGS, 2005, : 197 - 204
  • [26] Sphinx: Distributed Execution of Interactive SQL Queries on Big Spatial Data
    Eldawy, Ahmed
    Elganainy, Mostafa
    Bakeer, Ammar
    Abdelmotaleb, Ahmed
    Mokbel, Mohamed
    [J]. 23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,
  • [27] An Appraisal to Optimize SQL Queries
    Myalapalli, Vamsi Krishna
    Shiva, Muddu Butchi
    [J]. 2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [28] Privacy preserving SQL queries
    Park, Hyun -A
    Zhan, Justin
    Lee, Dong Hoon
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INFORMATION SECURITY AND ASSURANCE, 2008, : 549 - +
  • [29] Optimizing star join queries for data warehousing in Microsoft SQL Server
    Galindo-Legaria, Cesar A.
    Grabs, Torsten
    Gukal, Sreenivas
    Herbert, Steve
    Surna, Aleksandras
    Wang, Shirley
    Yu, Wei
    Zabback, Peter
    Zhang, Shin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1190 - 1199
  • [30] An Adaptive Data Partitioning Scheme for Accelerating Exploratory Spark SQL Queries
    Guo, Chenghao
    Wu, Zhigang
    He, Zhenying
    Wang, X. Sean
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 114 - 128