Data generation for testing and grading SQL queries

Cited by: 28
Authors
Chandra, Bikash [1]
Chawda, Bhupesh [1]
Kar, Biplab [1]
Reddy, K. V. Maheshwara [1]
Shah, Shetal [1]
Sudarshan, S. [1]
Affiliation
[1] Indian Inst Technol, Mumbai, Maharashtra, India
Source
VLDB JOURNAL | 2015 / Vol. 24 / No. 06
Keywords
Mutation testing; Test data generation; SQL query grading
DOI
10.1007/s00778-015-0395-0
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Correctness of SQL queries is usually tested by executing the queries on one or more datasets. Erroneous queries are often the results of small changes or mutations of the correct query. A mutation Q' of a query Q is killed by a dataset D if Q(D) ≠ Q'(D). Earlier work on the XData system showed how to generate datasets that kill all mutations in a class of mutations that included join type and comparison operation mutations. In this paper, we extend the XData data generation techniques to handle a wider variety of SQL queries and a much larger class of mutations. We have also built a system for grading SQL queries using the datasets generated by XData. We present a study of the effectiveness of the datasets generated by the extended XData approach, using a variety of queries including queries submitted by students as part of a database course. We show that the XData datasets outperform predefined datasets as well as manual grading done earlier by teaching assistants, while also avoiding the drudgery of manual correction. Thus, we believe that our techniques will be of great value to database course instructors and TAs, particularly to those of MOOCs. It will also be valuable to database application developers and testers for testing SQL queries.
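The kill condition in the abstract can be illustrated with a small sketch. This is not the XData data generation algorithm; it only checks whether a given dataset distinguishes a query from one join-type mutant. The tables `student` and `takes`, the sample rows, and the helper names `run` and `kills` are all hypothetical and chosen for illustration.

```python
import sqlite3
from collections import Counter


def run(query, students, takes):
    """Run a query on a fresh in-memory database; return the result as a multiset."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE student (id INTEGER, name TEXT)")
    conn.execute("CREATE TABLE takes (id INTEGER, course TEXT)")
    conn.executemany("INSERT INTO student VALUES (?, ?)", students)
    conn.executemany("INSERT INTO takes VALUES (?, ?)", takes)
    result = Counter(conn.execute(query).fetchall())
    conn.close()
    return result


def kills(dataset, query, mutant):
    """A dataset D kills the mutant Q' of Q when Q(D) differs from Q'(D)."""
    students, takes = dataset
    return run(query, students, takes) != run(mutant, students, takes)


# Hypothetical correct query and a join-type mutation of it.
Q = "SELECT s.name, t.course FROM student s JOIN takes t ON s.id = t.id"
Q_MUTANT = (
    "SELECT s.name, t.course FROM student s "
    "LEFT OUTER JOIN takes t ON s.id = t.id"
)

# D1: every student has a matching takes row, so both queries agree.
D1 = ([(1, "Asha")], [(1, "DB")])
# D2: student 2 has no takes row, so only the outer join returns a row for them.
D2 = ([(1, "Asha"), (2, "Ravi")], [(1, "DB")])

print(kills(D1, Q, Q_MUTANT))  # False
print(kills(D2, Q, Q_MUTANT))  # True
```

In this sketch, D1 fails to expose the mutant because no student lacks a matching row, while D2 contains such a student and therefore kills it. Results are compared as multisets so that row order does not matter but duplicate counts do.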
Pages: 731-755
Page count: 25
Related papers
50 in total
  • [1] Data generation for testing and grading SQL queries
    Bikash Chandra
    Bhupesh Chawda
    Biplab Kar
    K. V. Maheshwara Reddy
    Shetal Shah
    S. Sudarshan
    [J]. The VLDB Journal, 2015, 24 : 731 - 755
  • [2] Automated Grading of SQL Queries
    Chandra, Bikash
    Banerjee, Ananyo
    Hazra, Udbhas
    Joseph, Mathew
    Sudarshan, S.
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1630 - 1633
  • [3] Edit Based Grading of SQL Queries
    Chandra, Bikash
    Banerjee, Ananyo
    Hazra, Udbhas
    Joseph, Mathew
    Sudarshan, S.
    [J]. CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 56 - 64
  • [4] Partial Marking for Automated Grading of SQL Queries
    Chandra, Bikash
    Joseph, Mathew
    Radhakrishnan, Bharath
    Acharya, Shreevidhya
    Sudarshan, S.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1541 - 1544
  • [5] Automatic Generation of SQL Queries
    Do, Quan
    Agrawal, Rajeev K.
    Rao, Dhana
    Gudivada, Venkat N.
    [J]. 2014 ASEE ANNUAL CONFERENCE, 2014,
  • [6] Search-Based Test Data Generation for SQL Queries
    Castelein, Jeroen
    Aniche, Mauricio
    Soltani, Mozhan
    Panichella, Annibale
    van Deursen, Arie
    [J]. PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, : 1220 - 1230
  • [7] Evolving SQL queries for data mining
    Salim, M
    Yao, X
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 62 - 67
  • [8] Data Provenance for Recursive SQL Queries
    Dietrich, Benjamin
    Mueller, Tobias
    Grust, Torsten
    [J]. PROCEEDINGS OF 14TH INTERNATIONAL WORKSHOP ON THE THEORY AND PRACTICE OF PROVENANCE, TAPP 2022, 2022, : 48 - 55
  • [9] Populating Test Databases for Testing SQL Queries
    Suarez-Cabal, M. J.
    de la Riva, C.
    Tuya, J.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2010, 8 (02) : 164 - 171
  • [10] Structural Coverage Criteria for Testing SQL Queries
    Jose Suarez-Cabal, Ma
    Tuya, Javier
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2009, 15 (03) : 584 - 619