Towards Text-to-SQL over Aggregate Tables

被引:0
|
作者
Shuqin Li [1 ]
Kaibin Zhou [2 ]
Zeyang Zhuang [2 ]
Haofen Wang [1 ]
Jun Ma [3 ]
机构
[1] College of Design and Innovation, Tongji University
[2] School of Software, Tongji University
[3] School of Automotive Studies, Tongji
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Text-to-SQL aims at translating textual questions into the corresponding SQL queries. Aggregate tables are widely created for high-frequent queries. Although text-to-SQL has emerged as an important task, recent studies paid little attention to the task over aggregate tables. The increased aggregate tables bring two challenges:(1) mapping of natural language questions and relational databases will suffer from more ambiguity,(2) modern models usually adopt self-attention mechanism to encode database schema and question. The mechanism is of quadratic time complexity, which will make inferring more time-consuming as input sequence length grows. In this paper, we introduce a novel approach named WAGG for text-to-SQL over aggregate tables. To effectively select among ambiguous items, we propose a relation selection mechanism for relation computing. To deal with high computation costs, we introduce a dynamical pruning strategy to discard unrelated items that are common for aggregate tables. We also construct a new large-scale dataset Spiderw AGG extended from Spider dataset for validation, where extensive experiments show the effectiveness and efficiency of our proposed method with 4% increase of accuracy and 15% decrease of inference time w.r.t a strong baseline RAT-SQL.
引用
收藏
页码:457 / 474
页数:18
相关论文
共 50 条
  • [1] Towards Text-to-SQL over Aggregate Tables
    Li, Shuqin
    Zhou, Kaibin
    Zhuang, Zeyang
    Wang, Haofen
    Ma, Jun
    DATA INTELLIGENCE, 2023, 5 (02) : 457 - 474
  • [2] DuoRAT: Towards Simpler Text-to-SQL Models
    Scholale, Torsten
    Li, Raymond
    Bandanau, Dzmitry
    de Vries, Harm
    Pal, Chris
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1313 - 1321
  • [3] GuideSQL: Utilizing Tables to Guide the Prediction of Columns for Text-to-SQL Generation
    Wang, Huajie
    Chen, Lei
    Li, Mei
    Chen, Mengnan
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Towards Robustness of Text-to-SQL Models against Synonym Substitution
    Gan, Yujian
    Chen, Xinyun
    Huang, Qiuping
    Purver, Matthew
    Woodward, John R.
    Xie, Jinxia
    Huang, Pengsheng
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 2505 - 2515
  • [5] MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing
    Dou, Longxu
    Gao, Yan
    Pan, Mingyang
    Wang, Dingzirui
    Che, Wanxiang
    Zhan, Dechen
    Lou, Jian-Guang
    Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 2023, 37 : 12745 - 12753
  • [6] On the Vulnerabilities of Text-to-SQL Models
    Peng, Xutan
    Zhang, Yipeng
    Yang, Jingfeng
    Stevenson, Mark
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, ISSRE, 2023, : 1 - 12
  • [7] Global Reasoning over Database Structures for Text-to-SQL Parsing
    Bogin, Ben
    Gardner, Matt
    Berant, Jonathan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3659 - 3664
  • [8] Decoupling SQL query hardness parsing for text-to-SQL
    Yi, Jiawen
    Chen, Guo
    Zhou, Xiaojun
    Neurocomputing, 621
  • [9] Improving Text-to-SQL Evaluation Methodology
    Finegan-Dollak, Catherine
    Kummerfeld, Jonathan K.
    Zhang, Li
    Ramanathan, Karthik
    Sadasivam, Sesh
    Zhang, Rui
    Radev, Dragomir
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 351 - 360
  • [10] Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation
    Guo, Jiaqi
    Zhan, Zecheng
    Gao, Yan
    Xiao, Yan
    Lou, Jian-Guang
    Liu, Ting
    Zhang, Dongmei
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4524 - 4535