Translating natural language questions to SQL queries (nested queries)

被引:0
|
作者
Swamidorai, Sindhuja [1 ]
Murthy, T. Satyanarayana [2 ]
Sriharsha, K., V [3 ]
机构
[1] UpGrad, Data Sci, St-1, Bengaluru 500075, Karnataka, India
[2] CBIT, Informat Technol, Hyderabad 560071, Telangana, India
[3] NIT Trichy, Comp Applicat, Tiruchirappalli 620015, Tamil Nadu, India
关键词
Text-to-SQL nested queries; Spider; TEXT-TO-SQL;
D O I
10.1007/s11042-023-16987-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real world questions are generally complex and need the user to extract information from multiple tables in a database using complex SQL queries like nested queries. Though the overall accuracy in translation of Natural Language queries to SQL queries lies close to 75%, the accuracy of complex queries is still quite less, around 60% in the current state-of-the-art models. In this vein, this study proposes to improve the current IRNet framework for translating natural language queries to nested SQL queries, one type of complex queries. Data oversampling is first used to boost the representation of nested queries in order to achieve this goal. Second, a novel loss function that computes the overall loss while accounting for the complexity of SQL, as measured by the quantity of SELECT columns and keywords in the SQL query. The proposed method exhibited a 5% improvement in prediction of hard and extra hard queries when tested on Spider's development dataset.
引用
收藏
页码:45391 / 45405
页数:15
相关论文
共 50 条
  • [21] Translating XPath queries into SPARQL queries
    Droop, M.
    Flarer, M.
    Groppe, J.
    Groppe, S.
    Linnemann, V.
    Pinggeral, J.
    Santner, F.
    Schier, M.
    Schoepf, F.
    Staffler, H.
    Zugal, S.
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2007: OTM 2007 WORKSHOPS, PT 1, PROCEEDINGS, 2007, 4805 : 9 - +
  • [22] Extending the UML concepts to transform natural language queries with fuzzy semantics into SQL
    Tseng, Frank S. C.
    Chen, Chun-Ling
    INFORMATION AND SOFTWARE TECHNOLOGY, 2006, 48 (09) : 901 - 914
  • [23] Efficient processing of nested fuzzy SQL queries in a fuzzy database
    Yang, Q
    Zhang, WN
    Liu, CW
    Wu, J
    Yu, C
    Nakajima, H
    Rishe, ND
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2001, 13 (06) : 884 - 901
  • [24] Translating Place-Related Questions to GeoSPARQL Queries
    Hamzei, Ehsan
    Tomko, Martin
    Winter, Stephan
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 902 - 911
  • [25] Provenance for Natural Language Queries
    Deutch, Daniel
    Frost, Nave
    Gilad, Amir
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05): : 577 - 588
  • [26] Ontology Based Natural Language Queries Transformation into SPARQL Queries
    Askar, Majid
    Algergawy, Alsayed
    Soliman, Taysir Hassan A.
    Koenig-Ries, Birgitta
    Sewisy, Adel A.
    BALTIC JOURNAL OF MODERN COMPUTING, 2020, 8 (04): : 719 - 731
  • [27] Translating synthetic natural language to database queries with a polyglot deep learning framework
    Bazaga, Adrian
    Gunwant, Nupur
    Micklem, Gos
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [28] Translating synthetic natural language to database queries with a polyglot deep learning framework
    Adrián Bazaga
    Nupur Gunwant
    Gos Micklem
    Scientific Reports, 11
  • [29] SQL#: A Language for Maintainable and Debuggable Database Queries
    Hu, Yamin
    Jiang, Hao
    Tang, Hanlin
    Lin, Xin
    Hu, Zongyao
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (05) : 619 - 649
  • [30] NL2pSQL: Generating Pseudo-SQL Queries from Under-Specified Natural Language Questions
    Chen, Fuxiang
    Hwang, Seung-won
    Choo, Jaegul
    Ha, Jung-Woo
    Kim, Sunghun
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2603 - 2613