Translating natural language questions to SQL queries (nested queries)

被引:0
|
作者
Swamidorai, Sindhuja [1 ]
Murthy, T. Satyanarayana [2 ]
Sriharsha, K., V [3 ]
机构
[1] UpGrad, Data Sci, St-1, Bengaluru 500075, Karnataka, India
[2] CBIT, Informat Technol, Hyderabad 560071, Telangana, India
[3] NIT Trichy, Comp Applicat, Tiruchirappalli 620015, Tamil Nadu, India
关键词
Text-to-SQL nested queries; Spider; TEXT-TO-SQL;
D O I
10.1007/s11042-023-16987-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real world questions are generally complex and need the user to extract information from multiple tables in a database using complex SQL queries like nested queries. Though the overall accuracy in translation of Natural Language queries to SQL queries lies close to 75%, the accuracy of complex queries is still quite less, around 60% in the current state-of-the-art models. In this vein, this study proposes to improve the current IRNet framework for translating natural language queries to nested SQL queries, one type of complex queries. Data oversampling is first used to boost the representation of nested queries in order to achieve this goal. Second, a novel loss function that computes the overall loss while accounting for the complexity of SQL, as measured by the quantity of SELECT columns and keywords in the SQL query. The proposed method exhibited a 5% improvement in prediction of hard and extra hard queries when tested on Spider's development dataset.
引用
收藏
页码:45391 / 45405
页数:15
相关论文
共 50 条
  • [1] Translating natural language questions to SQL queries (nested queries)
    Sindhuja Swamidorai
    T Satyanarayana Murthy
    K V Sriharsha
    [J]. Multimedia Tools and Applications, 2024, 83 : 45391 - 45405
  • [2] Translating Web Search Queries into Natural Language Questions
    Kumar, Adarsh
    Dandapat, Sandipan
    Chordia, Sushil
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 944 - 947
  • [3] A system to transform natural language queries into SQL queries
    Solanki A.
    Kumar A.
    [J]. International Journal of Information Technology, 2022, 14 (1) : 437 - 446
  • [4] ATHENA++: Natural Language Querying for Complex Nested SQL Queries
    Sen, Jaydeep
    Lei, Chuan
    Quamar, Abdul
    Özcan, Fatma
    Efthymiou, Vasilis
    Dalmia, Ayushi
    Stager, Greg
    Mittal, Ashish
    Saha, Diptikalyan
    Sankaranarayanan, Karthik
    [J]. Proceedings of the VLDB Endowment, 2020, 13 (11): : 2747 - 2759
  • [5] Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries
    Giordani, Alessandra
    Moschitti, Alessandro
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2336 - 2339
  • [6] Translating Natural Language Queries to SQL Using the T5 Model
    Wong, Albert
    Pham, Lien
    Lee, Young
    Chan, Shek
    Sadaya, Razel
    Khmelevsky, Youry
    Clement, Mathias
    Cheng, Florence Wing Yau
    Mahony, Joe
    Ferri, Michael
    [J]. 18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
  • [7] L2S: Transforming natural language questions into SQL queries
    Duc Tam Hoang
    Minh Le Nguyen
    Son Bao Pham
    [J]. 2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 85 - 90
  • [8] Semantic Mapping between Natural Language Questions and SQL Queries via Syntactic Pairing
    Giordani, Alessandra
    Moschitti, Alessandro
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 5723 : 207 - 221
  • [9] QUESTION PATTERNS FOR NATURAL LANGUAGE TRANSLATION IN SQL QUERIES
    Zhekova, Mariya
    Totkov, George
    [J]. INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2021, 13 (02): : 43 - 54
  • [10] ACL-SQL: Generating SQL Queries from Natural Language
    Kaoshik, Ronak
    Patil, Rohit
    Prakash, R.
    Agarawal, Shaurya
    Jain, Naman
    Singh, Mayank
    [J]. CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 423 - 423