An SQL query generator for cross-domain human language based questions based on NLP model

被引:3
|
作者
Naik, B. Balaji [1 ]
Reddy, T. Jaya Venkata Rama [2 ]
Karthik, K. Rohith Venkata [2 ]
Kuila, Pratyay [2 ]
机构
[1] Natl Inst Technol Patna, Comp Sci & Engn, Patna 800005, Bihar, India
[2] Natl Inst Technol Sikkim, Comp Sci & Engn, Ravangla 737139, Sikkim, India
关键词
SQL; Sentence-Table Encoder; Table-aware Decoder; Sparc; Spider; CoSQL; INTERFACE;
D O I
10.1007/s11042-023-15731-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of data generated in the modern world is so great that data lakes are now being used to store data. However, relational databases are currently the primary repository for the world's data. However, it is very time-consuming for a user to type each query every time, especially the queries that include complex keywords. Our proposed approach uses the interaction history by altering the preceding projected query to improve the generation quality, based on the finding that successive human language queries are frequently lin- guistically dependent, and their equivalent SQL queries overlap. This paper focuses on text-to-SQL conversion for cross-domain datasets. Our approach reuses results produced at the token level and considers SQL statements as sequences. Finally, we evaluate our approach on different datasets like the Sparc, Spider, and CoSQL datasets. It compared our proposed approach with existing famous algorithms like Seq2seq, and added attention and copying to the seq2seq model, SQLNet model, and TypeSQL model in terms of accuracy and F1 score.
引用
收藏
页码:11861 / 11884
页数:24
相关论文
共 50 条
  • [41] SQL EER - SYNTAX AND SEMANTICS OF AN ENTITY-RELATIONSHIP-BASED QUERY LANGUAGE
    HOHENSTEIN, U
    ENGELS, G
    INFORMATION SYSTEMS, 1992, 17 (03) : 209 - 242
  • [42] Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
    Pal, Debaditya
    Sharma, Harsh
    Chaudhuri, Kaustubh
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [43] Semantic clustering-based cross-domain recommendation
    Kumar, Anil
    Kumar, Nitesh
    Hussain, Muzammil
    Chaudhury, Santanu
    Agarwal, Sumeet
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2014, : 137 - 141
  • [44] A Cross-Domain Recommendation Algorithm Based On Graph Optimization
    Fan, Zheng
    Wang, Ying-Li
    Ma, Qi-Tao
    Du, Hai-Xia
    Ma, Hong-Bin
    Journal of Network Intelligence, 2023, 8 (03): : 856 - 868
  • [45] Cross-Domain Item Recommendation Based on User Similarity
    Xu, Zhenzhen
    Jiang, Huizhen
    Kong, Xiangjie
    Kang, Jialiang
    Wang, Wei
    Xia, Feng
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 13 (02) : 359 - 373
  • [46] Cross-domain recommendation based on latent factor alignment
    Yu, Xu
    Hu, Qiang
    Li, Hui
    Du, Junwei
    Gao, Jia
    Sun, Lijun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05): : 3421 - 3432
  • [47] Deepfake Detection Method Based on Cross-Domain Fusion
    Sun, Fang
    Zhang, Niuniu
    Xu, Pan
    Song, Zengren
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [48] The implementation of cross-domain SSO based on distributed authentication
    Li, Nie
    Lu, Jiguang
    DCABES 2007 Proceedings, Vols I and II, 2007, : 561 - 563
  • [49] Cross-Domain Data Traceability Mechanism Based on Blockchain
    Zhao, Shoucai
    Cao, Lifeng
    Li, Jinhui
    Wan, Jiling
    Bai, Jinlong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2531 - 2549
  • [50] Cross-domain graph based similarity measurement of workflows
    Koohi-Var T.
    Zahedi M.
    Journal of Big Data, 5 (1)