Translating Natural Language Queries to SQL Using the T5 Model

被引:0
|
作者
Wong, Albert [1 ]
Pham, Lien [1 ]
Lee, Young [2 ]
Chan, Shek [1 ]
Sadaya, Razel [1 ]
Khmelevsky, Youry [3 ]
Clement, Mathias [3 ]
Cheng, Florence Wing Yau [1 ]
Mahony, Joe [4 ]
Ferri, Michael [4 ]
机构
[1] Langara Coll, Math & Stat, Vancouver, BC, Canada
[2] Okanagan Coll, Math & Stat, Kelowna, BC, Canada
[3] Okanagan Coll, Comp Sci, Kelowna, BC, Canada
[4] Harris SmartWorks, Res & Dev, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Natural Language Processing; Data Query System; Text-to-SQL; Speech-to-SQL; Deep Learning; Machine Learning; T5; Model; Human-Machine-Systems; Energy Systems;
D O I
10.1109/SysCon61195.2024.10553509
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents the development process of a natural language to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73% and 84% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.
引用
下载
收藏
页数:7
相关论文
共 50 条
  • [41] Problematic Unordered Queries in Temporal Moment Measurement by Using Natural Language
    Nawaz, Hafiza Sadia
    Dong, Junyu
    IEEE ACCESS, 2023, 11 : 37976 - 37986
  • [42] Reformulating natural language queries using sequence-to-sequence models
    Xiaoyu LIU
    Shunda PAN
    Qi ZHANG
    Yu-Gang JIANG
    Xuanjing HUANG
    Science China(Information Sciences), 2019, 62 (12) : 254 - 256
  • [43] INTERPRETATION OF NATURAL-LANGUAGE DATABASE QUERIES USING OPTIMIZATION METHODS
    LEIGH, W
    EVANS, J
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1986, 16 (01): : 40 - 52
  • [44] Reformulating natural language queries using sequence-to-sequence models
    Xiaoyu Liu
    Shunda Pan
    Qi Zhang
    Yu-Gang Jiang
    Xuanjing Huang
    Science China Information Sciences, 2019, 62
  • [45] INTERPRETATION OF NATURAL LANGUAGE DATABASE QUERIES USING OPTIMIZATION METHODS.
    Leigh, William
    Evans, James
    IEEE Transactions on Systems, Man and Cybernetics, 1986, SMC-16 (01): : 40 - 52
  • [46] Converting Complex Natural Language Query to SQL Based on Tree Representation Model
    Zhao M.
    Chen K.
    Shou L.-D.
    Wu S.
    Chen G.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (12): : 4727 - 4745
  • [47] SExpSMA-based T5: Serial exponential-slime mould algorithm based T5 model for question answer and distractor generation
    Bhuvan, Nikhila T.
    Jisha, G.
    Shamna, N., V
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2024, 18 (02): : 1447 - 1462
  • [48] T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting
    Nobel, J. Martijn
    Puts, Sander
    Weiss, Jakob
    Aerts, Hugo J. W. L.
    Mak, Raymond H.
    Robben, Simon G. F.
    Dekker, Andre L. A. J.
    INSIGHTS INTO IMAGING, 2021, 12 (01)
  • [49] T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting
    J. Martijn Nobel
    Sander Puts
    Jakob Weiss
    Hugo J. W. L. Aerts
    Raymond H. Mak
    Simon G. F. Robben
    André L. A. J. Dekker
    Insights into Imaging, 12
  • [50] Enhancement of Natural Language to SQL Query Conversion using Machine Learning Techniques
    Prasad, Akshar
    Badhya, Sourabh S.
    Yashwanth, Y. S.
    Rohan, Shetty
    Shobha, G.
    Deepamala, N.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (12) : 494 - 503