SE-HCL: Schema Enhanced Hybrid Curriculum Learning for Multi-Turn Text-to-SQL

被引:0
|
作者
Zhang, Yiyun [1 ]
Zhou, Sheng'an [1 ]
Huang, Gengsheng [1 ]
机构
[1] Guangdong Vocat Coll, Inst Elect Informat, Guangzhou 510800, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Data models; Training; Task analysis; Structured Query Language; Context modeling; Transformers; Museums; Natural language processing; Semantics; Curriculum development; Educational courses; semantic parsing; multi-turn text-to-SQL; curriculum learning; THERAPY;
D O I
10.1109/ACCESS.2024.3365522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing multi-turn Text-to-SQL approaches, mainly use data in a randomized order when training the model, ignoring the rich structural information contained in the dialog and schema. In this paper, we propose to use curriculum learning (CL) to better leverage the curriculum structure of schema, query, and dialog for multi-turn question-query pairs. We design a model-agnostic framework named Schema Enhanced Hybrid Curriculum Learning (SE-HCL) for multi-turn Text-to-SQL to help the models gain a full contextual semantic understanding. Concretely, We measure the difficulty of the data from both a structural and model perspective. In terms of data structure, we mainly consider the turns of the question and the complexity of the schema and SQL query. Accordingly, we designed a data course module to dynamically adjust the difficulty of the data based on the convergence of the model and the schema enhancement method we designed. In terms of the model, we propose a scoring module that will judge the difficulty of a problem based on whether the model could solve the question effectively. Finally, we will consider both aspects and design a hybrid curriculum to determine the flow of model training. Our experiments show that our proposed method improves SQL-generated performance over previous state-of-the-art models on SparC and CoSQL, especially for hard and long-turn questions.
引用
下载
收藏
页码:39902 / 39912
页数:11
相关论文
共 7 条
  • [1] Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing
    Wang, Run-Ze
    Ling, Zhen-Hua
    Zhou, Jing-Bo
    Hu, Yu
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13979 - 13987
  • [2] Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL
    Chen, Zhi
    Chen, Lu
    Li, Hanqi
    Cao, Ruisheng
    Ma, Da
    Wu, Mengyue
    Yu, Kai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3063 - 3074
  • [3] A Multiple-Integration Encoder for Multi-Turn Text-to-SQL Semantic Parsing
    Wang, Run-Ze
    Ling, Zhen-Hua
    Zhou, Jing-Bo
    Hu, Yu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 1503 - 1513
  • [4] Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph
    Liu, Aiwei
    Hu, Xuming
    Lin, Li
    Wen, Lijie
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1021 - 1030
  • [5] TSCL-SQL: Two-Stage Curriculum Learning Framework for Text-to-SQL
    尹枫
    程路易
    王秋月
    王志军
    杜明
    徐波
    Journal of Donghua University(English Edition), 2023, 40 (04) : 421 - 427
  • [6] ER-SQL: Learning enhanced representation for Text-to-SQL using table contents
    Guo, Aibo
    Zhao, Xiang
    Ma, Wubin
    NEUROCOMPUTING, 2021, 465 : 359 - 370
  • [7] Multi-Level Curriculum Learning for Multi-Turn Dialogue Generation
    Chen, Guanhua
    Zhan, Runzhe
    Wong, Derek F.
    Chao, Lidia S.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3958 - 3967