Enhancing SPARQL Query Generation for Knowledge Base Question Answering Systems by Learning to Correct Triplets

被引:0
|
作者
Qi, Jiexing [1 ]
Su, Chang [1 ]
Guo, Zhixin [1 ]
Wu, Lyuwen [1 ]
Shen, Zanwei [1 ]
Fu, Luoyi [1 ]
Wang, Xinbing [1 ]
Zhou, Chenghu [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 04期
关键词
Knowledge Base Question Answering; Text-to-SPARQL; semantic parsing; further pretraining; Triplet Structure;
D O I
10.3390/app14041521
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Generating SPARQL queries from natural language questions is challenging in Knowledge Base Question Answering (KBQA) systems. The current state-of-the-art models heavily rely on fine-tuning pretrained models such as T5. However, these methods still encounter critical issues such as triple-flip errors (e.g., (subject, relation, object) is predicted as (object, relation, subject)). To address this limitation, we introduce TSET (Triplet Structure Enhanced T5), a model with a novel pretraining stage positioned between the initial T5 pretraining and the fine-tuning for the Text-to-SPARQL task. In this intermediary stage, we introduce a new objective called Triplet Structure Correction (TSC) to train the model on a SPARQL corpus derived from Wikidata. This objective aims to deepen the model's understanding of the order of triplets. After this specialized pretraining, the model undergoes fine-tuning for SPARQL query generation, augmenting its query-generation capabilities. We also propose a method named "semantic transformation" to fortify the model's grasp of SPARQL syntax and semantics without compromising the pre-trained weights of T5. Experimental results demonstrate that our proposed TSET outperforms existing methods on three well-established KBQA datasets: LC-QuAD 2.0, QALD-9 plus, and QALD-10, establishing a new state-of-the-art performance (95.0% F1 and 93.1% QM on LC-QuAD 2.0, 75.85% F1 and 61.76% QM on QALD-9 plus, 51.37% F1 and 40.05% QM on QALD-10).
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Knowledge Base Question Answering via Structured Query Generation using Question domain
    Li, Jiecheng
    Peng, Zizhen
    Zhu, Xiaoying
    Lu, Keda
    2022 IEEE 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS, IUCC/CIT/DSCI/SMARTCNS, 2022, : 394 - 400
  • [2] Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base
    Yih, Wen-tau
    Chang, Ming-Wei
    He, Xiaodong
    Gao, Jianfeng
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1321 - 1331
  • [3] Towards Knowledge Graph-Agnostic SPARQL Query Validation for Improving Question Answering
    Perevalov, Aleksandr
    Gashkov, Aleksandr
    Eltsova, Maria
    Both, Andreas
    SEMANTIC WEB: ESWC 2022 SATELLITE EVENTS, 2022, 13384 : 78 - 82
  • [4] SPARQL-QA-v2 system for Knowledge Base Question Answering
    Borroto, Manuel A.
    Ricca, Francesco
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [5] Staged query graph generation based on answer type for question answering over knowledge base
    Chen, Haoyuan
    Ye, Fei
    Fan, Yuankai
    He, Zhenying
    Jing, Yinan
    Zhang, Kai
    Wang, X. Sean
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [6] Question Answering over Knowledge Graphs with Query Path Generation
    Yang, Linqing
    Guo, Kecen
    Liu, Bo
    Gong, Jiazheng
    Zhang, Zhujian
    Zhao, Peiyu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 146 - 158
  • [7] Formal Query Generation for Question Answering over Knowledge Bases
    Zafar, Hamid
    Napolitano, Giulio
    Lehmann, Jens
    SEMANTIC WEB (ESWC 2018), 2018, 10843 : 714 - 728
  • [8] Natural language question answering over knowledge graph: the marriage of SPARQL query and keyword search
    Hu, Xin
    Duan, Jiangli
    Dang, Depeng
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 819 - 844
  • [9] Natural language question answering over knowledge graph: the marriage of SPARQL query and keyword search
    Xin Hu
    Jiangli Duan
    Depeng Dang
    Knowledge and Information Systems, 2021, 63 : 819 - 844
  • [10] Knowledge Base Question Answering via Encoding of Complex Query Graphs
    Luo, Kangqi
    Lin, Fengli
    Luo, Xusheng
    Zhu, Kenny Q.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2185 - 2194