SGPT: A Generative Approach for SPARQL Query Generation From Natural Language Questions

被引:7
|
作者
Rony, Md Rashad Al Hasan [1 ,2 ]
Kumar, Uttam [3 ]
Teucher, Roman [1 ]
Kovriguina, Liubov [1 ]
Lehmann, Jens [1 ,2 ]
机构
[1] Fraunhofer Inst Intelligent Anal & Informat Syst, D-01069 Dresden, Germany
[2] Univ Bonn, Smart Data Analyt Res Grp, D-53115 Bonn, Germany
[3] Univ Bonn, Data Sci & Intelligent Syst Grp, D-53115 Bonn, Germany
关键词
Measurement; Linguistics; Syntactics; Resource description framework; Adaptation models; Standards; Training; Knowledge based systems; knowledge graph; information retrieval; query generation; language models;
D O I
10.1109/ACCESS.2022.3188714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SPARQL query generation from natural language questions is complex because it requires an understanding of both the question and underlying knowledge graph (KG) patterns. Most SPARQL query generation approaches are template-based, tailored to a specific knowledge graph and require pipelines with multiple steps, including entity and relation linking. Template-based approaches are also difficult to adapt for new KGs and require manual efforts from domain experts to construct query templates. To overcome this hurdle, we propose a new approach, dubbed SGPT, that combines the benefits of end-to-end and modular systems and leverages recent advances in large-scale language models. Specifically, we devise a novel embedding technique that can encode linguistic features from the question which enables the system to learn complex question patterns. In addition, we propose training techniques that allow the system to implicitly employ the graph-specific information (i.e., entities and relations) into the language model's parameters and generate SPARQL queries accurately. Finally, we introduce a strategy to adapt standard automatic metrics for evaluating SPARQL query generation. A comprehensive evaluation demonstrates the effectiveness of SGPT over state-of-the-art methods across several benchmark datasets.
引用
收藏
页码:70712 / 70723
页数:12
相关论文
共 50 条
  • [41] An Approach for generating best possible questions from the given text using Natural Language Processing
    Vaidya, Kimaya
    Bhagwatkar, Neha
    Singh, Aditi
    Borikar, Sneha
    Padwad, Hirkani
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 271 - 277
  • [42] Speech-to-SQL: toward speech-driven SQL query generation from natural language question
    Song, Yuanfeng
    Wong, Raymond Chi-Wing
    Zhao, Xuefang
    [J]. VLDB JOURNAL, 2024, 33 (04): : 1179 - 1201
  • [43] TOWARDS A NATURAL-LANGUAGE USER-INTERFACE - AN APPROACH OF FUZZY QUERY
    WANG, FJ
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SYSTEMS, 1994, 8 (02): : 143 - 162
  • [44] A Decision-Theoretic Approach to Natural Language Generation
    McKinley, Nathan
    Ray, Soumya
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 552 - 561
  • [45] A Natural-language-based Visual Query Approach of Uncertain Human Trajectories
    Huang, Zhaosong
    Zhao, Ye
    Chen, Wei
    Gao, Shengjie
    Yu, Kejie
    Xu, Weixia
    Tang, Mingjie
    Zhu, Minfeng
    Xu, Mingliang
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) : 1256 - 1266
  • [46] Top-down natural language query approach for embodied conversational agent
    Goh, Ong Sing
    Depickere, Arnold
    Fung, Chun Che
    Wong, Kok Wai
    [J]. IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, 2006, : 470 - +
  • [47] An ASP-based Approach to Answering Natural Language Questions for Texts
    Pendharkar, Dhruva
    Basu, Kinjal
    Shakerin, Farhad
    Gupta, Gopal
    [J]. THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2022, 22 (03) : 419 - 443
  • [49] Optimizing Interpretation Generation in Natural Language Query Answering for Real Time End Users
    Sen, Jaydeep
    Saha, Diptikalyan
    Mittal, Ashish
    Sankaranarayanan, Karthik
    [J]. CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 341 - 349
  • [50] ASPECTS OF THE AUTOMATIC-GENERATION OF SQL STATEMENTS IN A NATURAL-LANGUAGE QUERY INTERFACE
    OTT, N
    [J]. INFORMATION SYSTEMS, 1992, 17 (02) : 147 - 159