An empirical study of pre-trained language models in simple knowledge graph question answering

Cited by: 9
Authors
Hu, Nan [1 ]
Wu, Yike [1 ]
Qi, Guilin [1 ]
Min, Dehai [1 ]
Chen, Jiaoyan [2 ]
Pan, Jeff Z. [3 ]
Ali, Zafar [1 ]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, 2 Dongda Rd, Nanjing 211189, Jiangsu, Peoples R China
[2] Univ Manchester, Dept Comp Sci, Oxford Rd, Manchester M13 9PL, England
[3] Univ Edinburgh, Sch Informat, 10 Crichton St, Edinburgh EH8 9AB, Scotland
Keywords
Knowledge graph question answering; Pretrained language models; Accuracy and efficiency; Scalability;
DOI
10.1007/s11280-023-01166-y
CLC Number
TP [Automation Technology, Computer Technology];
Discipline Code
0812;
Abstract
Large-scale pre-trained language models (PLMs) such as BERT have recently achieved great success and become a milestone in natural language processing (NLP). Adopting PLMs as the backbone for downstream tasks is now the consensus of the NLP community, and in recent work on knowledge graph question answering (KGQA), BERT or its variants have become indispensable components of KGQA models. However, there is still no comprehensive study comparing the performance of different PLMs in KGQA. To this end, we summarize two basic PLM-based KGQA frameworks, free of additional neural network modules, and use them to compare nine PLMs in terms of accuracy and efficiency. In addition, we present three benchmarks over larger-scale KGs, built on the popular SimpleQuestions benchmark, to investigate the scalability of PLMs. We carefully analyze the results of all PLM-based basic KGQA frameworks on these benchmarks and on two other popular datasets, WebQuestionsSP and FreebaseQA, and find that knowledge distillation techniques and knowledge enhancement methods in PLMs are promising for KGQA. Furthermore, we test ChatGPT (https://chat.openai.com/), which has drawn a great deal of attention in the NLP community, demonstrating both its impressive capabilities and its limitations in zero-shot KGQA. We have released the code and benchmarks to promote the use of PLMs in KGQA (https://github.com/aannonymouuss/PLMs-in-Practical-KBQA).
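To make the setup concrete, the minimal sketch below (in Python, assuming the HuggingFace transformers and torch libraries) illustrates one typical component of a PLM-based basic KGQA framework: casting relation prediction for a SimpleQuestions-style question as sequence classification with a pre-trained encoder. This is not the paper's exact pipeline; the checkpoint name, relation label set, and example question are illustrative placeholders, and the classification head would have to be fine-tuned on KGQA data before its predictions are meaningful.

    # Hypothetical sketch of PLM-based relation classification (not the paper's code).
    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    # Placeholder relation label set; a real setup would use the relations of the target KG.
    RELATIONS = ["people/person/place_of_birth", "film/film/directed_by"]

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModelForSequenceClassification.from_pretrained(
        "bert-base-uncased", num_labels=len(RELATIONS)
    )  # the classification head is randomly initialized and would be fine-tuned in practice

    question = "where was barack obama born"
    inputs = tokenizer(question, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, num_relations)
    predicted = RELATIONS[int(logits.argmax(dim=-1))]
    print(predicted)  # arbitrary until the head is fine-tuned

Answer retrieval would then amount to querying the KG with the detected topic entity and the predicted relation; the accuracy and efficiency comparisons reported in the paper hinge largely on which PLM encoder is plugged into steps like this one.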
Pages: 2855-2886
Page count: 32
Related Papers
50 records in total
  • [11] UniRaG: Unification, Retrieval, and Generation for Multimodal Question Answering With Pre-Trained Language Models
    Lim, Qi Zhi
    Lee, Chin Poo
    Lim, Kian Ming
    Samingan, Ahmad Kamsani
    IEEE ACCESS, 2024, 12 : 71505 - 71519
  • [12] Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning
    Saha, Swarnadeep
    Yadav, Prateek
    Bansal, Mohit
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1190 - 1208
  • [13] Knowledge Rumination for Pre-trained Language Models
    Yao, Yunzhi
    Wang, Peng
    Mao, Shengyu
    Tan, Chuanqi
    Huang, Fei
    Chen, Huajun
    Zhang, Ningyu
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3387 - 3404
  • [14] Question-answering Forestry Pre-trained Language Model: ForestBERT
    Tan, Jingwei
    Zhang, Huaiqing
    Liu, Yang
    Yang, Jie
    Zheng, Dongping
    Linye Kexue/Scientia Silvae Sinicae, 2024, 60 (09) : 99 - 110
  • [15] Knowledge Inheritance for Pre-trained Language Models
    Qin, Yujia
    Lin, Yankai
    Yi, Jing
    Zhang, Jiajie
    Han, Xu
    Zhang, Zhengyan
    Su, Yusheng
    Liu, Zhiyuan
    Li, Peng
    Sun, Maosong
    Zhou, Jie
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3921 - 3937
  • [16] Integrating Knowledge Graph Embeddings and Pre-trained Language Models in Hypercomplex Spaces
    Nayyeri, Mojtaba
    Wang, Zihao
    Akter, Mst. Mahfuja
    Alam, Mirza Mohtashim
    Rony, Md Rashad Al Hasan
    Lehmann, Jens
    Staab, Steffen
    SEMANTIC WEB, ISWC 2023, PART I, 2023, 14265 : 388 - 407
  • [17] Assisted Process Knowledge Graph Building Using Pre-trained Language Models
    Bellan, Patrizio
    Dragoni, Mauro
    Ghidini, Chiara
    AIXIA 2022 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2023, 13796 : 60 - 74
  • [18] Multi-Hop Knowledge Base Question Answering with Pre-Trained Language Model Feature Enhancement
    Wei, Qianqiang
    Zhao, Shuliang
    Lu, Danqi
    Jia, Xiaowen
    Yang, Shilong
    Computer Engineering and Applications, 2024, 60 (22) : 184 - 196
  • [19] An Empirical study on Pre-trained Embeddings and Language Models for Bot Detection
    Garcia-Silva, Andres
    Berrio, Cristian
    Manuel Gomez-Perez, Jose
    4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 148 - 155
  • [20] A Pre-trained Language Model for Medical Question Answering Based on Domain Adaption
    Liu, Lang
    Ren, Junxiang
    Wu, Yuejiao
    Song, Ruilin
    Cheng, Zhen
    Wang, Sibo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 216 - 227