Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引：1

作者：

Alkhawlani, Mohammed A. ^{[1
,3
]}

Azman, Azreen ^{[1
]}

Abdullah, Muhamad Taufik ^{[1
]}

Yaakob, Razali ^{[1
]}

Kadir, Rabiah Abdul ^{[2
]}

Alshari, Eissa M. ^{[3
]}

机构：

[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia

[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia

[3] Ibb Univ, Ibb, Yemen

来源：

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2024 | 2024年 / 1066卷

关键词：

Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;

D O I：

10.1007/978-3-031-66428-1_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.

引用

页码：90 / 104

页数：15

共 50 条

[31] End-to-End Kernel Learning with Supervised Convolutional Kernel Networks
Mairal, Julien
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[32] SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery
Seenivasan, Lalithkumar
Islam, Mobarakol
Kannan, Gokul
Ren, Hongliang
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IX, 2023, 14228 : 281 - 290
[33] Improving Graph Convolutional Networks Based on Relation-Aware Attention for End-to-End Relation Extraction
Hong, Yin
Liu, Yanxia
Yang, Suizhu
Zhang, Kaiwen
Wen, Aiqing
Hu, Jianjun
IEEE ACCESS, 2020, 8 : 51315 - 51323
[34] Improving Users Engagement Detection using End-to-End Spatio-Temporal Convolutional Neural Networks
Saleh, Khaled
Yu, Kun
Chen, Fang
HRI '21: COMPANION OF THE 2021 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2021, : 190 - 194
[35] IMPROVING END-TO-END SPEECH TRANSLATION MODEL WITH BERT-BASED CONTEXTUAL INFORMATION
Bang, Jeong-Uk
Lee, Min-Kyu
Yun, Seung
Kim, Sang-Hun
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6227 - 6231
[36] End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights
Rueckle, Andreas
Gurevych, Iryna
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 19 - 24
[37] Improving End-to-End Multicast Rate Control in Wireless Networks
Kammoun, W.
Youssef, H.
ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 642 - 647
[38] A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases
Zhang, Linhai
Lin, Chao
Zhou, Deyu
He, Yulan
Zhang, Meng
COMPUTER SPEECH AND LANGUAGE, 2021, 66
[39] Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Zhang, Ying
Pezeshki, Mohammad
Brakel, Philemon
Zhang, Saizheng
Laurent, Cesar
Bengio, Yoshua
Courville, Aaron
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 410 - 414
[40] Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
Li, Hui
Wang, Peng
Shen, Chunhua
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5248 - 5256

← 1 2 3 4 5 →