Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引：1

作者：

Alkhawlani, Mohammed A. ^{[1
,3
]}

Azman, Azreen ^{[1
]}

Abdullah, Muhamad Taufik ^{[1
]}

Yaakob, Razali ^{[1
]}

Kadir, Rabiah Abdul ^{[2
]}

Alshari, Eissa M. ^{[3
]}

机构：

[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia

[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia

[3] Ibb Univ, Ibb, Yemen

来源：

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2024 | 2024年 / 1066卷

关键词：

Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;

D O I：

10.1007/978-3-031-66428-1_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.

引用

页码：90 / 104

页数：15

共 50 条

[1] Learning to Reason: End-to-End Module Networks for Visual Question Answering
Hu, Ronghang
Andreas, Jacob
Rohrbach, Marcus
Darrell, Trevor
Saenko, Kate
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 804 - 813
[2] Towards End-to-End Multilingual Question Answering
Ekaterina Loginova
Stalin Varanasi
Günter Neumann
Information Systems Frontiers, 2021, 23 : 227 - 241
[3] Towards End-to-End Multilingual Question Answering
Loginova, Ekaterina
Varanasi, Stalin
Neumann, Guenter
INFORMATION SYSTEMS FRONTIERS, 2021, 23 (01) : 227 - 241
[4] Convolutional End-to-End Memory Networks for Multi-Hop Reasoning
Yang, Xiaoqing
Fan, Pingzhi
IEEE ACCESS, 2019, 7 : 135268 - 135276
[5] End-To-End Memory Networks
Sukhbaatar, Sainbayar
Szlam, Arthur
Weston, Jason
Fergus, Rob
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
[6] GSQA: An End-to-End Model for Generative Spoken Question Answering
Shih, Min-Han
Chung, Ho-Lam
Pai, Yu-Chi
Hsu, Ming-Hao
Lin, Guan-Ting
Lie, Shang-Wen
Lee, Hung-yi
INTERSPEECH 2024, 2024, : 2970 - 2974
[7] End-to-End Open-Domain Question Answering with BERTserini
Yang, Wei
Xie, Yuqing
Lin, Aileen
Li, Xingyu
Tan, Luchen
Xiong, Kun
Li, Ming
Lin, Jimmy
NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2019, : 72 - 77
[8] A Simple End-to-End Question Answering Model for Product Information
Lai, Tuan Manh
Bui, Trung
Li, Sheng
Lipka, Nedim
ECONOMICS AND NATURAL LANGUAGE PROCESSING (ECONLP 2018), 2018, : 38 - 43
[9] Smoothing CNN for end-to-end training in visual question answering
Long, Yu
Tang, Pengjie
Wang, Hanli
Li, Qinyu
DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 784 - 791
[10] A short survey on end-to-end simple question answering systems
José Wellington Franco da Silva
Amanda Drielly Pires Venceslau
Juliano Efson Sales
José Gilvan Rodrigues Maia
Vládia Célia Monteiro Pinheiro
Vânia Maria Ponte Vidal
Artificial Intelligence Review, 2020, 53 : 5429 - 5453

← 1 2 3 4 5 →