Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引:1
|
作者
Alkhawlani, Mohammed A. [1 ,3 ]
Azman, Azreen [1 ]
Abdullah, Muhamad Taufik [1 ]
Yaakob, Razali [1 ]
Kadir, Rabiah Abdul [2 ]
Alshari, Eissa M. [3 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia
[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia
[3] Ibb Univ, Ibb, Yemen
关键词
Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;
D O I
10.1007/978-3-031-66428-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.
引用
收藏
页码:90 / 104
页数:15
相关论文
共 50 条
  • [31] End-to-End Kernel Learning with Supervised Convolutional Kernel Networks
    Mairal, Julien
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [32] SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery
    Seenivasan, Lalithkumar
    Islam, Mobarakol
    Kannan, Gokul
    Ren, Hongliang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IX, 2023, 14228 : 281 - 290
  • [33] Improving Graph Convolutional Networks Based on Relation-Aware Attention for End-to-End Relation Extraction
    Hong, Yin
    Liu, Yanxia
    Yang, Suizhu
    Zhang, Kaiwen
    Wen, Aiqing
    Hu, Jianjun
    IEEE ACCESS, 2020, 8 : 51315 - 51323
  • [34] Improving Users Engagement Detection using End-to-End Spatio-Temporal Convolutional Neural Networks
    Saleh, Khaled
    Yu, Kun
    Chen, Fang
    HRI '21: COMPANION OF THE 2021 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2021, : 190 - 194
  • [35] IMPROVING END-TO-END SPEECH TRANSLATION MODEL WITH BERT-BASED CONTEXTUAL INFORMATION
    Bang, Jeong-Uk
    Lee, Min-Kyu
    Yun, Seung
    Kim, Sang-Hun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6227 - 6231
  • [36] End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights
    Rueckle, Andreas
    Gurevych, Iryna
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 19 - 24
  • [37] Improving End-to-End Multicast Rate Control in Wireless Networks
    Kammoun, W.
    Youssef, H.
    ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 642 - 647
  • [38] A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases
    Zhang, Linhai
    Lin, Chao
    Zhou, Deyu
    He, Yulan
    Zhang, Meng
    COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [39] Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
    Zhang, Ying
    Pezeshki, Mohammad
    Brakel, Philemon
    Zhang, Saizheng
    Laurent, Cesar
    Bengio, Yoshua
    Courville, Aaron
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 410 - 414
  • [40] Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
    Li, Hui
    Wang, Peng
    Shen, Chunhua
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5248 - 5256