Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引:1
|
作者
Alkhawlani, Mohammed A. [1 ,3 ]
Azman, Azreen [1 ]
Abdullah, Muhamad Taufik [1 ]
Yaakob, Razali [1 ]
Kadir, Rabiah Abdul [2 ]
Alshari, Eissa M. [3 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia
[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia
[3] Ibb Univ, Ibb, Yemen
关键词
Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;
D O I
10.1007/978-3-031-66428-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.
引用
收藏
页码:90 / 104
页数:15
相关论文
共 50 条
  • [11] A short survey on end-to-end simple question answering systems
    Franco da Silva, Jose Wellington
    Pires Venceslau, Amanda Drielly
    Sales, Juliano Efson
    Rodrigues Maia, Jose Gilvan
    Monteiro Pinheiro, Vladia Celia
    Ponte Vidal, Vania Maria
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (07) : 5429 - 5453
  • [12] Gated End-to-End Memory Networks
    Liu, Fei
    Perez, Julien
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 1 - 10
  • [13] Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
    Peng, Min
    Wang, Chongyang
    Shi, Yu
    Zhou, Xiang-Dong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2038 - 2046
  • [14] EndCold: An End-to-End Framework for Cold Question Routing in Community Question Answering Services
    Sun, Jiankai
    Zhao, Jie
    Sun, Huan
    Parthasarathy, Srinivasan
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3244 - 3250
  • [15] MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering
    Liu, Qian
    Geng, Xiubo
    Huang, Heyan
    Qin, Tao
    Lu, Jie
    Jiang, Daxin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2594 - 2605
  • [16] A model for quantitative evaluation of an end-to-end question-answering system
    Wacholder, Nina
    Kelly, Diane
    Kantor, Paul
    Rittman, Robert
    Sun, Ying
    Bai, Bing
    Small, Sharon
    Yamrom, Boris
    Strzalkowski, Tomek
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (08): : 1082 - 1099
  • [17] Expanding End-to-End Question Answering on Differentiable Knowledge Graphs with Intersection
    Sen, Priyanka
    Saffari, Amir
    Oliya, Armin
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8805 - 8812
  • [18] End-to-End Text Recognition with Convolutional Neural Networks
    Wang, Tao
    Wu, David J.
    Coates, Adam
    Ng, Andrew Y.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
  • [19] BSQA: Bidirectional Stacked Question Answering Architecture for End-to-end Event Extraction
    Jiang, Zetai
    Tian, Sanchuan
    Kong, Fang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [20] End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering
    Yu, Youngjae
    Ko, Hyungjin
    Choi, Jongwook
    Kim, Gunhee
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3261 - 3269