Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引:1
|
作者
Alkhawlani, Mohammed A. [1 ,3 ]
Azman, Azreen [1 ]
Abdullah, Muhamad Taufik [1 ]
Yaakob, Razali [1 ]
Kadir, Rabiah Abdul [2 ]
Alshari, Eissa M. [3 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia
[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia
[3] Ibb Univ, Ibb, Yemen
关键词
Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;
D O I
10.1007/978-3-031-66428-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.
引用
收藏
页码:90 / 104
页数:15
相关论文
共 50 条
  • [1] Learning to Reason: End-to-End Module Networks for Visual Question Answering
    Hu, Ronghang
    Andreas, Jacob
    Rohrbach, Marcus
    Darrell, Trevor
    Saenko, Kate
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 804 - 813
  • [2] Towards End-to-End Multilingual Question Answering
    Ekaterina Loginova
    Stalin Varanasi
    Günter Neumann
    Information Systems Frontiers, 2021, 23 : 227 - 241
  • [3] Towards End-to-End Multilingual Question Answering
    Loginova, Ekaterina
    Varanasi, Stalin
    Neumann, Guenter
    INFORMATION SYSTEMS FRONTIERS, 2021, 23 (01) : 227 - 241
  • [4] Convolutional End-to-End Memory Networks for Multi-Hop Reasoning
    Yang, Xiaoqing
    Fan, Pingzhi
    IEEE ACCESS, 2019, 7 : 135268 - 135276
  • [5] End-To-End Memory Networks
    Sukhbaatar, Sainbayar
    Szlam, Arthur
    Weston, Jason
    Fergus, Rob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [6] GSQA: An End-to-End Model for Generative Spoken Question Answering
    Shih, Min-Han
    Chung, Ho-Lam
    Pai, Yu-Chi
    Hsu, Ming-Hao
    Lin, Guan-Ting
    Lie, Shang-Wen
    Lee, Hung-yi
    INTERSPEECH 2024, 2024, : 2970 - 2974
  • [7] End-to-End Open-Domain Question Answering with BERTserini
    Yang, Wei
    Xie, Yuqing
    Lin, Aileen
    Li, Xingyu
    Tan, Luchen
    Xiong, Kun
    Li, Ming
    Lin, Jimmy
    NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2019, : 72 - 77
  • [8] A Simple End-to-End Question Answering Model for Product Information
    Lai, Tuan Manh
    Bui, Trung
    Li, Sheng
    Lipka, Nedim
    ECONOMICS AND NATURAL LANGUAGE PROCESSING (ECONLP 2018), 2018, : 38 - 43
  • [9] Smoothing CNN for end-to-end training in visual question answering
    Long, Yu
    Tang, Pengjie
    Wang, Hanli
    Li, Qinyu
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 784 - 791
  • [10] A short survey on end-to-end simple question answering systems
    José Wellington Franco da Silva
    Amanda Drielly Pires Venceslau
    Juliano Efson Sales
    José Gilvan Rodrigues Maia
    Vládia Célia Monteiro Pinheiro
    Vânia Maria Ponte Vidal
    Artificial Intelligence Review, 2020, 53 : 5429 - 5453