Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引:1
|
作者
Alkhawlani, Mohammed A. [1 ,3 ]
Azman, Azreen [1 ]
Abdullah, Muhamad Taufik [1 ]
Yaakob, Razali [1 ]
Kadir, Rabiah Abdul [2 ]
Alshari, Eissa M. [3 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia
[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia
[3] Ibb Univ, Ibb, Yemen
关键词
Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;
D O I
10.1007/978-3-031-66428-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.
引用
收藏
页码:90 / 104
页数:15
相关论文
共 50 条
  • [41] Convolutional Dictionary Learning by End-To-End Training of Iterative Neural Networks
    Kofler, Andreas
    Wald, Christian
    Schaeffter, Tobias
    Haltmeier, Markus
    Kolbitsch, Christoph
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1213 - 1217
  • [42] Progressively Growing Convolutional Networks for End-to-End Deformable Image Registration
    Eppenhof, Koen A. J.
    Lafarge, Maxime W.
    Pluim, Josien P. W.
    MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949
  • [43] End-to-end face parsing via interlinked convolutional neural networks
    Zi Yin
    Valentin Yiu
    Xiaolin Hu
    Liang Tang
    Cognitive Neurodynamics, 2021, 15 : 169 - 179
  • [44] Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
    Parcollet, Titouan
    Zhang, Ying
    Morchid, Mohamed
    Trabelsi, Chiheb
    Linares, Georges
    De Mori, Renato
    Bengio, Yoshua
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 22 - 26
  • [45] End-to-end face parsing via interlinked convolutional neural networks
    Yin, Zi
    Yiu, Valentin
    Hu, Xiaolin
    Tang, Liang
    COGNITIVE NEURODYNAMICS, 2021, 15 (01) : 169 - 179
  • [46] End-to-End Blood Pressure Prediction via Fully Convolutional Networks
    Baek, Sanghyun
    Jang, Jiyong
    Yoon, Sungroh
    IEEE ACCESS, 2019, 7 : 185458 - 185468
  • [47] End-to-End Bayesian Networks Exact Learning in Shared Memory
    Karan, Subhadeep
    Sayed, Zainul Abideen
    Zola, Jaroslaw
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (04) : 634 - 645
  • [48] End-to-end Convolutional Semantic Embeddings
    You, Quanzeng
    Zhang, Zhengyou
    Luo, Jiebo
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5735 - 5744
  • [49] End-to-End Joint Opinion Role Labeling with BERT
    Quan, Wei
    Hang, Jinli
    Hu, Xiaohua Tony
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2438 - 2446
  • [50] IMPROVING PROSODY MODELLING WITH CROSS-UTTERANCE BERT EMBEDDINGS FOR END-TO-END SPEECH SYNTHESIS
    Xii, Guanghui
    Song, Wei
    Zhang, Zhengchen
    Zhang, Chao
    He, Xiaodong
    Zhou, Bowen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6079 - 6083