Improving Convolutional End-to-End Memory Networks with BERT for Question Answering

被引:1
|
作者
Alkhawlani, Mohammed A. [1 ,3 ]
Azman, Azreen [1 ]
Abdullah, Muhamad Taufik [1 ]
Yaakob, Razali [1 ]
Kadir, Rabiah Abdul [2 ]
Alshari, Eissa M. [3 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Upm Serdang 43400, Selangor, Malaysia
[2] Univ Kebangsaan Malaysia, Inst Visual Informat, Ukm Bangi 43600, Selangor, Malaysia
[3] Ibb Univ, Ibb, Yemen
关键词
Convolutional End-to-End Memory Networks; BERT; Question answering; bAbI dataset;
D O I
10.1007/978-3-031-66428-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) systems process natural language queries in order to retrieve relevant answers from data, document corpus or the Web. Memory networks have shown encouraging results in certain reasoning tasks of QA such as the end-to-end memory networks. In this paper, we explore the integration of BERT (Bidirectional Encoder Representations from Transformers) with Convolutional End-to-End Memory Networks to improve QA performance. It is anticipated that BERT will provide rich contextual embeddings, allowing for a comprehensive understanding of semantic relationships within sentences and questions. Specifically, we propose an incorporation of BERT, a state-of-the-art pre-trained language model with the Convolutional End-to-End Memory Networks multi-hop reasoning model to improve the overall QA performance. The proposed model can be fine-tuned on smaller datasets, effectively handling overfitting issue. Our experiment shows that the proposed model exhibits remarkable performance, outperforming top results achieved by other memory networks models on the Facebook 'bAbI 1k' dataset with an accuracy of 92.86.
引用
收藏
页码:90 / 104
页数:15
相关论文
共 50 条
  • [21] Zero-Shot End-To-End Spoken Question Answering In Medical Domain
    Labrak, Yanis
    Moumeni, Adel
    Dufour, Richard
    Rouvier, Mickael
    INTERSPEECH 2024, 2024, : 2020 - 2024
  • [22] SUPPORTIVE ATTENTION IN END-TO-END MEMORY NETWORKS
    Chien, Jen-Tzung
    Lin, Ting-An
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [23] Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering
    Aditya, Somak
    Yang, Yezhou
    Baral, Chitta
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 629 - 637
  • [24] End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems
    Shakeri, Siamak
    Dos Santos, Cicero Nogueira
    Zhu, Henry
    Ng, Patrick
    Nan, Feng
    Wang, Zhiguo
    Nallapati, Ramesh
    Xiang, Bing
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5445 - 5460
  • [25] End-to-End Quantum-Like Language Models with Application to Question Answering
    Zhang, Peng
    Niu, Jiabin
    Su, Zhan
    Wang, Benyou
    Ma, Liqun
    Song, Dawei
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5666 - 5673
  • [26] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Jiang, Feng
    Tao, Wen
    Liu, Shaohui
    Ren, Jie
    Guo, Xun
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 3007 - 3018
  • [27] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Tao, Wen
    Jiang, Feng
    Zhang, Shengping
    Ren, Jie
    Shi, Wuzhen
    Zuo, Wangmeng
    Guo, Xun
    Zhao, Debin
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 463 - 463
  • [28] VERY DEEP CONVOLUTIONAL NETWORKS FOR END-TO-END SPEECH RECOGNITION
    Zhang, Yu
    Chan, William
    Jaitly, Navdeep
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4845 - 4849
  • [29] An End-to-End Knowledge Graph Based Question Answering Approach for COVID-19
    Qiao, Yinbo
    Yang, Zhihao
    Lin, Hongfei
    Wang, Jian
    HEALTH INFORMATION PROCESSING, CHIP 2022, 2023, 1772 : 156 - 169
  • [30] Naranjo Question Answering using End-to-End Multi-task Learning Model
    Rawat, Bhanu Pratap Singh
    Li, Fei
    Yu, Hong
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2547 - 2555