Pseudo-labeling with transformers for improving Question Answering systems

被引:0
|
作者
Kuligowska, Karolina [1 ]
Kowalczuk, Bartlomiej [1 ]
机构
[1] Univ Warsaw, Fac Econ Sci, Dluga St 44-50, PL-00241 Warsaw, Poland
关键词
Natural Language Processing; Question Answering systems; pseudo-labeling; neural networks; transfer learning; knowledge distillation;
D O I
10.1016/j.procs.2021.08.119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in neural networks contributed to the fast development of Natural Language Processing systems. As a result, Question Answering systems have evolved and can classify and answer questions in an intuitive yet communicative way. However, the lack of large volumes of labeled data prevents large-scale training and development of Question Answering systems, confirming the need for further research. This paper aims to handle this real-world problem of lack of labeled datasets by applying a pseudolabeling technique relying on a neural network transformer model DistiIBERT. In order to evaluate our contribution, we examined the performance of a text classification transformer model that was fine-tuned on the data subject to prior pseudo-labeling. Research has shown the usefulness of the applied pseudo-labeling technique on a neural network text classification transformer model DistiIBERT. The results of our analysis indicated that the model with additional pseudo-labeled data achieved the best results among other compared neural network architectures. Based on that result, Question Answering systems may be directly improved by enriching their training steps with additional data acquired cost-effectively. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:1162 / 1169
页数:8
相关论文
共 50 条
  • [21] Non-Outlier Pseudo-Labeling for Short Text Clustering
    Zhou, Fangquan
    Gui, Shenglin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IX, 2023, 14262 : 102 - 113
  • [22] Learning From Synthetic Images via Active Pseudo-Labeling
    Song, Liangchen
    Xu, Yonghao
    Zhang, Lefei
    Du, Bo
    Zhang, Qian
    Wang, Xinggang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6452 - 6465
  • [23] Sound Event Detection by Pseudo-Labeling in Weakly Labeled Dataset
    Park, Chungho
    Kim, Donghyeon
    Ko, Hanseok
    [J]. SENSORS, 2021, 21 (24)
  • [24] Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
    Higuchi, Yosuke
    Moritz, Niko
    Le Roux, Jonathan
    Hori, Takaaki
    [J]. INTERSPEECH 2021, 2021, : 726 - 730
  • [25] slimIPL: Language-Model-Free Iterative Pseudo-Labeling
    Likhomanenko, Tatiana
    Xu, Qiantong
    Kahn, Jacob
    Synnaeve, Gabriel
    Collobert, Ronan
    [J]. INTERSPEECH 2021, 2021, : 741 - 745
  • [26] A pseudo-labeling framework for content-based image retrieval
    Yap, Kim-Hui
    Wu, Kui
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN IMAGE AND SIGNAL PROCESSING, 2007, : 266 - 270
  • [27] A Pseudo-labeling Approach to Semi-supervised Organ Segmentation
    Gao, Jianwei
    Xu, Juan
    Fei, Honggao
    [J]. FAST AND LOW-RESOURCE SEMI-SUPERVISED ABDOMINAL ORGAN SEGMENTATION, FLARE 2022, 2022, 13816 : 318 - 326
  • [28] UNCERTAINTY-AWARE PSEUDO-LABELING FOR SPOKEN LANGUAGE ASSESSMENT
    Lin, Binghuai
    Wang, Liyuan
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 885 - 891
  • [29] Informative pseudo-labeling for graph neural networks with few labels
    Li, Yayong
    Yin, Jie
    Chen, Ling
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (01) : 228 - 254
  • [30] Question/Answering Systems
    Visser, Ubbo
    [J]. KUNSTLICHE INTELLIGENZ, 2012, 26 (02): : 191 - 195