Pseudo-labeling with transformers for improving Question Answering systems

被引:0
|
作者
Kuligowska, Karolina [1 ]
Kowalczuk, Bartlomiej [1 ]
机构
[1] Univ Warsaw, Fac Econ Sci, Dluga St 44-50, PL-00241 Warsaw, Poland
关键词
Natural Language Processing; Question Answering systems; pseudo-labeling; neural networks; transfer learning; knowledge distillation;
D O I
10.1016/j.procs.2021.08.119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in neural networks contributed to the fast development of Natural Language Processing systems. As a result, Question Answering systems have evolved and can classify and answer questions in an intuitive yet communicative way. However, the lack of large volumes of labeled data prevents large-scale training and development of Question Answering systems, confirming the need for further research. This paper aims to handle this real-world problem of lack of labeled datasets by applying a pseudolabeling technique relying on a neural network transformer model DistiIBERT. In order to evaluate our contribution, we examined the performance of a text classification transformer model that was fine-tuned on the data subject to prior pseudo-labeling. Research has shown the usefulness of the applied pseudo-labeling technique on a neural network text classification transformer model DistiIBERT. The results of our analysis indicated that the model with additional pseudo-labeled data achieved the best results among other compared neural network architectures. Based on that result, Question Answering systems may be directly improved by enriching their training steps with additional data acquired cost-effectively. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:1162 / 1169
页数:8
相关论文
共 50 条
  • [1] Improving the Robustness of Question Answering Systems to Question Paraphrasing
    Gan, Wee Chung
    Ng, Hwee Tou
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6065 - 6075
  • [2] Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels
    Higuchi, Yosuke
    Moritz, Niko
    Le Roux, Jonathan
    Hori, Takaaki
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1424 - 1438
  • [3] Continuous Soft Pseudo-Labeling in ASR
    Likhomanenko, Tatiana
    Collobert, Ronan
    Jaitly, Navdeep
    Bengio, Samy
    [J]. PROCEEDINGS ON I CAN'T BELIEVE IT'S NOT BETTER! - UNDERSTANDING DEEP LEARNING THROUGH EMPIRICAL FALSIFICATION, VOL 187, 2022, 187 : 66 - 84
  • [4] DistractFlow: Improving Optical Flow Estimation via Realistic Distractions and Pseudo-Labeling
    Jeong, Jisoo
    Cai, Hong
    Garrepalli, Risheek
    Porikli, Fatih
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13691 - 13700
  • [5] Iterative Pseudo-Labeling for Speech Recognition
    Xu, Qiantong
    Likhomanenko, Tatiana
    Kahn, Jacob
    Hannun, Awni
    Synnaeve, Gabriel
    Collobert, Ronan
    [J]. INTERSPEECH 2020, 2020, : 1006 - 1010
  • [6] PSEUDO-LABELING FOR MASSIVELY MULTILINGUAL SPEECH RECOGNITION
    Lugosch, Loren
    Likhomanenko, Tatiana
    Synnaeve, Gabriel
    Collobert, Ronan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7687 - 7691
  • [7] Active Learning Method Based on Pseudo-labeling
    Hou, Xiaonan
    Wang, Chunlei
    [J]. 2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 453 - 458
  • [8] Autonomous Temporal Pseudo-Labeling for Fish Detection
    Veiga, Ricardo J. M.
    Ochoa, Inigo E.
    Belackova, Adela
    Bentes, Luis
    Silva, Joao P.
    Semiao, Jorge
    Rodrigues, Joao M. F.
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [9] Unbiased Pseudo-Labeling for Learning with Noisy Labels
    Higashimoto, Ryota
    Yoshida, Soh
    Horihata, Takashi
    Muneyasu, Mitsuji
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 44 - 48
  • [10] RPSC: Robust Pseudo-Labeling for Semantic Clustering
    Liu, Sihang
    Cao, Wenming
    Fu, Ruigang
    Yang, Kaixiang
    Yu, Zhiwen
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 14008 - 14016