Low-resource text classification using domain-adversarial learning

被引:9
|
作者
Griesshaber, Daniel [1 ]
Ngoc Thang Vu [2 ]
Maucher, Johannes [1 ]
机构
[1] Stuttgart Media Univ, Nobelstr 10, D-70569 Stuttgart, Germany
[2] Univ Stuttgart, Inst Nat Language Proc IMS, Pfaffenwaldring 5b, D-70569 Stuttgart, Germany
来源
关键词
NLP; Low-resource; Deep learning; Domain-adversarial;
D O I
10.1016/j.csl.2019.101056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning techniques have recently shown to be successful in many natural language processing tasks forming state-of-the-art systems. They require, however, a large amount of annotated data which is often missing. This paper explores the use of domain-adversarial learning as a regularizer to avoid overfitting when training domain invariant features for deep, complex neural networks in low-resource and zero-resource settings in new target domains or languages. In case of new languages, we show that monolingual word vectors can be directly used for training without prealignment. Their projection into a common space can be learnt ad-hoc at training time reaching the final performance of pretrained multilingual word vectors. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Low-Resource Text Classification Using Domain-Adversarial Learning
    Griesshaber, Daniel
    Ngoc Thang Vu
    Maucher, Johannes
    [J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 129 - 139
  • [2] Domain-Adversarial Graph Neural Networks for Text Classification
    Wu, Man
    Pan, Shirui
    Zhu, Xingquan
    Zhou, Chuan
    Pan, Lei
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 648 - 657
  • [3] Out-of-Domain Detection for Low-Resource Text Classification Tasks
    Tan, Ming
    Yu, Yang
    Wang, Haoyu
    Wang, Dakuo
    Potdar, Saloni
    Chang, Shiyu
    Yu, Mo
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3566 - 3572
  • [4] Handwriting Recognition in Low-resource Scripts using Adversarial Learning
    Bhunia, Ayan Kumar
    Das, Abhirup
    Bhunia, Ankan Kumar
    Kishore, Perla Sai Raj
    Roy, Partha Pratim
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4762 - 4771
  • [5] Domain-Aligned Data Augmentation for Low-Resource and Imbalanced Text Classification
    Stylianou, Nikolaos
    Chatzakou, Despoina
    Tsikrika, Theodora
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 172 - 187
  • [6] Domain Adaptation Speech-to-Text for Low-Resource European Portuguese Using Deep Learning
    Medeiros, Eduardo
    Corado, Leonel
    Rato, Luis
    Quaresma, Paulo
    Salgueiro, Pedro
    [J]. FUTURE INTERNET, 2023, 15 (05)
  • [7] Knowledge-Aware Meta-learning for Low-Resource Text Classification
    Yao, Huaxiu
    Wu, Yingxin
    Al-Shedivat, Maruan
    Xing, Eric P.
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1814 - 1821
  • [8] DaCon: Multi-Domain Text Classification Using Domain Adversarial Contrastive Learning
    Dai, Yingjun
    El-Roby, Ahmed
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 40 - 52
  • [9] Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network
    Li X.
    Culotta A.
    [J]. SN Computer Science, 4 (5)
  • [10] Meta adversarial learning improves low-resource speech recognition
    Chen, Yaqi
    Yang, Xukui
    Zhang, Hao
    Zhang, Wenlin
    Qu, Dan
    Chen, Cong
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 84