Low-Resource Text Classification Using Domain-Adversarial Learning

被引:0
|
作者
Griesshaber, Daniel [1 ]
Ngoc Thang Vu [2 ]
Maucher, Johannes [1 ]
机构
[1] Stuttgart Media Univ, Nobelstr 10, D-70569 Stuttgart, Germany
[2] Univ Stuttgart, Inst Nat Language Proc IMS, Pfaffenwaldring 5B, D-70569 Stuttgart, Germany
关键词
NLP; Low-resource; Deep learning; Domain-adversarial;
D O I
10.1007/978-3-030-00810-9_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning techniques have recently shown to be successful in many natural language processing tasks forming state-of-the-art systems. They require, however, a large amount of annotated data which is often missing. This paper explores the use of domain-adversarial learning as a regularizer to avoid overfitting when training domain invariant features for deep, complex neural network in low-resource and zero-resource settings in new target domains or languages. In case of new languages, we show that monolingual word-vectors can be directly used for training without pre-alignment. Their projection into a common space can be learnt ad-hoc at training time reaching the final performance of pretrained multilingual word-vectors.
引用
下载
收藏
页码:129 / 139
页数:11
相关论文
共 50 条
  • [31] Co-Regularized Adversarial Learning for Multi-Domain Text Classification
    Wu, Yuan
    Inkpen, Diana
    El-Roby, Ahmed
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [32] Dual Adversarial Co-Learning for Multi-Domain Text Classification
    Wu, Yuan
    Guo, Yuhong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6438 - 6445
  • [33] Low-Resource Speech-to-Text Translation
    Bansal, Sameer
    Kamper, Herman
    Livescu, Karen
    Lopez, Adam
    Goldwater, Sharon
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1298 - 1302
  • [34] Text-to-speech for low-resource systems
    Schnell, M
    Küstner, M
    Jokisch, O
    Hoffmann, R
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 259 - 262
  • [35] ADVERSARIAL MULTILINGUAL TRAINING FOR LOW-RESOURCE SPEECH RECOGNITION
    Yi, Jiangyan
    Tao, Jianhua
    Wen, Zhengqi
    Bai, Ye
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4899 - 4903
  • [36] Adversarial Cross-Lingual Transfer Learning for Slot Tagging of Low-Resource Languages
    He, Keqing
    Yan, Yuanmeng
    Xu, Weiran
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [37] LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION
    Huybrechts, Goeric
    Merritt, Thomas
    Comini, Giulia
    Perz, Bartek
    Shah, Raahil
    Lorenzo-Trueba, Jaime
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6593 - 6597
  • [38] DART: Domain-Adversarial Residual-Transfer networks for unsupervised cross-domain image classification
    Fang, Xianghong
    Bai, Haoli
    Guo, Ziyi
    Shen, Bin
    Hoi, Steven
    Xu, Zenglin
    NEURAL NETWORKS, 2020, 127 (127) : 182 - 192
  • [39] DANNMCTG: Domain-Adversarial Training of Neural Network for multicenter antenatal cardiotocography signal classification
    Chen, Li
    Fei, Yue
    Quan, Bin
    Hao, Yuexing
    Chen, Qinqun
    Liu, Guiqing
    Luo, Xiaomu
    Li, Li
    Wei, Hang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [40] Condition Monitoring using Domain-Adversarial Networks with Convolutional Kernel Features
    Caceres-Castellanos, Cesar
    Fehsenfeld, Moritz
    Kortmann, Karl-Philipp
    IFAC PAPERSONLINE, 2023, 56 (02): : 7746 - 7752