Adversarial training for few-shot text classification

被引:2
|
作者
Croce, Danilo [1 ]
Castellucci, Giuseppe [1 ]
Basili, Roberto [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Rome, RM, Italy
关键词
Semi-supervised learning; generative adversarial network; kernel-based embedding spaces; universal sentence encoding; NYSTROM METHOD;
D O I
10.3233/IA-200051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Deep Learning methods have become very popular in classification tasks for Natural Language Processing (NLP); this is mainly due to their ability to reach high performances by relying on very simple input representations, i.e., raw tokens. One of the drawbacks of deep architectures is the large amount of annotated data required for an effective training. Usually, in Machine Learning this problem is mitigated by the usage of semi-supervised methods or, more recently, by using Transfer Learning, in the context of deep architectures. One recent promising method to enable semi-supervised learning in deep architectures has been formalized within Semi-Supervised Generative Adversarial Networks (SS-GANs) in the context of Computer Vision. In this paper, we adopt the SS-GAN framework to enable semi-supervised learning in the context of NLP. We demonstrate how an SS-GAN can boost the performances of simple architectures when operating in expressive low-dimensional embeddings; these are derived by combining the unsupervised approximation of linguistic Reproducing Kernel Hilbert Spaces and the so-called Universal Sentence Encoders. We experimentally evaluate the proposed approach over a semantic classification task, i.e., Question Classification, by considering different sizes of training material and different numbers of target classes. By applying such adversarial schema to a simple Multi-Layer Perceptron, a classifier trained over a subset derived from 1% of the original training material achieves 92% of accuracy. Moreover, when considering a complex classification schema, e.g., involving 50 classes, the proposed method outperforms state-of-the-art alternatives such as BERT.
引用
收藏
页码:201 / 214
页数:14
相关论文
共 50 条
  • [41] PupilTAN: A Few-Shot Adversarial Pupil Localizer
    Poulopoulos, Nikolaos
    Psarakis, Emmanouil Z.
    Kosmopoulos, Dimitrios
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3128 - 3136
  • [42] Federated Few-Shot Learning with Adversarial Learning
    Fan, Chenyou
    Huang, Jianwei
    [J]. 2021 19TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT), 2021,
  • [43] Cross-Domain Few-Shot Classification via Adversarial Task Augmentation
    Wang, Haoqing
    Deng, Zhi-Hong
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1075 - 1081
  • [44] MetalGAN: A Cluster-Based Adaptive Training for Few-Shot Adversarial Colorization
    Fontanini, Tomaso
    Iotti, Eleonora
    Prati, Andrea
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 280 - 291
  • [45] Attentional adversarial training for few-shot medical image segmentation without annotations
    Awudong, Buhailiqiemu
    Li, Qi
    Liang, Zili
    Tian, Lin
    Yan, Jingwen
    [J]. PLOS ONE, 2024, 19 (05):
  • [46] The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification
    Mesgar, Mohsen
    Thy Thy Tran
    Glavas, Goran
    Gurevych, Iryna
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1846 - 1857
  • [47] Rethinking Generalization in Few-Shot Classification
    Hiller, Markus
    Ma, Rongkai
    Harandi, Mehrtash
    Drummond, Tom
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [48] Relational Embedding for Few-Shot Classification
    Kang, Dahyun
    Kwon, Heeseung
    Min, Juhong
    Cho, Minsu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8802 - 8813
  • [49] Label Hallucination for Few-Shot Classification
    Jian, Yiren
    Torresani, Lorenzo
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7005 - 7014
  • [50] Few-Shot Classification with Contrastive Learning
    Yang, Zhanyuan
    Wang, Jinghua
    Zhu, Yingying
    [J]. COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 293 - 309