Adversarial training for few-shot text classification

Cited by: 2

Authors:

Croce, Danilo [1]

Castellucci, Giuseppe [1]

Basili, Roberto [1]

Affiliation:

[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Rome, RM, Italy

Source:

INTELLIGENZA ARTIFICIALE | 2020, Vol. 14, No. 02

Keywords:

Semi-supervised learning; generative adversarial network; kernel-based embedding spaces; universal sentence encoding; Nyström method

DOI:

10.3233/IA-200051

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, Deep Learning methods have become very popular in classification tasks for Natural Language Processing (NLP); this is mainly due to their ability to reach high performances by relying on very simple input representations, i.e., raw tokens. One of the drawbacks of deep architectures is the large amount of annotated data required for an effective training. Usually, in Machine Learning this problem is mitigated by the usage of semi-supervised methods or, more recently, by using Transfer Learning, in the context of deep architectures. One recent promising method to enable semi-supervised learning in deep architectures has been formalized within Semi-Supervised Generative Adversarial Networks (SS-GANs) in the context of Computer Vision. In this paper, we adopt the SS-GAN framework to enable semi-supervised learning in the context of NLP. We demonstrate how an SS-GAN can boost the performances of simple architectures when operating in expressive low-dimensional embeddings; these are derived by combining the unsupervised approximation of linguistic Reproducing Kernel Hilbert Spaces and the so-called Universal Sentence Encoders. We experimentally evaluate the proposed approach over a semantic classification task, i.e., Question Classification, by considering different sizes of training material and different numbers of target classes. By applying such adversarial schema to a simple Multi-Layer Perceptron, a classifier trained over a subset derived from 1% of the original training material achieves 92% of accuracy. Moreover, when considering a complex classification schema, e.g., involving 50 classes, the proposed method outperforms state-of-the-art alternatives such as BERT.
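
The abstract and keywords mention low-dimensional embeddings obtained by approximating a kernel space with the Nyström method. The following is a minimal sketch of that general technique only, not the authors' implementation: the RBF kernel stands in for the linguistic kernels used in the paper, and the names `rbf_kernel`, `nystrom_fit`, and `nystrom_embed`, the random landmark data, and all parameter values are assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(a, b, gamma=0.5):
    """RBF kernel matrix between rows of a and rows of b (stand-in kernel)."""
    sq = (a ** 2).sum(1)[:, None] + (b ** 2).sum(1)[None, :] - 2.0 * a @ b.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


def nystrom_fit(landmarks, kernel=rbf_kernel, eps=1e-10):
    """Build the projection matrix U * S^(-1/2) from the landmark kernel matrix."""
    k_ll = kernel(landmarks, landmarks)
    eigvals, eigvecs = np.linalg.eigh(k_ll)  # symmetric eigendecomposition
    keep = eigvals > eps                     # drop numerically null directions
    return eigvecs[:, keep] / np.sqrt(eigvals[keep])


def nystrom_embed(x, landmarks, proj, kernel=rbf_kernel):
    """Map examples to vectors whose dot products approximate the kernel."""
    return kernel(x, landmarks) @ proj


# Hypothetical data: 8 landmark points in 3 dimensions.
rng = np.random.default_rng(0)
landmarks = rng.normal(size=(8, 3))
proj = nystrom_fit(landmarks)
emb = nystrom_embed(landmarks, landmarks, proj)
```

On the landmark points themselves, the dot products of the embeddings reproduce the kernel matrix up to numerical error; for other points they only approximate it. In a setting like the one the abstract describes, vectors of this kind would be the input to a simple classifier such as a Multi-Layer Perceptron.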

Pages: 201-214

Page count: 14

