Zero-shot Text Classification via Reinforced Self-training

被引:0
|
作者
Ye, Zhiquan [1 ,2 ]
Geng, Yuxia [1 ,2 ]
Chen, Jiaoyan [4 ]
Xu, Xiaoxiao [3 ]
Zheng, Suhang [3 ]
Wang, Feng [3 ]
Chen, Jingmin [3 ]
Zhang, Jun [3 ]
Chen, Huajun [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou, Peoples R China
[2] AZFT Joint Lab Knowledge Engine, Hangzhou, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
[4] Univ Oxford, Dept Comp Sci, Oxford, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning has been a tough problem since no labeled data is available for unseen classes during training, especially for classes with low similarity. In this situation, transferring from seen classes to unseen classes is extremely hard. To tackle this problem, in this paper we propose a self-training based method to efficiently leverage unlabeled data. Traditional self-training methods use fixed heuristics to select instances from unlabeled data, whose performance varies among different datasets. We propose a reinforcement learning framework to learn data selection strategy automatically and provide more reliable selection. Experimental results on both benchmarks and a real-world e-commerce dataset show that our approach significantly outperforms previous methods in zero-shot text classification.
引用
收藏
页码:3014 / 3024
页数:11
相关论文
共 50 条
  • [21] Cost Effective Annotation Framework Using Zero-Shot Text Classification
    Kasthuriarachchy, Buddhika
    Chetty, Madhu
    Shatte, Adrian
    Walls, Darren
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [22] Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach
    Yin, Wenpeng
    Hay, Jamaal
    Roth, Dan
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3914 - 3923
  • [23] Using Pseudo-Labelled Data for Zero-Shot Text Classification
    Wang, Congcong
    Nulty, Paul
    Lillis, David
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 35 - 46
  • [24] Prompt-based Zero-shot Text Classification with Conceptual Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Nguyen, Anh
    De, Suparna
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-SRW 2023, VOL 4, 2023, : 30 - 38
  • [25] A weakly supervised textual entailment approach to zero-shot text classification
    Pamies, Marc
    Llop, Joan
    Multari, Francesco
    Duran-Silva, Nicolau
    Parra-Rojas, Cesar
    Gonzalez-Agirre, Aitor
    Massucci, Francesco Alessandro
    Villegas, Marta
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 286 - 296
  • [26] CLZT: A Contrastive Learning Based Framework for Zero-Shot Text Classification
    Li, Kun
    Lin, Meng
    Hu, Songlin
    Li, Ruixuan
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 623 - 630
  • [27] Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8352 - 8359
  • [28] ProZe: Explainable and Prompt-Guided Zero-Shot Text Classification
    Harrando, Ismail
    Reboud, Alison
    Schleider, Thomas
    Ehrhart, Thibault
    Troncy, Raphael
    [J]. IEEE INTERNET COMPUTING, 2022, 26 (06) : 69 - 77
  • [29] Text Classification Based on Transfer Learning and Self-Training
    Zheng, Yabin
    Teng, Shaohua
    Liu, Zhiyuan
    Sun, Maosong
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 363 - 367
  • [30] CLAREL: Classification via retrieval loss for zero-shot learning
    Oreshkin, Boris N.
    Rostamzadeh, Negar
    Pinheiro, Pedro O.
    Pal, Christopher
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3989 - 3993