Improving Semi-Supervised Text Classification with Dual Meta-Learning

被引:1
|
作者
Li, Shujie [1 ]
Yuan, Guanghu [1 ]
Yang, Min [1 ]
Shen, Ying [2 ]
Li, Chengming [2 ]
Xu, Ruifeng [3 ]
Zhao, Xiaoyan [1 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave,Univ Town,Xili, Shenzhen 518055, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, 66 Gongchang Rd, Guangzhou, Guangdong, Peoples R China
[3] Harbin Inst Technol Shenzhen, Shenzhen 518055, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised text classification; pseudo labeling; noise transition matrix; meta learning; consistency regularization;
D O I
10.1145/3648612
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of semi-supervised text classification (SSTC) is to train a model by exploring both a small number of labeled data and a large number of unlabeled data, such that the learned semi-supervised classifier performs better than the supervised classifier trained on solely the labeled samples. Pseudo-labeling is one of the most widely used SSTC techniques, which trains a teacher classifier with a small number of labeled examples to predict pseudo labels for the unlabeled data. The generated pseudo-labeled examples are then utilized to train a student classifier, such that the learned student classifier can outperform the teacher classifier. Nevertheless, the predicted pseudo labels may be inaccurate, making the performance of the student classifier degraded. The student classifier may perform even worse than the teacher classifier. To alleviate this issue, in this paper, we introduce a dual meta-learning (DML) technique for semi-supervised text classification, which improves the teacher and student classifiers simultaneously in an iterative manner. Specifically, we propose a meta-noise correction method to improve the student classifier by proposing a Noise Transition Matrix (NTM) with meta-learning to rectify the noisy pseudo labels. In addition, we devise a meta pseudo supervision method to improve the teacher classifier. Concretely, we exploit the feedback performance from the student classifier to further guide the teacher classifier to produce more accurate pseudo labels for the unlabeled data. In this way, both teacher and student classifiers can co-evolve in the iterative training process. Extensive experiments on four benchmark datasets highlight the effectiveness of our DML method against existing state-of-theart methods for semi-supervised text classification. We release our code and data of this paper publicly at https://github.com/GRIT621/DML.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [2] A review of semi-supervised learning for text classification
    Duarte, Jose Marcio
    Berton, Lilian
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 9401 - 9469
  • [3] Semi-Supervised Text Classification With Universum Learning
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Lee, Chia-Hoang
    Chang, Tao-Hsing
    Kuo, Tsung-Hsun
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) : 462 - 473
  • [4] TEXT CLASSIFICATION BASED ON SEMI-SUPERVISED LEARNING
    Vo Duy Thanh
    Vo Trung Hung
    Pham Minh Tuan
    Doan Van Ban
    2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 232 - 236
  • [5] SEMI-SUPERVISED LEARNING FOR TEXT CLASSIFICATION BY LAYER PARTITIONING
    Li, Alexander Hanbo
    Sethy, Abhinav
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6164 - 6168
  • [6] DualGraph: Improving Semi-supervised Graph Classification via Dual Contrastive Learning
    Luo, Xiao
    Ju, Wei
    Qu, Meng
    Chen, Chong
    Deng, Minghua
    Hua, Xian-Sheng
    Zhang, Ming
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 699 - 712
  • [7] Semi-Supervised Meta-Learning via Self-Training
    Zhou, Meng
    Li, Yaoyi
    Lu, Hongtao
    Cai Nengbin
    Zhao Xuejun
    2020 THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS'2020), 2020, : 1 - 7
  • [8] Semi-supervised meta-learning elucidates understudied molecular interactions
    Wu, You
    Xie, Li
    Liu, Yang
    Xie, Lei
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)
  • [9] A Dual-channel Semi-supervised Learning Framework on Graphs via Knowledge Transfer and Meta-learning
    Qiao, Ziyue
    Wang, Pengyang
    Wang, Pengfei
    Ning, Zhiyuan
    Fu, Yanjie
    Du, Yi
    Zhou, Yuanchun
    Huang, Jianqiang
    Hua, Xian-Sheng
    Xiong, Hui
    ACM TRANSACTIONS ON THE WEB, 2024, 18 (02)
  • [10] Improving Semi-Supervised Learning for Audio Classification with FixMatch
    Grollmisch, Sascha
    Cano, Estefania
    ELECTRONICS, 2021, 10 (15)