Improving Semi-Supervised Text Classification with Dual Meta-Learning

Cited by: 1
Authors
Li, Shujie [1]
Yuan, Guanghu [1]
Yang, Min [1]
Shen, Ying [2]
Li, Chengming [2]
Xu, Ruifeng [3]
Zhao, Xiaoyan [1]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave,Univ Town,Xili, Shenzhen 518055, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, 66 Gongchang Rd, Guangzhou, Guangdong, Peoples R China
[3] Harbin Inst Technol Shenzhen, Shenzhen 518055, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semi-supervised text classification; pseudo labeling; noise transition matrix; meta learning; consistency regularization
DOI
10.1145/3648612
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The goal of semi-supervised text classification (SSTC) is to train a model on both a small amount of labeled data and a large amount of unlabeled data, such that the learned semi-supervised classifier performs better than a supervised classifier trained solely on the labeled samples. Pseudo-labeling is one of the most widely used SSTC techniques: a teacher classifier is trained on a small number of labeled examples and then predicts pseudo labels for the unlabeled data. The resulting pseudo-labeled examples are used to train a student classifier, with the aim that the student outperforms the teacher. However, the predicted pseudo labels may be inaccurate, which degrades the student classifier; the student may even perform worse than the teacher. To alleviate this issue, we introduce a dual meta-learning (DML) technique for semi-supervised text classification, which improves the teacher and student classifiers simultaneously in an iterative manner. Specifically, we propose a meta-noise correction method that improves the student classifier by using a Noise Transition Matrix (NTM), learned with meta-learning, to rectify the noisy pseudo labels. In addition, we devise a meta pseudo supervision method to improve the teacher classifier: we exploit the feedback performance of the student classifier to guide the teacher classifier toward producing more accurate pseudo labels for the unlabeled data. In this way, the teacher and student classifiers co-evolve during the iterative training process. Extensive experiments on four benchmark datasets highlight the effectiveness of our DML method against existing state-of-the-art methods for semi-supervised text classification. The code and data for this paper are publicly available at https://github.com/GRIT621/DML.
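The abstract does not say how the Noise Transition Matrix enters the student's training objective. As a rough illustration only, the sketch below shows the generic "forward correction" idea from the noisy-label literature, which may differ from what DML does: the student's clean class probabilities are multiplied by a matrix T, where T[i, j] is assumed to be the probability that a true class i receives pseudo label j, and the cross-entropy is taken against the teacher's pseudo labels. The function names, the fixed T, and the toy logits are all hypothetical; in the paper the matrix is learned with meta-learning rather than set by hand.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-subtraction for stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def forward_corrected_loss(student_logits, pseudo_labels, T):
    """Cross-entropy between noise-adjusted student predictions and pseudo labels.

    T[i, j] is assumed to be P(pseudo label = j | true label = i),
    so every row of T sums to 1.
    """
    clean_probs = softmax(student_logits)   # shape (n, k): P(true class | x)
    noisy_probs = clean_probs @ T           # shape (n, k): P(pseudo label | x)
    rows = np.arange(len(pseudo_labels))
    picked = noisy_probs[rows, pseudo_labels]
    return float(-np.log(picked + 1e-12).mean())


# Hypothetical toy values, for illustration only.
T = np.array([[0.9, 0.1],
              [0.2, 0.8]])
logits = np.array([[2.0, 0.0],
                   [0.0, 2.0],
                   [1.0, 1.0]])
pseudo = np.array([0, 1, 0])

loss_identity = forward_corrected_loss(logits, pseudo, np.eye(2))
loss_corrected = forward_corrected_loss(logits, pseudo, T)
```

With T set to the identity matrix the loss reduces to ordinary cross-entropy on the pseudo labels, which is the uncorrected pseudo-labeling baseline the abstract describes.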
Pages: 28