Improving Semi-Supervised Text Classification with Dual Meta-Learning

Cited by: 1
Authors
Li, Shujie [1]
Yuan, Guanghu [1]
Yang, Min [1]
Shen, Ying [2]
Li, Chengming [2]
Xu, Ruifeng [3]
Zhao, Xiaoyan [1]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave,Univ Town,Xili, Shenzhen 518055, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, 66 Gongchang Rd, Guangzhou, Guangdong, Peoples R China
[3] Harbin Inst Technol Shenzhen, Shenzhen 518055, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semi-supervised text classification; pseudo labeling; noise transition matrix; meta learning; consistency regularization
DOI
10.1145/3648612
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The goal of semi-supervised text classification (SSTC) is to train a model on both a small amount of labeled data and a large amount of unlabeled data, such that the learned semi-supervised classifier performs better than a supervised classifier trained solely on the labeled samples. Pseudo-labeling is one of the most widely used SSTC techniques: a teacher classifier is trained on a small number of labeled examples and then predicts pseudo labels for the unlabeled data. The resulting pseudo-labeled examples are used to train a student classifier, with the aim that the student outperforms the teacher. However, the predicted pseudo labels may be inaccurate, which degrades the student classifier's performance; the student may even perform worse than the teacher. To alleviate this issue, we introduce a dual meta-learning (DML) technique for semi-supervised text classification, which improves the teacher and student classifiers simultaneously in an iterative manner. Specifically, we propose a meta-noise correction method that improves the student classifier by introducing a Noise Transition Matrix (NTM), learned with meta-learning, to rectify the noisy pseudo labels. In addition, we devise a meta pseudo supervision method to improve the teacher classifier: we use the student classifier's performance as feedback to guide the teacher classifier toward producing more accurate pseudo labels for the unlabeled data. In this way, the teacher and student classifiers co-evolve during the iterative training process. Extensive experiments on four benchmark datasets demonstrate the effectiveness of our DML method against existing state-of-the-art methods for semi-supervised text classification. The code and data for this paper are publicly available at https://github.com/GRIT621/DML.
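The role of a noise transition matrix in training on pseudo labels can be illustrated with a minimal sketch. This is not the authors' implementation: it shows the standard forward loss correction with a fixed, hand-chosen matrix, whereas the abstract describes learning the matrix with meta-learning. The function names, the toy logits, and the matrix values are all assumptions made for illustration.

```python
import math

def softmax(logits):
    """Convert a list of logits into class probabilities."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def noisy_distribution(clean_probs, transition):
    """Map clean-class probabilities to pseudo-label probabilities.

    transition[i][j] is assumed to be P(pseudo label = j | true label = i),
    so every row sums to 1.
    """
    k = len(transition)
    return [sum(clean_probs[i] * transition[i][j] for i in range(k))
            for j in range(k)]

def forward_corrected_loss(student_logits, pseudo_labels, transition):
    """Mean cross-entropy between noise-adjusted student predictions
    and the (possibly wrong) pseudo labels."""
    total = 0.0
    for logits, label in zip(student_logits, pseudo_labels):
        noisy = noisy_distribution(softmax(logits), transition)
        total -= math.log(noisy[label] + 1e-12)
    return total / len(pseudo_labels)

# Hypothetical teacher outputs for three unlabeled texts, two classes.
teacher_probs = [[0.9, 0.1], [0.3, 0.7], [0.6, 0.4]]
pseudo_labels = [row.index(max(row)) for row in teacher_probs]

# Hypothetical student logits; the student disagrees with the third pseudo label.
student_logits = [[2.0, 0.0], [0.0, 2.0], [0.0, 2.0]]

identity = [[1.0, 0.0], [0.0, 1.0]]        # treats pseudo labels as clean
assumed_ntm = [[0.8, 0.2], [0.3, 0.7]]     # hand-picked, not learned

loss_plain = forward_corrected_loss(student_logits, pseudo_labels, identity)
loss_corrected = forward_corrected_loss(student_logits, pseudo_labels, assumed_ntm)
```

With the identity matrix the loss reduces to ordinary cross-entropy on the pseudo labels. With a non-identity matrix, the student is penalized less for disagreeing with a pseudo label that the matrix marks as plausibly noisy, which is the intuition behind rectifying pseudo labels through an NTM.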
Pages: 28