Contrastive learning with text augmentation for text classification

被引:2
|
作者
Jia, Ouyang [1 ]
Huang, Huimin [2 ]
Ren, Jiaxin [3 ]
Xie, Luodi [4 ]
Xiao, Yinyin [4 ]
机构
[1] Guangdong Polytech Normal Univ, Sch Cyber Secur, Guangzhou, Peoples R China
[2] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou, Peoples R China
[3] JDcom Inc, Beijing, Peoples R China
[4] Sun Yat Sen Univ, Sch Comp Sci, Guangzhou, Peoples R China
关键词
Machine learning; Contrastive learning; Text augmentation; Text;
D O I
10.1007/s10489-023-04453-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various contrastive learning models have been successfully applied to representation learning for downstream tasks. The positive samples used in contrastive learning are often derived from augmented data, which improve the performance of many computer vision tasks while still not being fully utilized for natural language processing tasks, such as text classification. The existing data augmentation methods have been rarely applied to contrastive learning in the field of NLP. In this paper, we propose a Text Augmentation Contrastive Learning Representation model, TACLR, that combines the easy text augmentation techniques (i.e., synonym replacement, random insertion, random swap and random deletion) and textMixup augmentation method with contrastive learning for text classification task. Furthermore, we propose a unified method that allows flexibly adapting supervised, semi-supervised and unsupervised learning. Experimental results on five text classification datasets show that our TACLR can significantly improve text classification accuracies. We also provide extensive ablation studies for exploring the validity of each component of our model.
引用
收藏
页码:19522 / 19531
页数:10
相关论文
共 50 条
  • [1] Contrastive learning with text augmentation for text classification
    Ouyang Jia
    Huimin Huang
    Jiaxin Ren
    Luodi Xie
    Yinyin Xiao
    [J]. Applied Intelligence, 2023, 53 : 19522 - 19531
  • [2] Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation
    Li, Ximing
    Wang, Bing
    Wang, Yang
    Wang, Meng
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [3] Contrastive learning based on linguistic knowledge and adaptive augmentation for text classification
    Zhang, Shaokang
    Ran, Ning
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [4] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
    Ren, Shuhuai
    Zhang, Jinchao
    Li, Lei
    Sun, Xu
    Zhou, Jie
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9029 - 9043
  • [5] Contrastive adversarial learning in text classification tasks
    He, Jia-long
    Zhang, Xiao-Lin
    Wang, Yong-Ping
    Zhang, Huan-Xiang
    Gao, Lu
    Xu, En-Hui
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 3473 - 3484
  • [6] Contrastive Graph Convolutional Networks with adaptive augmentation for text classification
    Yang, Yintao
    Miao, Rui
    Wang, Yili
    Wang, Xin
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
  • [7] Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach for Hierarchical Text Classification
    Wang, Zihan
    Wang, Peiyi
    Huang, Lianzhe
    Sun, Xin
    Wang, Houfeng
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7109 - 7119
  • [8] Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
    Mou, Guanyi
    Li, Yichuan
    Lee, Kyumin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 876 - 887
  • [9] TextGCL: Graph Contrastive Learning for Transductive Text Classification
    Zhao, Yawei
    Song, Xiaoyang
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] Improved Graph Contrastive Learning for Short Text Classification
    Liu, Yonghao
    Huang, Lan
    Giunchiglia, Fausto
    Feng, Xiaoyue
    Guan, Renchu
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18716 - 18724