Contrastive learning with text augmentation for text classification

被引:2
|
作者
Jia, Ouyang [1 ]
Huang, Huimin [2 ]
Ren, Jiaxin [3 ]
Xie, Luodi [4 ]
Xiao, Yinyin [4 ]
机构
[1] Guangdong Polytech Normal Univ, Sch Cyber Secur, Guangzhou, Peoples R China
[2] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou, Peoples R China
[3] JDcom Inc, Beijing, Peoples R China
[4] Sun Yat Sen Univ, Sch Comp Sci, Guangzhou, Peoples R China
关键词
Machine learning; Contrastive learning; Text augmentation; Text;
D O I
10.1007/s10489-023-04453-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various contrastive learning models have been successfully applied to representation learning for downstream tasks. The positive samples used in contrastive learning are often derived from augmented data, which improve the performance of many computer vision tasks while still not being fully utilized for natural language processing tasks, such as text classification. The existing data augmentation methods have been rarely applied to contrastive learning in the field of NLP. In this paper, we propose a Text Augmentation Contrastive Learning Representation model, TACLR, that combines the easy text augmentation techniques (i.e., synonym replacement, random insertion, random swap and random deletion) and textMixup augmentation method with contrastive learning for text classification task. Furthermore, we propose a unified method that allows flexibly adapting supervised, semi-supervised and unsupervised learning. Experimental results on five text classification datasets show that our TACLR can significantly improve text classification accuracies. We also provide extensive ablation studies for exploring the validity of each component of our model.
引用
收藏
页码:19522 / 19531
页数:10
相关论文
共 50 条
  • [41] Syntactically Coherent Text Augmentation for Sequence Classification
    Pandey, Suraj
    Akhtar, Md. Shad
    Chakraborty, Tanmoy
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (06): : 1323 - 1332
  • [42] Hierarchical Data Augmentation and the Application in Text Classification
    Yu, Shujuan
    Yang, Jie
    Liu, Danlei
    Li, Runqi
    Zhang, Yun
    Zhao, Shengmei
    [J]. IEEE ACCESS, 2019, 7 : 185476 - 185485
  • [43] Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks
    Wu, Xing
    Gao, Chaochen
    Lin, Meng
    Zang, Liangjun
    Hu, Songlin
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 871 - 875
  • [44] DaCon: Multi-Domain Text Classification Using Domain Adversarial Contrastive Learning
    Dai, Yingjun
    El-Roby, Ahmed
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 40 - 52
  • [45] Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification
    Audibert, Alexandre
    Gauffre, Aurelien
    Amini, Massih-Reza
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 245 - 261
  • [46] Text classification with active learning
    Novak, B
    Mladenic, D
    Grobelnik, M
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 398 - +
  • [47] Learning to Weight for Text Classification
    Moreo, Alejandro
    Esuli, Andrea
    Sebastiani, Fabrizio
    [J]. IEEE Transactions on Knowledge and Data Engineering, 2020, 32 (02): : 302 - 316
  • [48] Learning to Weight for Text Classification
    Moreo, Alejandro
    Esuli, Andrea
    Sebastiani, Fabrizio
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (02) : 302 - 316
  • [49] Contrastive classification: A label-independent generalization model for text classification
    Liang, Yi
    Tohti, Turdi
    Hamdulla, Askar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [50] UniTRec: A Unified Text-to-Text Transformer and Joint Contrastive Learning Framework for Text-based Recommendation
    Mao, Zhiming
    Wang, Huimin
    Du, Yiming
    Wong, Kam-Fai
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1160 - 1170