Contrastive learning with text augmentation for text classification

Cited by: 2
Authors
Jia, Ouyang [1 ]
Huang, Huimin [2 ]
Ren, Jiaxin [3 ]
Xie, Luodi [4 ]
Xiao, Yinyin [4 ]
Affiliations
[1] Guangdong Polytech Normal Univ, Sch Cyber Secur, Guangzhou, Peoples R China
[2] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou, Peoples R China
[3] JDcom Inc, Beijing, Peoples R China
[4] Sun Yat Sen Univ, Sch Comp Sci, Guangzhou, Peoples R China
Keywords
Machine learning; Contrastive learning; Text augmentation; Text;
DOI
10.1007/s10489-023-04453-3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Various contrastive learning models have been successfully applied to representation learning for downstream tasks. The positive samples used in contrastive learning are often derived from augmented data, which improves the performance of many computer vision tasks but is still not fully utilized for natural language processing (NLP) tasks such as text classification. Existing data augmentation methods have rarely been applied to contrastive learning in NLP. In this paper, we propose a Text Augmentation Contrastive Learning Representation model, TACLR, that combines the easy text augmentation techniques (i.e., synonym replacement, random insertion, random swap and random deletion) and the textMixup augmentation method with contrastive learning for the text classification task. Furthermore, we propose a unified method that adapts flexibly to supervised, semi-supervised and unsupervised learning. Experimental results on five text classification datasets show that TACLR can significantly improve text classification accuracy. We also provide extensive ablation studies that explore the validity of each component of our model.
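The abstract names four easy text augmentation operations as a source of positive samples. As a rough illustration only, the sketch below implements two of them (random swap and random deletion) with the Python standard library and pairs the two augmented views of one sentence. The function names, the whitespace tokenization, and the default parameters are assumptions made for this example and are not taken from the TACLR paper. Synonym replacement and random insertion are omitted because they need a thesaurus.

```python
import random


def random_swap(tokens, n_swaps, rng):
    """Return a copy of tokens with n_swaps random position pairs exchanged."""
    out = list(tokens)
    if len(out) < 2:
        return out  # nothing to swap in a 0- or 1-token sentence
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(out)), 2)  # two distinct positions
        out[i], out[j] = out[j], out[i]
    return out


def random_deletion(tokens, p, rng):
    """Drop each token independently with probability p, keeping at least one."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def make_positive_pair(text, rng, n_swaps=1, p_delete=0.1):
    """Build two augmented views of one sentence (hypothetical helper).

    In a contrastive setup, the two views would be treated as a positive
    pair and views of other sentences in the batch as negatives.
    """
    tokens = text.split()  # assumption: simple whitespace tokenization
    return (
        " ".join(random_swap(tokens, n_swaps, rng)),
        " ".join(random_deletion(tokens, p_delete, rng)),
    )


rng = random.Random(0)  # fixed seed so repeated runs give the same views
view_a, view_b = make_positive_pair("the movie was surprisingly good", rng)
print(view_a)
print(view_b)
```

Passing an explicit `random.Random` instance keeps the augmentation reproducible, which helps when comparing ablation runs.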
Pages: 19522 - 19531
Number of pages: 10
Related papers
50 in total
  • [21] A Survey on Data Augmentation for Text Classification
    Bayer, Markus
    Kaufhold, Marc-Andre
    Reuter, Christian
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (07)
  • [22] ACL-RoBERTa-CNN Text Classification Model Combined with Contrastive Learning
    Mu, Zhibo
    Zheng, Shuang
    Wang, Quanmin
    [J]. 2021 INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING AND EDUCATION (BDEE 2021), 2021, : 193 - 197
  • [23] CPCL: Conceptual prototypical contrastive learning for Few-Shot text classification
    Cheng, Tao
    Cheng, Hua
    Fang, Yiquan
    Liu, Yufei
    Gao, Caiting
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 11963 - 11975
  • [24] CLZT: A Contrastive Learning Based Framework for Zero-Shot Text Classification
    Li, Kun
    Lin, Meng
    Hu, Songlin
    Li, Ruixuan
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 623 - 630
  • [25] AHCL-TC: Adaptive Hypergraph Contrastive Learning Networks for Text Classification
    Zhang, Zhen
    Ni, Hao
    Jia, Xiyuan
    Su, Fangfang
    Liu, Mengqiu
    Yun, Wenhao
    Wu, Guohua
    [J]. NEUROCOMPUTING, 2024, 597
  • [26] Data Augmentation With Semantic Enrichment for Deep Learning Invoice Text Classification
    Chi, Wei Wen
    Tang, Tiong Yew
    Salleh, Narishah Mohamed
    Mukred, Muaadh
    Alsalman, Hussain
    Zohaib, Muhammad
    [J]. IEEE ACCESS, 2024, 12 : 57326 - 57344
  • [27] Supervised Contrast Learning Text Classification Model Based on Data Quality Augmentation
    Wu, Liang
    Zhang, Fangfang
    Cheng, Chao
    Song, Shinan
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
  • [28] TABAS: Text augmentation based on attention score for text classification model
    Yu, Yeong Jae
    Yoon, Seung Joo
    Jun, So Young
    Kim, Jong Woo
    [J]. ICT EXPRESS, 2022, 8 (04) : 549 - 554
  • [29] CLUR: Uncertainty Estimation for Few-Shot Text Classification with Contrastive Learning
    He, Jianfeng
    Zhang, Xuchao
    Lei, Shuo
    Alhamadani, Abdulaziz
    Chen, Fanglan
    Xiao, Bei
    Lu, Chang-Tien
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 698 - 710
  • [30] C2L: Causally Contrastive Learning for Robust Text Classification
    Choi, Seungtaek
    Jeong, Myeongho
    Han, Hojae
    Hwang, Seung-won
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10526 - 10534