Advanced pseudo-labeling approach in mixing-based text data augmentation method

被引:0
|
作者
Park, Jungmin [1 ]
Lee, Younghoon [2 ]
机构
[1] Corp Business Strategy Off, Idea Lab, LG Elect, 128, Yeouidae-ro, Seoul 07336, South Korea
[2] Seoul Natl Univ Sci & Technol, Dept Ind Engn, 232 Gongneung Ro, Seoul 01811, South Korea
基金
新加坡国家研究基金会;
关键词
Text augmentation; Mix-up approach; Explainable artificial intelligence; Over-fitting prevention; Word-explainability; SENTIMENT CLASSIFICATION;
D O I
10.1007/s10044-024-01340-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text augmentation methods facilitate an increase in the amount of training data, without having to collect new training data, by generating transformed versions of real datasets. Among such methods, mixing-based approaches, which swap words between two or more sentences, are widely applied owing to their simplicity and noteworthy performance. However, existing mixing-based approaches do not consider the importance of manipulated words during the pseudo-labeling process because they utilize a naive linear interpolation method. Thus, this paper proposes an advanced mixing-based text augmentation approach based on artificial intelligence methods that explicitly reflect the importance of manipulated words in the pseudo-labeling process. In addition, to avoid overdependence on the pseudo-labeling quality in the training process, the difference between the original label and prediction is also reflected in the loss function. Experimental results indicate that the performance of the proposed method is significantly higher than that of existing approaches.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Active Learning Method Based on Pseudo-labeling
    Hou, Xiaonan
    Wang, Chunlei
    [J]. 2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 453 - 458
  • [2] Non-Outlier Pseudo-Labeling for Short Text Clustering
    Zhou, Fangquan
    Gui, Shenglin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IX, 2023, 14262 : 102 - 113
  • [3] A Novel Pseudo-Labeling Approach for Cell Detection Based on Adaptive Threshold
    Bai, Tian
    Zhang, Zhenting
    Zhao, Chen
    Luo, Xiao
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 254 - 265
  • [4] A Pseudo-labeling Approach to Semi-supervised Organ Segmentation
    Gao, Jianwei
    Xu, Juan
    Fei, Honggao
    [J]. FAST AND LOW-RESOURCE SEMI-SUPERVISED ABDOMINAL ORGAN SEGMENTATION, FLARE 2022, 2022, 13816 : 318 - 326
  • [5] Mixing Approach for Text Data Augmentation Based on an Ensemble of Explainable Artificial Intelligence Methods
    Jinyi Yu
    Jinhae Choi
    Younghoon Lee
    [J]. Neural Processing Letters, 2023, 55 : 1741 - 1757
  • [6] Mixing Approach for Text Data Augmentation Based on an Ensemble of Explainable Artificial Intelligence Methods
    Yu, Jinyi
    Choi, Jinhae
    Lee, Younghoon
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1741 - 1757
  • [7] A Semi-Supervised Learning Method for Spiking Neural Networks Based on Pseudo-Labeling
    Nguyen, Thao N. N.
    Veeravalli, Bharadwaj
    Fong, Xuanyao
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] A Sparse Representation-Based Sample Pseudo-Labeling Method for Hyperspectral Image Classification
    Cui, Binge
    Cui, Jiandi
    Lu, Yan
    Guo, Nannan
    Gong, Maoguo
    [J]. REMOTE SENSING, 2020, 12 (04)
  • [9] Teachers in Concordance for Pseudo-Labeling of 3D Sequential Data
    Gebrehiwot, Awet Haileslassie
    Vacek, Patrik
    Hurych, David
    Zimmermann, Karel
    Perez, Patrick
    Svoboda, Tomas
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 536 - 543
  • [10] A pseudo-labeling framework for content-based image retrieval
    Yap, Kim-Hui
    Wu, Kui
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN IMAGE AND SIGNAL PROCESSING, 2007, : 266 - 270