Context-based Virtual Adversarial Training for Text Classification with Noisy Labels

被引:0
|
作者
Lee, Do-Myoung [1 ]
Kim, Yeachan [2 ]
Seo, Chang-gyun [3 ]
机构
[1] ShinhanCard, Seoul, South Korea
[2] Deargen Inc, Daejeon, South Korea
[3] GC Co, Busan, South Korea
关键词
Text Classification; Learning with Noisy Labels;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep neural networks (DNNs) have a high capacity to completely memorize noisy labels given sufficient training time, and its memorization unfortunately leads to performance degradation. Recently, virtual adversarial training (VAT) attracts attention as it could further improve the generalization of DNNs in semi-supervised learning. The driving force behind VAT is to prevent the models from overffiting to data points by enforcing consistency between the inputs and the perturbed inputs. These strategy could be helpful in learning from noisy labels if it prevents neural models from learning noisy samples while encouraging the models to generalize clean samples. In this paper, we propose context-based virtual adversarial training (ConVAT) to prevent a text classifier from overfitting to noisy labels. Unlike the previous works, the proposed method performs the adversarial training in the context level rather than the inputs. It makes the classifier not only learn its label but also its contextual neighbors, which alleviate the learning from noisy labels by preserving contextual semantics on each data point. We conduct extensive experiments on four text classification datasets with two types of label noises. Comprehensive experimental results clearly show that the proposed method works quite well even with extremely noisy settings.
引用
收藏
页码:6139 / 6146
页数:8
相关论文
共 50 条
  • [1] Context-Based Term Frequency Assessment for Text Classification
    Liu, Rey-Long
    [J]. PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 1004 - 1009
  • [2] Context-Based Term Frequency Assessment for Text Classification
    Liu, Rey-Long
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (02): : 300 - 309
  • [3] Deep Learning Based Robust Text Classification Method via Virtual Adversarial Training
    Zhang, Wei
    Chen, Qian
    Chen, Yunfang
    [J]. IEEE ACCESS, 2020, 8 : 61174 - 61182
  • [4] Context-Based Filtering of Noisy Labels for Automatic Basemap Updating From UAV Data
    Gevaert, Caroline M.
    Persello, Claudio
    Elberink, Sander Oude
    Vosselman, George
    Sliuzas, Richard
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (08) : 2731 - 2741
  • [5] Hierarchical gated recurrent neural network with adversarial and virtual adversarial training on text classification
    Poon, Hoon-Keng
    Yap, Wun-She
    Tee, Yee-Kai
    Lee, Wai-Kong
    Goi, Bok-Min
    [J]. NEURAL NETWORKS, 2019, 119 : 299 - 312
  • [6] Improvements to adversarial training for text classification
    He, Jia-Long
    Zhang, Xiao-Lin
    Wang, Yong-Ping
    Gu, Rui-Chun
    Liu, Li-Xin
    Xu, En-Hui
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 5191 - 5202
  • [7] An adversarial training method for text classification
    Liu, Xiaoyang
    Dai, Shanghong
    Fiumara, Giacomo
    De Meo, Pasquale
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [8] Context-based virtual metrology
    Ebersbach, Peter
    Urbanowicz, Adam M.
    Likhachev, Dmitriy
    Hartig, Carsten
    Shifrin, Michael
    [J]. METROLOGY, INSPECTION, AND PROCESS CONTROL FOR MICROLITHOGRAPHY XXXII, 2018, 10585
  • [9] Towards Robust Learning with Noisy and Pseudo Labels for Text Classification
    Wen, Murtadha Ahmeda Bo
    Ao, Luo
    Pan, Shengfeng
    Su, Jianlin
    Cao, Xinxin
    Liu, Yunfeng
    [J]. INFORMATION SCIENCES, 2024, 661
  • [10] Paragraph Context-Based Text Classification Approach for Large-Scale Judgment Text Structuring
    Weng Y.
    Gu S.
    Li J.
    Wang F.
    Li J.
    Li X.
    [J]. Li, Xin (scufxy4010@163.com), 1600, Tianjin University (54): : 418 - 425