Adversarial Training with Fast Gradient Projection Method against Synonym Substitution Based Text Attacks

Authors
Wang, Xiaosen [1]
Yang, Yichen [1]
Deng, Yihe [2]
He, Kun [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Adversarial training is the most empirically successful approach to improving the robustness of deep neural networks for image classification. For text classification, however, existing synonym substitution based adversarial attacks are effective but too inefficient to be incorporated into practical text adversarial training. Gradient-based attacks, which are very efficient for images, are hard to implement for synonym substitution based text attacks because of the lexical, grammatical and semantic constraints and the discrete text input space. We therefore propose a fast text adversarial attack method based on synonym substitution, called Fast Gradient Projection Method (FGPM), which is about 20 times faster than existing text attack methods and achieves similar attack performance. We then incorporate FGPM into adversarial training and propose a text defense method called Adversarial Training with FGPM enhanced by Logit pairing (ATFL). Experiments show that ATFL significantly improves model robustness and blocks the transferability of adversarial examples.
Pages: 13997-14005
Page count: 9
Related papers
50 in total
  • [41] MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks
    Song, Chang
    Cheng, Hsin-Pai
    Yang, Huanrui
    Li, Sicheng
    Wu, Chunpeng
    Wu, Qing
    Chen, Yiran
    Li, Hai
    2018 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2018, : 476 - 481
  • [42] Text's Armor: Optimized Local Adversarial Perturbation Against Scene Text Editing Attacks
    Xiang, Tao
    Liu, Hangcheng
    Guo, Shangwei
    Liu, Hantao
    Zhang, Tianwei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2777 - 2785
  • [43] Sensitivity of Adversarial Perturbation in Fast Gradient Sign Method
    Liu, Yujie
    Mao, Shuai
    Mei, Xiang
    Yang, Tao
    Zhao, Xuran
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 433 - 436
  • [44] Fast Gradient Scaled Method for Generating Adversarial Examples
    Xu, Zhefeng
    Luo, Zhijian
    Mu, Jinlong
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 189 - 193
  • [45] A2HD: Adaptive Adversarial Training for Hyperdimensional Computing-Based Intrusion Detection Against Adversarial Attacks
    Gungor, Onat
    Rosing, Tajana
    Aksanli, Baris
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 107 - 113
  • [46] The Application of Adversarial Training Based on Gradient Constraint Optimization Method to Sentiment Analysis
    Xie, Zhichun
    Liu, Jianhua
    Hu, Renyuan
    Wang, Jiacan
    Wang, Xiaofeng
    Journal of Network Intelligence, 2024, 9 (01): : 587 - 598
  • [47] ASCL: Adversarial supervised contrastive learning for defense against word substitution attacks
    Shi, Jiahui
    Li, Linjing
    Zeng, Daniel
    NEUROCOMPUTING, 2022, 510 : 59 - 68
  • [48] Defense-VAE: A Fast and Accurate Defense Against Adversarial Attacks
    Li, Xiang
    Ji, Shihao
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 191 - 207
  • [49] Fast Generalized Predictive Control Based on Accelerated Dual Gradient Projection Method
    Peccin, Vinicius Berndsen
    Lima, Daniel Martins
    Costa Flesch, Rodolfo Cesar
    Normey-Rico, Julio Elias
    IFAC PAPERSONLINE, 2019, 52 (01): : 480 - 485
  • [50] Improved Gradient-Based Adversarial Attacks for Quantized Networks
    Gupta, Kartik
    Ajanthan, Thalaiyasingam
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6810 - 6818