ENHANCING ADVERSARIAL ROBUSTNESS FOR IMAGE CLASSIFICATION BY REGULARIZING CLASS LEVEL FEATURE DISTRIBUTION

被引:3
|
作者
Yu, Cheng [1 ]
Xue, Youze [1 ]
Chen, Jiansheng [1 ,2 ,3 ]
Wang, Yu [1 ]
Ma, Huimin [3 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Univ Sci & Technol Beijing, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年
基金
中国国家自然科学基金;
关键词
Adversarial Training; Intra and Inter Class Feature Regularization; Robustness;
D O I
10.1109/ICIP42928.2021.9506383
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent researches have shown that deep neural networks (DNNs) are vulnerable to adversarial examples. Adversarial training is practically the most effective approach to improve the robustness of DNNs against adversarial examples. However, conventional adversarial training methods only focus on the classification results or the instance level relationship on feature representations for adversarial examples. Inspired by the fact that adversarial examples break the distinguishability of the feature representations of DNNs for different classes, we propose Intra and Inter Class Feature Regularization ((IFR)-F-2) to make the feature distribution of adversarial examples maintain the same classification property as clean examples. On the one hand, the intra-class regularization restricts the distance of features between adversarial examples and both the corresponding clean data and samples for the same class. On the other hand, the inter-class regularization prevents the feature of adversarial examples from getting close to other classes. By adding (IFR)-F-2 in both adversarial example generation and model training steps in adversarial training, we can get stronger and more diverse adversarial examples, and the neural network learns a more distinguishable and reasonable feature distribution. Experiments on various adversarial training frameworks demonstrate that (IFR)-F-2 is adaptive for multiple training frameworks and outperforms the state-of-the-art methods for classification of both clean data and adversarial examples.
引用
收藏
页码:494 / 498
页数:5
相关论文
共 50 条
  • [1] Between-Class Adversarial Training for Improving Adversarial Robustness of Image Classification
    Wang, Desheng
    Jin, Weidong
    Wu, Yunpu
    SENSORS, 2023, 23 (06)
  • [2] Benchmarking Adversarial Robustness on Image Classification
    Dong, Yinpeng
    Fu, Qi-An
    Yang, Xiao
    Pang, Tianyu
    Su, Hang
    Xiao, Zihao
    Zhu, Jun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 318 - 328
  • [3] Enhancing Image Classification Robustness through Adversarial Sampling with Delta Data Augmentation (DDA)
    Reyes-Amezcua, Ivan
    Ochoa-Ruiz, Gilberto
    Mendez-Vazquez, Andres
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 274 - 283
  • [4] A Review of Adversarial Robustness Evaluation for Image Classification
    Li, Zituo
    Sun, Jianbin
    Yang, Kewei
    Xiong, Dehui
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (10): : 2164 - 2189
  • [5] AdHierNet: Enhancing Adversarial Robustness and Interpretability in Text Classification
    Chen, Kai
    Deng, Yingping
    Chen, Qingcai
    Li, Dongfeng
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 41 - 45
  • [6] Enhancing Intrinsic Adversarial Robustness via Feature Pyramid Decoder
    Li, Guanlin
    Ding, Shuya
    Luo, Jun
    Liu, Chang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 797 - 805
  • [7] Toward a Better Tradeoff Between Accuracy and Robustness for Image Classification via Adversarial Feature Diversity
    Xue, Wei
    Wang, Yonghao
    Wang, Yuchi
    Wang, Yue
    Du, Mingyang
    Zheng, Xiao
    IEEE JOURNAL ON MINIATURIZATION FOR AIR AND SPACE SYSTEMS, 2024, 5 (04): : 254 - 264
  • [8] Enhancing the adversarial robustness in medical image classification: exploring adversarial machine learning with vision transformers-based models
    Elif Kanca Gulsoy
    Selen Ayas
    Elif Baykal Kablan
    Murat Ekinci
    Neural Computing and Applications, 2025, 37 (12) : 7971 - 7989
  • [9] Adversarial Robustness on Image Classification With k-Means
    Omari, Rollin
    Kim, Junae
    Montague, Paul
    IEEE ACCESS, 2024, 12 : 28853 - 28859
  • [10] Edge enhancement improves adversarial robustness in image classification
    He, Lirong
    Ai, Qingzhong
    Lei, Yuqing
    Pan, Lili
    Ren, Yazhou
    Xu, Zenglin
    NEUROCOMPUTING, 2023, 518 : 122 - 132