ENHANCING ADVERSARIAL ROBUSTNESS FOR IMAGE CLASSIFICATION BY REGULARIZING CLASS LEVEL FEATURE DISTRIBUTION

被引:3
|
作者
Yu, Cheng [1 ]
Xue, Youze [1 ]
Chen, Jiansheng [1 ,2 ,3 ]
Wang, Yu [1 ]
Ma, Huimin [3 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Univ Sci & Technol Beijing, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年
基金
中国国家自然科学基金;
关键词
Adversarial Training; Intra and Inter Class Feature Regularization; Robustness;
D O I
10.1109/ICIP42928.2021.9506383
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent researches have shown that deep neural networks (DNNs) are vulnerable to adversarial examples. Adversarial training is practically the most effective approach to improve the robustness of DNNs against adversarial examples. However, conventional adversarial training methods only focus on the classification results or the instance level relationship on feature representations for adversarial examples. Inspired by the fact that adversarial examples break the distinguishability of the feature representations of DNNs for different classes, we propose Intra and Inter Class Feature Regularization ((IFR)-F-2) to make the feature distribution of adversarial examples maintain the same classification property as clean examples. On the one hand, the intra-class regularization restricts the distance of features between adversarial examples and both the corresponding clean data and samples for the same class. On the other hand, the inter-class regularization prevents the feature of adversarial examples from getting close to other classes. By adding (IFR)-F-2 in both adversarial example generation and model training steps in adversarial training, we can get stronger and more diverse adversarial examples, and the neural network learns a more distinguishable and reasonable feature distribution. Experiments on various adversarial training frameworks demonstrate that (IFR)-F-2 is adaptive for multiple training frameworks and outperforms the state-of-the-art methods for classification of both clean data and adversarial examples.
引用
收藏
页码:494 / 498
页数:5
相关论文
共 50 条
  • [21] Measuring Robustness to Natural Distribution Shifts in Image Classification
    Taori, Rohan
    Dave, Achal
    Shankar, Vaishaal
    Carlini, Nicholas
    Recht, Benjamin
    Schmidt, Ludwig
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] STUDY TO IMAGE CLASSIFICATION ON FEATURE LEVEL FUSION
    Xu Xuebin
    Zhang Xinman
    Zhang Deyun
    ADVANCES IN BIOMEDICAL PHOTONICS AND IMAGING, 2008, : 356 - 359
  • [23] Image classification and adversarial robustness analysis based on hybrid convolutional neural network
    Huang, Shui-Yuan
    An, Wan-Jia
    Zhang, De-Shun
    Zhou, Nan-Run
    OPTICS COMMUNICATIONS, 2023, 533
  • [24] ADVERSARIAL ROBUSTNESS OF DEEP LEARNING METHODS FOR SAR IMAGE CLASSIFICATION: AN EXPLAINABILITY VIEW
    Chen, Tianrui
    Wu, Juanping
    Guo, Weiwei
    Zhang, Zenghui
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 1987 - 1991
  • [25] Adversarial explanations for understanding image classification decisions and improved neural network robustness
    Walt Woods
    Jack Chen
    Christof Teuscher
    Nature Machine Intelligence, 2019, 1 : 508 - 516
  • [26] Adversarial explanations for understanding image classification decisions and improved neural network robustness
    Woods, Walt
    Chen, Jack
    Teuscher, Christof
    NATURE MACHINE INTELLIGENCE, 2019, 1 (11) : 508 - 516
  • [27] CANARY: An Adversarial Robustness Evaluation Platform for Deep Learning Models on Image Classification
    Sun, Jiazheng
    Chen, Li
    Xia, Chenxiao
    Zhang, Da
    Huang, Rong
    Qiu, Zhi
    Xiong, Wenqi
    Zheng, Jun
    Tan, Yu-An
    ELECTRONICS, 2023, 12 (17)
  • [28] Enhancing image steganography via adversarial optimization of the stego distribution
    Zha, Hongyue
    Zhang, Weiming
    Yu, Nenghai
    Fan, Zexin
    SIGNAL PROCESSING, 2023, 212
  • [29] A knowledge distillation strategy for enhancing the adversarial robustness of lightweight automatic modulation classification models
    Xu, Fanghao
    Wang, Chao
    Liang, Jiakai
    Zuo, Chenyang
    Yue, Keqiang
    Li, Wenjun
    IET COMMUNICATIONS, 2024, 18 (14) : 827 - 845
  • [30] Hyperspectral Image Classification with Feature-Oriented Adversarial Active Learning
    Wang, Guangxing
    Ren, Peng
    REMOTE SENSING, 2020, 12 (23) : 1 - 19