ENHANCING ADVERSARIAL ROBUSTNESS FOR IMAGE CLASSIFICATION BY REGULARIZING CLASS LEVEL FEATURE DISTRIBUTION

被引:3
|
作者
Yu, Cheng [1 ]
Xue, Youze [1 ]
Chen, Jiansheng [1 ,2 ,3 ]
Wang, Yu [1 ]
Ma, Huimin [3 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Univ Sci & Technol Beijing, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年
基金
中国国家自然科学基金;
关键词
Adversarial Training; Intra and Inter Class Feature Regularization; Robustness;
D O I
10.1109/ICIP42928.2021.9506383
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent researches have shown that deep neural networks (DNNs) are vulnerable to adversarial examples. Adversarial training is practically the most effective approach to improve the robustness of DNNs against adversarial examples. However, conventional adversarial training methods only focus on the classification results or the instance level relationship on feature representations for adversarial examples. Inspired by the fact that adversarial examples break the distinguishability of the feature representations of DNNs for different classes, we propose Intra and Inter Class Feature Regularization ((IFR)-F-2) to make the feature distribution of adversarial examples maintain the same classification property as clean examples. On the one hand, the intra-class regularization restricts the distance of features between adversarial examples and both the corresponding clean data and samples for the same class. On the other hand, the inter-class regularization prevents the feature of adversarial examples from getting close to other classes. By adding (IFR)-F-2 in both adversarial example generation and model training steps in adversarial training, we can get stronger and more diverse adversarial examples, and the neural network learns a more distinguishable and reasonable feature distribution. Experiments on various adversarial training frameworks demonstrate that (IFR)-F-2 is adaptive for multiple training frameworks and outperforms the state-of-the-art methods for classification of both clean data and adversarial examples.
引用
收藏
页码:494 / 498
页数:5
相关论文
共 50 条
  • [31] Feature-aware transferable adversarial attacks against image classification
    Cheng, Shuyan
    Li, Peng
    Han, Keji
    Xu, He
    APPLIED SOFT COMPUTING, 2024, 161
  • [32] Global Learnable Pooling With Enhancing Distinctive Feature for Image Classification
    Zhang, Xingpeng
    Zhang, Xiaohong
    IEEE ACCESS, 2020, 8 : 98539 - 98547
  • [33] Global Learnable Pooling with Enhancing Distinctive Feature for Image Classification
    Zhang, Xingpeng
    Zhang, Xiaohong
    IEEE Access, 2020, 8 : 98539 - 98547
  • [34] Quantum Transfer Learning with Adversarial Robustness for Classification of High-Resolution Image Datasets
    Khatun, Amena
    Usman, Muhammad
    ADVANCED QUANTUM TECHNOLOGIES, 2025, 8 (01)
  • [35] Robustness of Image-Based Malware Classification Models trained with Generative Adversarial Networks
    Reilly, Ciaran
    O'Shaughnessy, Stephen
    Thorpe, Christina
    PROCEEDINGS OF THE 2023 EUROPEAN INTERDISCIPLINARY CYBERSECURITY CONFERENCE, EICC 2023, 2023, : 92 - 99
  • [36] Class Reconstruction Driven Adversarial Domain Adaptation for Hyperspectral Image Classification
    Pande, Shivam
    Banerjee, Biplab
    Pizurica, Aleksandra
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I, 2020, 11867 : 472 - 484
  • [37] Rethinking Feature Distribution for Loss Functions in Image Classification
    Wan, Weitao
    Zhong, Yuanyi
    Li, Tianpeng
    Chen, Jiansheng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9117 - 9126
  • [38] Attack-invariant attention feature for adversarial defense in hyperspectral image classification
    Shi, Cheng
    Liu, Ying
    Zhao, Minghua
    Pun, Chi-Man
    Miao, Qiguang
    PATTERN RECOGNITION, 2024, 145
  • [39] Active Deep Feature Extraction for Hyperspectral Image Classification Based on Adversarial Learning
    Wang, Xue
    Tan, Kun
    Pan, Cen
    Ding, Jianwei
    Liu, Zhaoxian
    Han, Bo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] FEATURE EXTRACTION FRAMEWORK IN CLASS SPACE FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Zhao, Ji
    Zhong, Yanfei
    Gao, Rongrong
    Zhang, Liangpei
    Shu, Hong
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 3164 - 3167