ENHANCING ADVERSARIAL ROBUSTNESS FOR IMAGE CLASSIFICATION BY REGULARIZING CLASS LEVEL FEATURE DISTRIBUTION

被引：3

作者：

Yu, Cheng ^{[1
]}

Xue, Youze ^{[1
]}

Chen, Jiansheng ^{[1
,2
,3
]}

Wang, Yu ^{[1
]}

Ma, Huimin ^{[3
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China

[3] Univ Sci & Technol Beijing, Beijing, Peoples R China

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年

基金：

中国国家自然科学基金;

关键词：

Adversarial Training; Intra and Inter Class Feature Regularization; Robustness;

D O I：

10.1109/ICIP42928.2021.9506383

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent researches have shown that deep neural networks (DNNs) are vulnerable to adversarial examples. Adversarial training is practically the most effective approach to improve the robustness of DNNs against adversarial examples. However, conventional adversarial training methods only focus on the classification results or the instance level relationship on feature representations for adversarial examples. Inspired by the fact that adversarial examples break the distinguishability of the feature representations of DNNs for different classes, we propose Intra and Inter Class Feature Regularization ((IFR)-F-2) to make the feature distribution of adversarial examples maintain the same classification property as clean examples. On the one hand, the intra-class regularization restricts the distance of features between adversarial examples and both the corresponding clean data and samples for the same class. On the other hand, the inter-class regularization prevents the feature of adversarial examples from getting close to other classes. By adding (IFR)-F-2 in both adversarial example generation and model training steps in adversarial training, we can get stronger and more diverse adversarial examples, and the neural network learns a more distinguishable and reasonable feature distribution. Experiments on various adversarial training frameworks demonstrate that (IFR)-F-2 is adaptive for multiple training frameworks and outperforms the state-of-the-art methods for classification of both clean data and adversarial examples.

引用

页码：494 / 498

页数：5

共 50 条

[21] Measuring Robustness to Natural Distribution Shifts in Image Classification
Taori, Rohan
Dave, Achal
Shankar, Vaishaal
Carlini, Nicholas
Recht, Benjamin
Schmidt, Ludwig
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[22] STUDY TO IMAGE CLASSIFICATION ON FEATURE LEVEL FUSION
Xu Xuebin
Zhang Xinman
Zhang Deyun
ADVANCES IN BIOMEDICAL PHOTONICS AND IMAGING, 2008, : 356 - 359
[23] Image classification and adversarial robustness analysis based on hybrid convolutional neural network
Huang, Shui-Yuan
An, Wan-Jia
Zhang, De-Shun
Zhou, Nan-Run
OPTICS COMMUNICATIONS, 2023, 533
[24] ADVERSARIAL ROBUSTNESS OF DEEP LEARNING METHODS FOR SAR IMAGE CLASSIFICATION: AN EXPLAINABILITY VIEW
Chen, Tianrui
Wu, Juanping
Guo, Weiwei
Zhang, Zenghui
IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 1987 - 1991
[25] Adversarial explanations for understanding image classification decisions and improved neural network robustness
Walt Woods
Jack Chen
Christof Teuscher
Nature Machine Intelligence, 2019, 1 : 508 - 516
[26] Adversarial explanations for understanding image classification decisions and improved neural network robustness
Woods, Walt
Chen, Jack
Teuscher, Christof
NATURE MACHINE INTELLIGENCE, 2019, 1 (11) : 508 - 516
[27] CANARY: An Adversarial Robustness Evaluation Platform for Deep Learning Models on Image Classification
Sun, Jiazheng
Chen, Li
Xia, Chenxiao
Zhang, Da
Huang, Rong
Qiu, Zhi
Xiong, Wenqi
Zheng, Jun
Tan, Yu-An
ELECTRONICS, 2023, 12 (17)
[28] Enhancing image steganography via adversarial optimization of the stego distribution
Zha, Hongyue
Zhang, Weiming
Yu, Nenghai
Fan, Zexin
SIGNAL PROCESSING, 2023, 212
[29] A knowledge distillation strategy for enhancing the adversarial robustness of lightweight automatic modulation classification models
Xu, Fanghao
Wang, Chao
Liang, Jiakai
Zuo, Chenyang
Yue, Keqiang
Li, Wenjun
IET COMMUNICATIONS, 2024, 18 (14) : 827 - 845
[30] Hyperspectral Image Classification with Feature-Oriented Adversarial Active Learning
Wang, Guangxing
Ren, Peng
REMOTE SENSING, 2020, 12 (23) : 1 - 19

← 1 2 3 4 5 →