Sanitizing hidden activations for improving adversarial robustness of convolutional neural networks

被引：0

作者：

Mu, Tianshi ^{[1
]}

Lin, Kequan ^{[1
]}

Zhang, Huabing ^{[1
]}

Wang, Jian ^{[1
]}

机构：

[1] China Southern Power Grid, Digital Grid Res Inst, Guangzhou 510700, Peoples R China

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2021年 / 41卷 / 02期

关键词：

Adversarial examples; sanitizing hidden activations; adversarial robustness; convolutional neural networks;

D O I：

10.3233/JIFS-210371

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning is gaining significant traction in a wide range of areas. Whereas, recent studies have demonstrated that deep learning exhibits the fatal weakness on adversarial examples. Due to the black-box nature and un-transparency problem of deep learning, it is difficult to explain the reason for the existence of adversarial examples and also hard to defend against them. This study focuses on improving the adversarial robustness of convolutional neural networks. We first explore how adversarial examples behave inside the network through visualization. We find that adversarial examples produce perturbations in hidden activations, which forms an amplification effect to fool the network. Motivated by this observation, we propose an approach, termed as sanitizing hidden activations, to help the network correctly recognize adversarial examples by eliminating or reducing the perturbations in hidden activations. To demonstrate the effectiveness of our approach, we conduct experiments on three widely used datasets: MNIST, CIFAR-10 and ImageNet, and also compare with state-of-the-art defense techniques. The experimental results show that our sanitizing approach is more generalized to defend against different kinds of attacks and can effectively improve the adversarial robustness of convolutional neural networks.

引用

页码：3993 / 4003

页数：11

共 50 条

[1] Uncovering Hidden Vulnerabilities in Convolutional Neural Networks through Graph-based Adversarial Robustness Evaluation
Wang, Ke
Chen, Zicong
Dang, Xilin
Fan, Xuan
Han, Xuming
Chen, Chien-Ming
Ding, Weiping
Yiu, Siu-Ming
Weng, Jian
[J]. PATTERN RECOGNITION, 2023, 143
[2] An orthogonal classifier for improving the adversarial robustness of neural networks
Xu, Cong
Li, Xiang
Yang, Min
[J]. INFORMATION SCIENCES, 2022, 591 : 251 - 262
[3] Exploring adversarial examples and adversarial robustness of convolutional neural networks by mutual information
Jiebao Zhang
Wenhua Qian
Jinde Cao
Dan Xu
[J]. Neural Computing and Applications, 2024, 36 (23) : 14379 - 14394
[4] Adversarial Robustness of Multi-bit Convolutional Neural Networks
Frickenstein, Lukas
Sampath, Shambhavi Balamuthu
Mori, Pierpaolo
Vemparala, Manoj-Rohit
Fasfous, Nael
Frickenstein, Alexander
Unger, Christian
Passerone, Claudio
Stechele, Walter
[J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023, 2024, 824 : 157 - 174
[5] Adversarial Robustness of Vision Transformers Versus Convolutional Neural Networks
Ali, Kazim
Bhatti, Muhammad Shahid
Saeed, Atif
Athar, Atifa
Al Ghamdi, Mohammed A.
Almotiri, Sultan H.
Akram, Samina
[J]. IEEE ACCESS, 2024, 12 : 105281 - 105293
[6] Unreasonable Effectiveness of Last Hidden Layer Activations for Adversarial Robustness
Tuna, Omer Faruk
Catak, Ferhat Ozgur
Eskil, M. Taner
[J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1098 - 1103
[7] Strategies for Improving the Error Robustness of Convolutional Neural Networks
Morais, Antonio
Barbosa, Raul
Lourenco, Nuno
Cerveira, Frederico
Lombardi, Michele
Madeira, Henrique
[J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 874 - 883
[8] Towards Improving Robustness of Deep Neural Networks to Adversarial Perturbations
Amini, Sajjad
Ghaemmaghami, Shahrokh
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1889 - 1903
[9] IMPROVING THE ROBUSTNESS OF CONVOLUTIONAL NEURAL NETWORKS VIA SKETCH ATTENTION
Chu, Tianshu
Yang, Zuopeng
Yang, Jie
Huang, Xiaolin
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 869 - 873
[10] Improving Face Liveness Detection Robustness with Deep Convolutional Generative Adversarial Networks
Padnevych, Ruslan
Semedo, David
Carmo, David
Magalhaes, Joao
[J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1866 - 1870

← 1 2 3 4 5 →