Exploring adversarial examples and adversarial robustness of convolutional neural networks by mutual information

Cited: 0
Authors
Zhang J. [1 ]
Qian W. [1 ]
Cao J. [2 ,3 ]
Xu D. [1 ]
Affiliations
[1] School of Information Science and Engineering, Yunnan University, Kunming
[2] School of Mathematics, Southeast University, Nanjing
[3] Ahlia University, Manama
Funding
National Natural Science Foundation of China
Keywords
Adversarial attacks; Adversarial examples; Deep neural networks; Mutual information
DOI
10.1007/s00521-024-09774-z
Abstract
Convolutional neural networks (CNNs) are susceptible to adversarial examples, which resemble original examples but contain malicious perturbations. Adversarial training is a simple and effective defense that improves the robustness of CNNs to adversarial examples. Many works explore the mechanisms behind adversarial examples and adversarial training, but mutual information has rarely been used to interpret these counter-intuitive phenomena. This work investigates the similarities and differences between normally trained CNNs (NT-CNNs) and adversarially trained CNNs (AT-CNNs) from the mutual information perspective. We show that although the mutual information trends of NT-CNNs and AT-CNNs are similar throughout training on both original and adversarial examples, a clear difference remains: compared with NT-CNNs, AT-CNNs achieve lower clean accuracy and extract less information from the input. CNNs trained with different methods also prefer different types of information; NT-CNNs tend to extract texture-based information from the input, while AT-CNNs prefer shape-based information. Adversarial examples may mislead CNNs because they carry more texture-based information about other classes. Furthermore, we analyze the mutual information estimators used in this work and find that they outline the geometric properties of the middle layer's output. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
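The abstract does not name the mutual information estimators the authors use, so the following is an illustration only: a minimal NumPy sketch of the classic binning estimator for I(X; T) between the input X and a middle layer's output T. For a deterministic network with one distinct input per sample, H(T | X) = 0, so I(X; T) reduces to the entropy H(T) of the discretized activations. The function name and bin count below are our own assumptions, not the paper's method.

    import numpy as np

    def binned_mutual_information(activations, n_bins=30):
        # Discretize each activation into one of n_bins levels over its range.
        lo, hi = float(activations.min()), float(activations.max())
        edges = np.linspace(lo, hi, n_bins + 1)
        digitized = np.digitize(activations, edges[1:-1])  # shape (N, D)
        # Each row's binned pattern is one discrete symbol of T;
        # I(X; T) = H(T) here because T is a deterministic function of X.
        _, counts = np.unique(digitized, axis=0, return_counts=True)
        probs = counts / counts.sum()
        return float(-np.sum(probs * np.log2(probs)))  # H(T) in bits

    # Toy usage with stand-in activations (not the paper's data):
    rng = np.random.default_rng(0)
    acts = rng.normal(size=(512, 64))                       # N=512 samples, D=64 units
    print(binned_mutual_information(acts))                  # ~log2(512) = 9 bits: all rows distinct
    print(binned_mutual_information(np.sign(acts[:, :2])))  # far lower: heavily compressed codes

In the paper's setting, acts would presumably be a CNN layer's activations on original versus adversarial examples, recorded over training, so that the two mutual information curves can be compared as the abstract describes.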
Pages: 14379 - 14394
Page count: 15
Related papers
50 records in total
  • [1] Robustness of Deep Neural Networks in Adversarial Examples
    Teng, Da
    Song, Xiao
    Gong, Guanghong
    Han, Liang
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2017, 24 (02) : 123 - 133
  • [2] Detecting Adversarial Examples on Deep Neural Networks With Mutual Information Neural Estimation
    Gao, Song
    Wang, Ruxin
    Wang, Xiaoxuan
    Yu, Shui
    Dong, Yunyun
    Yao, Shaowen
    Zhou, Wei
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (06) : 5168 - 5181
  • [3] Adversarial Robustness of Multi-bit Convolutional Neural Networks
    Frickenstein, Lukas
    Sampath, Shambhavi Balamuthu
    Mori, Pierpaolo
    Vemparala, Manoj-Rohit
    Fasfous, Nael
    Frickenstein, Alexander
    Unger, Christian
    Passerone, Claudio
    Stechele, Walter
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023, 2024, 824 : 157 - 174
  • [4] Adversarial Robustness of Vision Transformers Versus Convolutional Neural Networks
    Ali, Kazim
    Bhatti, Muhammad Shahid
    Saeed, Atif
    Athar, Atifa
    Al Ghamdi, Mohammed A.
    Almotiri, Sultan H.
    Akram, Samina
    IEEE ACCESS, 2024, 12 : 105281 - 105293
  • [5] Explaining Adversarial Examples by Local Properties of Convolutional Neural Networks
    Aghdam, Hamed H.
    Heravi, Elnaz J.
    Puig, Domenec
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 226 - 234
  • [6] Parseval Networks: Improving Robustness to Adversarial Examples
    Cisse, Moustapha
    Bojanowski, Piotr
    Grave, Edouard
    Dauphin, Yann
    Usunier, Nicolas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [7] Sanitizing hidden activations for improving adversarial robustness of convolutional neural networks
    Mu, Tianshi
    Lin, Kequan
    Zhang, Huabing
    Wang, Jian
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (02) : 3993 - 4003
  • [8] Retrieval-Augmented Convolutional Neural Networks against Adversarial Examples
    Zhao, Jake
    Cho, Kyunghyun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11555 - 11563
  • [9] Exploring the Impact of Conceptual Bottlenecks on Adversarial Robustness of Deep Neural Networks
    Rasheed, Bader
    Abdelhamid, Mohamed
    Khan, Adil
    Menezes, Igor
    Khattak, Asad Masood
    IEEE ACCESS, 2024, 12 : 131323 - 131335