Defensive Dropout for Hardening Deep Neural Networks under Adversarial Attacks

Cited by: 41
Authors
Wang, Siyue [1 ]
Wang, Xiao [2 ]
Zhao, Pu [1 ]
Wen, Wujie [3 ]
Kaeli, David [1 ]
Chin, Peter [2 ]
Lin, Xue [1 ]
Affiliations
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Boston Univ, Boston, MA 02215 USA
[3] Florida Int Univ, Miami, FL 33199 USA
Funding
National Science Foundation (NSF), USA
DOI
10.1145/3240765.3264699
Chinese Library Classification
TP301 [Theory and Methods]
Subject Classification Code
081202
Abstract
Deep neural networks (DNNs) are known to be vulnerable to adversarial attacks: adversarial examples, obtained by adding delicately crafted distortions onto original legal inputs, can mislead a DNN into classifying them as any target label. This work provides a solution to hardening DNNs under adversarial attacks through defensive dropout. Besides using dropout during training for the best test accuracy, we propose to use dropout also at test time to achieve strong defense effects. We consider the problem of building robust DNNs as an attacker-defender two-player game, where the attacker and the defender know each other's strategies and try to optimize their own strategies towards an equilibrium. Based on observations of how the test dropout rate affects test accuracy and attack success rate, we propose a defensive dropout algorithm that determines an optimal test dropout rate given the neural network model and the attacker's strategy for generating adversarial examples. We also investigate the mechanism behind the outstanding defense effects achieved by the proposed defensive dropout. Compared with stochastic activation pruning (SAP), another defense method that introduces randomness into the DNN model, our defensive dropout achieves much larger variances of the gradients, which is the key to the improved defense effects (a much lower attack success rate). For example, our defensive dropout can reduce the attack success rate from 100% to 13.89% under the currently strongest attack, i.e., the C&W attack, on the MNIST dataset.
Pages: 8
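The defensive-dropout recipe described in the abstract (keep dropout stochastic at inference, then pick the test dropout rate that balances clean accuracy against the attacker's success rate) can be summarized in a short sketch. The following is a minimal illustration, assuming PyTorch; the network architecture, the candidate-rate grid, the accuracy-minus-success-rate scoring rule, and the `attack_success_rate` callable are all hypothetical stand-ins, not the authors' implementation.

```python
# A minimal sketch of the defensive-dropout idea, assuming PyTorch.
# The architecture, rate grid, and attack_success_rate callable are
# illustrative assumptions, not the paper's code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class DropoutNet(nn.Module):
    """MNIST-sized classifier that keeps dropout stochastic at test time."""

    def __init__(self, p: float = 0.5):
        super().__init__()
        self.p = p  # test-time (defensive) dropout rate
        self.fc1 = nn.Linear(784, 512)
        self.fc2 = nn.Linear(512, 10)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = F.relu(self.fc1(x.view(x.size(0), -1)))
        # training=True forces dropout to stay active even under model.eval(),
        # so every forward pass samples a different subnetwork.
        x = F.dropout(x, p=self.p, training=True)
        return self.fc2(x)


@torch.no_grad()
def test_accuracy(model: nn.Module, loader) -> float:
    """Clean accuracy measured under stochastic test-time dropout."""
    correct, total = 0, 0
    for inputs, labels in loader:
        preds = model(inputs).argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return correct / total


def choose_test_dropout_rate(model, loader, attack_success_rate,
                             rates=(0.1, 0.3, 0.5, 0.7)):
    """Scan candidate test dropout rates and keep the one that best trades
    clean accuracy against the attacker's success rate, given a fixed
    attacker strategy (a hypothetical scoring rule for illustration)."""
    best_rate, best_score = None, float("-inf")
    for p in rates:
        model.p = p
        score = test_accuracy(model, loader) - attack_success_rate(model, loader)
        if score > best_score:
            best_rate, best_score = p, score
    return best_rate
```

Here `attack_success_rate` would wrap a chosen attack (e.g., a C&W implementation) and report the fraction of adversarial examples that fool the model; per the abstract, the defense derives its strength from the high variance of gradients sampled across stochastic forward passes, which degrades the attacker's optimization.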
Related Papers (50 in total)
  • [1] Is Approximation Universally Defensive Against Adversarial Attacks in Deep Neural Networks?
    Siddique, Ayesha
    Hoque, Khaza Anuarul
    [J]. PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022: 364-369
  • [2] Defending Against Adversarial Attacks in Deep Neural Networks
    You, Suya
    Kuo, C-C Jay
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS, 2019, 11006
  • [3] Detecting adversarial example attacks to deep neural networks
    Carrara, Fabio
    Falchi, Fabrizio
    Caldelli, Roberto
    Amato, Giuseppe
    Fumarola, Roberta
    Becarelli, Rudy
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017.
  • [4] Hardening Deep Neural Networks via Adversarial Model Cascades
    Vijaykeerthy, Deepak
    Suri, Anshuman
    Mehta, Sameep
    Kumaraguru, Ponnurangam
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019.
  • [5] Adversarial Dropout for Recurrent Neural Networks
    Park, Sungrae
    Song, Kyungwoo
    Ji, Mingi
    Lee, Wonsung
    Moon, Il-Chul
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 4699-4706
  • [6] Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks
    Guo, Haoqiang
    Peng, Lu
    Zhang, Jian
    Qi, Fang
    Duan, Lide
    [J]. 2019 TENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2019.
  • [7] Bluff: Interactively Deciphering Adversarial Attacks on Deep Neural Networks
    Das, Nilaksh
    Park, Haekyu
    Wang, Zijie J.
    Hohman, Fred
    Firstman, Robert
    Rogers, Emily
    Chau, Duen Horng
    [J]. 2020 IEEE VISUALIZATION CONFERENCE - SHORT PAPERS (VIS 2020), 2020: 271-275
  • [8] A survey on the vulnerability of deep neural networks against adversarial attacks
    Michel, Andy
    Jha, Sumit Kumar
    Ewetz, Rickard
    [J]. Progress in Artificial Intelligence, 2022, 11: 131-141
  • [9] Reinforced Adversarial Attacks on Deep Neural Networks Using ADMM
    Zhao, Pu
    Xu, Kaidi
    Zhang, Tianyun
    Fardad, Makan
    Wang, Yanzhi
    Lin, Xue
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018: 1169-1173
  • [10] Adversarial Attacks on Deep Neural Networks Based Modulation Recognition
    Liu, Mingqian
    Zhang, Zhenju
    Zhao, Nan
    Chen, Yunfei
    [J]. IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022.