Sanitizing hidden activations for improving adversarial robustness of convolutional neural networks

被引:0
|
作者
Mu, Tianshi [1 ]
Lin, Kequan [1 ]
Zhang, Huabing [1 ]
Wang, Jian [1 ]
机构
[1] China Southern Power Grid, Digital Grid Res Inst, Guangzhou 510700, Peoples R China
关键词
Adversarial examples; sanitizing hidden activations; adversarial robustness; convolutional neural networks;
D O I
10.3233/JIFS-210371
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning is gaining significant traction in a wide range of areas. Whereas, recent studies have demonstrated that deep learning exhibits the fatal weakness on adversarial examples. Due to the black-box nature and un-transparency problem of deep learning, it is difficult to explain the reason for the existence of adversarial examples and also hard to defend against them. This study focuses on improving the adversarial robustness of convolutional neural networks. We first explore how adversarial examples behave inside the network through visualization. We find that adversarial examples produce perturbations in hidden activations, which forms an amplification effect to fool the network. Motivated by this observation, we propose an approach, termed as sanitizing hidden activations, to help the network correctly recognize adversarial examples by eliminating or reducing the perturbations in hidden activations. To demonstrate the effectiveness of our approach, we conduct experiments on three widely used datasets: MNIST, CIFAR-10 and ImageNet, and also compare with state-of-the-art defense techniques. The experimental results show that our sanitizing approach is more generalized to defend against different kinds of attacks and can effectively improve the adversarial robustness of convolutional neural networks.
引用
收藏
页码:3993 / 4003
页数:11
相关论文
共 50 条
  • [1] Uncovering Hidden Vulnerabilities in Convolutional Neural Networks through Graph-based Adversarial Robustness Evaluation
    Wang, Ke
    Chen, Zicong
    Dang, Xilin
    Fan, Xuan
    Han, Xuming
    Chen, Chien-Ming
    Ding, Weiping
    Yiu, Siu-Ming
    Weng, Jian
    [J]. PATTERN RECOGNITION, 2023, 143
  • [2] An orthogonal classifier for improving the adversarial robustness of neural networks
    Xu, Cong
    Li, Xiang
    Yang, Min
    [J]. INFORMATION SCIENCES, 2022, 591 : 251 - 262
  • [3] Exploring adversarial examples and adversarial robustness of convolutional neural networks by mutual information
    Jiebao Zhang
    Wenhua Qian
    Jinde Cao
    Dan Xu
    [J]. Neural Computing and Applications, 2024, 36 (23) : 14379 - 14394
  • [4] Adversarial Robustness of Multi-bit Convolutional Neural Networks
    Frickenstein, Lukas
    Sampath, Shambhavi Balamuthu
    Mori, Pierpaolo
    Vemparala, Manoj-Rohit
    Fasfous, Nael
    Frickenstein, Alexander
    Unger, Christian
    Passerone, Claudio
    Stechele, Walter
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023, 2024, 824 : 157 - 174
  • [5] Adversarial Robustness of Vision Transformers Versus Convolutional Neural Networks
    Ali, Kazim
    Bhatti, Muhammad Shahid
    Saeed, Atif
    Athar, Atifa
    Al Ghamdi, Mohammed A.
    Almotiri, Sultan H.
    Akram, Samina
    [J]. IEEE ACCESS, 2024, 12 : 105281 - 105293
  • [6] Unreasonable Effectiveness of Last Hidden Layer Activations for Adversarial Robustness
    Tuna, Omer Faruk
    Catak, Ferhat Ozgur
    Eskil, M. Taner
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1098 - 1103
  • [7] Strategies for Improving the Error Robustness of Convolutional Neural Networks
    Morais, Antonio
    Barbosa, Raul
    Lourenco, Nuno
    Cerveira, Frederico
    Lombardi, Michele
    Madeira, Henrique
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 874 - 883
  • [8] Towards Improving Robustness of Deep Neural Networks to Adversarial Perturbations
    Amini, Sajjad
    Ghaemmaghami, Shahrokh
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1889 - 1903
  • [9] IMPROVING THE ROBUSTNESS OF CONVOLUTIONAL NEURAL NETWORKS VIA SKETCH ATTENTION
    Chu, Tianshu
    Yang, Zuopeng
    Yang, Jie
    Huang, Xiaolin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 869 - 873
  • [10] Improving Face Liveness Detection Robustness with Deep Convolutional Generative Adversarial Networks
    Padnevych, Ruslan
    Semedo, David
    Carmo, David
    Magalhaes, Joao
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1866 - 1870