Saliency Map-Based Local White-Box Adversarial Attack Against Deep Neural Networks

被引:1
|
作者
Liu, Haohan [1 ,2 ]
Zuo, Xingquan [1 ,2 ]
Huang, Hai [1 ]
Wan, Xing [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China
[2] Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing, Peoples R China
来源
关键词
Deep learning; Saliency map; Local white-box attack; Adversarial attack;
D O I
10.1007/978-3-031-20500-2_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current deep neural networks (DNN) are easily fooled by adversarial examples, which are generated by adding some small, well-designed and human-imperceptible perturbations to clean examples. Adversarial examples will mislead deep learning (DL) model to make wrong predictions. At present, many existing white-box attack methods in the image field are mainly based on the global gradient of the model. That is, the global gradient is first calculated, and then the perturbation is added into the gradient direction. Those methods usually have a high attack success rate. However, there are also some shortcomings, such as excessive perturbation and easy detection by the human's eye. Therefore, in this paper we propose a SaliencyMap-based Local white-box Adversarial Attack method (SMLAA). The saliencymap used in the interpretability of artificial intelligence is introduced in SMLAA. First, Gradient-weighted Class Activation Mapping (Grad-CAM) is utilized to provide a visual interpretation of model decisions to find important areas in an image. Then, the perturbation is added only to important local areas to reduce the magnitude of perturbations. Experimental results show that compared with the global attack method, SMLAA reduces the average robustness measure by 9%-24% while ensuring the attack success rate. It means that SMLAA has a high attack success rate with fewer pixels changed.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 50 条
  • [1] Dynamic Programming-Based White Box Adversarial Attack for Deep Neural Networks
    Aggarwal, Swati
    Mittal, Anshul
    Aggarwal, Sanchit
    Singh, Anshul Kumar
    AI, 2024, 5 (03) : 1216 - 1234
  • [2] Robustness of Bayesian Neural Networks to White-Box Adversarial Attacks
    Uchendu, Adaku
    Campoy, Daniel
    Menart, Christopher
    Hildenbrandt, Alexandra
    2021 IEEE FOURTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2021), 2021, : 72 - 80
  • [3] DI-AA: An interpretable white-box attack for fooling deep neural networks
    Wang, Yixiang
    Liu, Jiqiang
    Chang, Xiaolin
    Rodriguez, Ricardo J.
    Wang, Jianhua
    INFORMATION SCIENCES, 2022, 610 : 14 - 32
  • [4] Impact of White-Box Adversarial Attacks on Convolutional Neural Networks
    Podder, Rakesh
    Ghosh, Sudipto
    2024 International Conference on Emerging Trends in Networks and Computer Communications, ETNCC 2024 - Proceedings, 2024, : 41 - 49
  • [5] DI-AA: An interpretable white-box attack for fooling deep neural networks
    Wang, Yixiang
    Liu, Jiqiang
    Chang, Xiaolin
    Rodríguez, Ricardo J.
    Wang, Jianhua
    Information Sciences, 2022, 610 : 14 - 32
  • [6] A White-Box Testing for Deep Neural Networks Based on Neuron Coverage
    Yu, Jing
    Duan, Shukai
    Ye, Xiaojun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9185 - 9197
  • [7] Black-box Adversarial Attack against Visual Interpreters for Deep Neural Networks
    Hirose, Yudai
    Ono, Satoshi
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [8] Understanding and defending against White-box membership inference attack in deep learning
    Wu, Di
    Qi, Saiyu
    Qi, Yong
    Li, Qian
    Cai, Bowen
    Guo, Qi
    Cheng, Jingxian
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [9] Diversity Adversarial Training against Adversarial Attack on Deep Neural Networks
    Kwon, Hyun
    Lee, Jun
    SYMMETRY-BASEL, 2021, 13 (03):
  • [10] White-Box Multi-Objective Adversarial Attack on Dialogue Generation
    Li, Yufei
    Li, Zexin
    Gao, Yingfan
    Liu, Cong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1778 - 1792