Improving the transferability of adversarial attacks via self-ensemble

被引:1
|
作者
Cheng, Shuyan [1 ]
Li, Peng [1 ]
Liu, Jianguo [1 ]
Xu, He [1 ]
Yao, Yudong [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Stevens Inst Technol, Dept Elect & Comp Engn, Hoboken, NJ 07030 USA
基金
中国国家自然科学基金;
关键词
Black-box attacks; Transferability; Adversarial examples; Self-ensemble; Feature importance;
D O I
10.1007/s10489-024-05728-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance of neural networks to adversarial attacks, researchers have investigated the security issues of models from the perspectives of both attacks and defenses. Recently, the transferability of adversarial attacks has received extensive attention, which promotes the application of adversarial attacks in practical scenarios. However, existing transferable attacks tend to trap into a poor local optimum and significantly degrade the transferability because the production of adversarial samples lacks randomness. Therefore, we propose a self-ensemble-based feature-level adversarial attack (SEFA) to boost transferability by randomly disrupting salient features. We provide theoretical analysis to demonstrate the superiority of the proposed method. In particular, perturbing the refined feature importance weighted intermediate features suppresses positive features and encourages negative features to realize adversarial attacks. Subsequently, self-ensemble is introduced to solve the optimization problem, thus enhancing the diversity from an optimization perspective. The diverse orthogonal initial perturbations disrupt these features stochastically, searching the space of transferable perturbations exhaustively to avoid poor local optima and improve transferability effectively. Extensive experiments show the effectiveness and superiority of the proposed SEFA, i.e., the success rates against undefended models and defense models are improved by 7.7% and 13.4%, respectively, compared with existing transferable attacks. Our code is available at https://github.com/chengshuyan/SEFA.
引用
收藏
页码:10608 / 10626
页数:19
相关论文
共 50 条
  • [31] Improving Adversarial Robustness via Promoting Ensemble Diversity
    Pang, Tianyu
    Xu, Kun
    Du, Chao
    Chen, Ning
    Zhu, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] SELF-ENSEMBLE VARIANCE REGULARIZATION FOR DOMAIN ADAPTATION
    Liu, Xinyi
    Dai, Tao
    Xia, Shu-Tao
    Jiang, Yong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3853 - 3857
  • [33] AcneTyper: An automatic diagnosis method of dermoscopic acne image via self-ensemble and stacking
    Liu, Shuai
    Chen, Ruili
    Gu, Yun
    Yu, Qiong
    Su, Guoxiong
    Ren, Yanjiao
    Huang, Lan
    Zhou, Fengfeng
    TECHNOLOGY AND HEALTH CARE, 2023, 31 (04) : 1171 - 1187
  • [34] Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial Transferability
    Xiong, Yifeng
    Lin, Jiadong
    Zhang, Min
    Hopcroft, John E.
    He, Kun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14963 - 14972
  • [35] Exploring Non-target Knowledge for Improving Ensemble Universal Adversarial Attacks
    Weng, Juanjuan
    Luo, Zhiming
    Zhong, Zhun
    Lin, Dazhen
    Li, Shaozi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2768 - 2775
  • [36] Improving the Semantic Consistency of Textual Adversarial Attacks via Prompt
    Yu, Xiaoyan
    Yin, Qilei
    Shi, Zhixin
    Ma, Yuru
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [37] Improving Adversarial Transferability via Frequency-based Stationary Point Search
    Zhu, Zhiyu
    Chen, Huaming
    Zhang, Jiayu
    Wang, Xinyi
    Jin, Zhibo
    Lu, Qinghua
    Shen, Jun
    Choo, Kim-Kwang Raymond
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3626 - 3635
  • [38] Boosting the transferability of adversarial attacks with global momentum initialization
    Wang, Jiafeng
    Chen, Zhaoyu
    Jiang, Kaixun
    Yang, Dingkang
    Hong, Lingyi
    Guo, Pinxue
    Guo, Haijing
    Zhang, Wenqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [39] A STUDY ON THE TRANSFERABILITY OF ADVERSARIAL ATTACKS IN SOUND EVENT CLASSIFICATION
    Subramanian, Vinod
    Pankajakshan, Arjun
    Benetos, Emmanouil
    Xu, Ning
    McDonald, SKoT
    Sandler, Mark
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 301 - 305
  • [40] Enhancing the Transferability of Targeted Attacks with Adversarial Perturbation Transform
    Deng, Zhengjie
    Xiao, Wen
    Li, Xiyan
    He, Shuqian
    Wang, Yizhen
    ELECTRONICS, 2023, 12 (18)