Improving the transferability of adversarial attacks via self-ensemble

被引：1

作者：

Cheng, Shuyan ^{[1
]}

Li, Peng ^{[1
]}

Liu, Jianguo ^{[1
]}

Xu, He ^{[1
]}

Yao, Yudong ^{[2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China

[2] Stevens Inst Technol, Dept Elect & Comp Engn, Hoboken, NJ 07030 USA

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 21期

基金：

中国国家自然科学基金;

关键词：

Black-box attacks; Transferability; Adversarial examples; Self-ensemble; Feature importance;

D O I：

10.1007/s10489-024-05728-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance of neural networks to adversarial attacks, researchers have investigated the security issues of models from the perspectives of both attacks and defenses. Recently, the transferability of adversarial attacks has received extensive attention, which promotes the application of adversarial attacks in practical scenarios. However, existing transferable attacks tend to trap into a poor local optimum and significantly degrade the transferability because the production of adversarial samples lacks randomness. Therefore, we propose a self-ensemble-based feature-level adversarial attack (SEFA) to boost transferability by randomly disrupting salient features. We provide theoretical analysis to demonstrate the superiority of the proposed method. In particular, perturbing the refined feature importance weighted intermediate features suppresses positive features and encourages negative features to realize adversarial attacks. Subsequently, self-ensemble is introduced to solve the optimization problem, thus enhancing the diversity from an optimization perspective. The diverse orthogonal initial perturbations disrupt these features stochastically, searching the space of transferable perturbations exhaustively to avoid poor local optima and improve transferability effectively. Extensive experiments show the effectiveness and superiority of the proposed SEFA, i.e., the success rates against undefended models and defense models are improved by 7.7% and 13.4%, respectively, compared with existing transferable attacks. Our code is available at https://github.com/chengshuyan/SEFA.

引用

页码：10608 / 10626

页数：19

共 50 条

[31] Improving Adversarial Robustness via Promoting Ensemble Diversity
Pang, Tianyu
Xu, Kun
Du, Chao
Chen, Ning
Zhu, Jun
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[32] SELF-ENSEMBLE VARIANCE REGULARIZATION FOR DOMAIN ADAPTATION
Liu, Xinyi
Dai, Tao
Xia, Shu-Tao
Jiang, Yong
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3853 - 3857
[33] AcneTyper: An automatic diagnosis method of dermoscopic acne image via self-ensemble and stacking
Liu, Shuai
Chen, Ruili
Gu, Yun
Yu, Qiong
Su, Guoxiong
Ren, Yanjiao
Huang, Lan
Zhou, Fengfeng
TECHNOLOGY AND HEALTH CARE, 2023, 31 (04) : 1171 - 1187
[34] Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial Transferability
Xiong, Yifeng
Lin, Jiadong
Zhang, Min
Hopcroft, John E.
He, Kun
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14963 - 14972
[35] Exploring Non-target Knowledge for Improving Ensemble Universal Adversarial Attacks
Weng, Juanjuan
Luo, Zhiming
Zhong, Zhun
Lin, Dazhen
Li, Shaozi
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2768 - 2775
[36] Improving the Semantic Consistency of Textual Adversarial Attacks via Prompt
Yu, Xiaoyan
Yin, Qilei
Shi, Zhixin
Ma, Yuru
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[37] Improving Adversarial Transferability via Frequency-based Stationary Point Search
Zhu, Zhiyu
Chen, Huaming
Zhang, Jiayu
Wang, Xinyi
Jin, Zhibo
Lu, Qinghua
Shen, Jun
Choo, Kim-Kwang Raymond
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3626 - 3635
[38] Boosting the transferability of adversarial attacks with global momentum initialization
Wang, Jiafeng
Chen, Zhaoyu
Jiang, Kaixun
Yang, Dingkang
Hong, Lingyi
Guo, Pinxue
Guo, Haijing
Zhang, Wenqiang
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
[39] A STUDY ON THE TRANSFERABILITY OF ADVERSARIAL ATTACKS IN SOUND EVENT CLASSIFICATION
Subramanian, Vinod
Pankajakshan, Arjun
Benetos, Emmanouil
Xu, Ning
McDonald, SKoT
Sandler, Mark
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 301 - 305
[40] Enhancing the Transferability of Targeted Attacks with Adversarial Perturbation Transform
Deng, Zhengjie
Xiao, Wen
Li, Xiyan
He, Shuqian
Wang, Yizhen
ELECTRONICS, 2023, 12 (18)

← 1 2 3 4 5 →