Towards transferable adversarial attacks on vision transformers for image classification

被引:1
|
作者
Guo, Xu [1 ]
Chen, Peng [1 ]
Lu, Zhihui [1 ,2 ]
Chai, Hongfeng [1 ,3 ]
Du, Xin [1 ]
Wu, Xudong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Shanghai Blockchain Engn Res Ctr, Shanghai 200433, Peoples R China
[3] Fudan Univ, Inst Financial Technol, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Adversarial example; Transfer attack; Surrogate model; Vision transformer; Fintech regulation; Image classification;
D O I
10.1016/j.sysarc.2024.103155
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The deployment of high-performance Vision Transformer (ViT) models has garnered attention from both industry and academia. However, their vulnerability to adversarial examples highlights security risks for scenarios such as intelligent surveillance, autonomous driving, and fintech regulation. As a black-box attack technique, transfer attacks leverage a surrogate model to generate transferable adversarial examples to attack a target victim model, which mainly focuses on a forward (input diversification) and a backward (gradient modification) approach. However, both approaches are currently implemented straightforwardly and limit the transferability of surrogate models. In this paper, we propose a Forward-Backward Transferable Adversarial Attack framework (FBTA) that can generate highly transferable adversarial examples against different models by fully leveraging ViT's distinctive intermediate layer structures. In the forward inference process of FBTA, we propose a Dropout-based Transferable Attack (DTA) approach to diversify the intermediate states of ViT models, simulating an ensemble learning effect; in the backward process, a Backpropagation Gradient Clipping (BGC) method is designed to refine the gradients within intermediate layers of ViT models intricately. Extensive experiments on state-of-the-art ViTs and robust CNNs demonstrate that our FBTA framework achieves an average performance improvement of 2.79% compared to state-of-the-art transfer-based attacks, offering insights for the comprehension and defense against transfer attacks.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Towards Transferable Adversarial Attacks on Vision Transformers
    Wei, Zhipeng
    Chen, Jingjing
    Goldblum, Micah
    Wu, Zuxuan
    Goldstein, Tom
    Jiang, Yu-Gang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2668 - 2676
  • [2] Towards Transferable Adversarial Attacks on Image and Video Transformers
    Wei, Zhipeng
    Chen, Jingjing
    Goldblum, Micah
    Wu, Zuxuan
    Goldstein, Tom
    Jiang, Yu-Gang
    Davis, Larry S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6346 - 6358
  • [3] Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization
    Zhang, Jianping
    Huang, Yizhan
    Wu, Weibin
    Lyu, Michael R.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 16415 - 16424
  • [4] Towards universal and transferable adversarial attacks against network traffic classification
    Ding, Ruiyang
    Sun, Lei
    Zang, Weifei
    Dai, Leyu
    Ding, Zhiyi
    Xu, Bayi
    Computer Networks, 2024, 254
  • [5] Feature-aware transferable adversarial attacks against image classification
    Cheng, Shuyan
    Li, Peng
    Han, Keji
    Xu, He
    APPLIED SOFT COMPUTING, 2024, 161
  • [6] Generating Transferable Adversarial Examples against Vision Transformers
    Wang, Yuxuan
    Wang, Jiakai
    Yin, Zinxin
    Gong, Ruihao
    Wang, Jingyi
    Liu, Aishan
    Liu, Xianglong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5181 - 5190
  • [7] Towards Transferable Adversarial Attacks with Centralized Perturbation
    Wu, Shangbo
    Tan, Yu-an
    Wang, Yajie
    Ma, Ruinan
    Ma, Wencong
    Li, Yuanzhang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6109 - 6116
  • [8] Transferable Adversarial Attacks for Image and Video Object Detection
    Wei, Xingxing
    Liang, Siyuan
    Chen, Ning
    Cao, Xiaochun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 954 - 960
  • [9] Towards Efficient Adversarial Training on Vision Transformers
    Wu, Boxi
    Gu, Jindong
    Li, Zhifeng
    Cai, Deng
    He, Xiaofei
    Liu, Wei
    COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 307 - 325
  • [10] Transferable adversarial attacks for multi-model systems coupling image fusion with classification models
    Pengcheng Zhu
    Xin Jin
    Qian Jiang
    Xueshuai Gao
    Puming Wang
    Shaowen Yao
    Wei Zhou
    Cybersecurity, 8 (1)