On Generating Transferable Targeted Perturbations

被引:31
|
作者
Naseer, Muzammal [1 ,4 ]
Khan, Salman [4 ]
Hayat, Munawar [2 ]
Khan, Fahad Shahbaz [4 ,5 ]
Porikli, Fatih [3 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] Monash Univ, Melbourne, Vic, Australia
[3] Qualcomm, San Diego, CA USA
[4] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[5] Linkoping Univ, Linkoping, Sweden
关键词
D O I
10.1109/ICCV48922.2021.00761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While the untargeted black-box transferability of adversarial perturbations has been extensively studied before, changing an unseen model's decisions to a specific 'targeted' class remains a challenging feat. In this paper, we propose a new generative approach for highly transferable targeted perturbations (TTP). We note that the existing methods are less suitable for this task due to their reliance on class-boundary information that changes from one model to another, thus reducing transferability. In contrast, our approach matches the perturbed image 'distribution' with that of the target class, leading to high targeted transferability rates. To this end, we propose a new objective function that not only aligns the global distributions of source and target images, but also matches the local neighbourhood structure between the two domains. Based on the proposed objective, we train a generator function that can adaptively synthesize perturbations specific to a given input. Our generative approach is independent of the source or target domain labels, while consistently performs well against state-of-the-art methods on a wide range of attack settings. As an example, we achieve 32.63% target transferability from (an adversarially weak) VGG19(BN) to (a strong) WideResNet on ImageNet val. set, which is 4x higher than the previous best generative attack and 16x better than instance-specific iterative attack. Code is available at: https://github.com/Muzammal-Naseer/TTP.
引用
收藏
页码:7688 / 7697
页数:10
相关论文
共 50 条
  • [31] Generating Universal Adversarial Perturbations for Quantum Classifiers
    Anil, Gautham
    Vinod, Vishnu
    Narayan, Apurva
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 10891 - 10899
  • [32] Generating transferable adversarial examples based on perceptually-aligned perturbation
    Chen, Hongqiao
    Lu, Keda
    Wang, Xianmin
    Li, Jin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3295 - 3307
  • [33] Generating transferable adversarial examples based on perceptually-aligned perturbation
    Hongqiao Chen
    Keda Lu
    Xianmin Wang
    Jin Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 3295 - 3307
  • [34] Generalizable Seizure Detection Model Using Generating Transferable Adversarial Features
    Nasiri, Samaneh
    Clifford, Gari D.
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 568 - 572
  • [35] GADIFF: a transferable graph attention diffusion model for generating molecular conformations
    Wang, Donghan
    Dong, Xu
    Zhang, Xueyou
    Hu, Lihong
    BRIEFINGS IN BIOINFORMATICS, 2024, 26 (01)
  • [36] TransRPN: Towards the Transferable Adversarial Perturbations using Region Proposal Networks and Beyond
    Li, Yuezun
    Chang, Ming-Ching
    Sun, Pu
    Qi, Honggang
    Dong, Junyu
    Lyu, Siwei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [37] Dynamic loss yielding more transferable targeted adversarial examples
    Zhang, Ming
    Chen, Yongkang
    Li, Hu
    Qian, Cheng
    Kuang, Xiaohui
    NEUROCOMPUTING, 2024, 590
  • [38] Conditions for generating scale-invariant density perturbations
    Gratton, S
    Khoury, J
    Steinhardt, PJ
    Turok, N
    PHYSICAL REVIEW D, 2004, 69 (10):
  • [39] Singular perturbations generating complexification phenomena for elliptic shells
    Bechet, F.
    Sanchez-Palencia, E.
    Millet, O.
    COMPUTATIONAL MECHANICS, 2009, 43 (02) : 207 - 221
  • [40] Generating ekpyrotic curvature perturbations before the big bang
    Lehners, Jean-Luc
    McFadden, Paul
    Turok, Neil
    Steinhardt, Paul J.
    PHYSICAL REVIEW D, 2007, 76 (10):