On Generating Transferable Targeted Perturbations

被引:31
|
作者
Naseer, Muzammal [1 ,4 ]
Khan, Salman [4 ]
Hayat, Munawar [2 ]
Khan, Fahad Shahbaz [4 ,5 ]
Porikli, Fatih [3 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] Monash Univ, Melbourne, Vic, Australia
[3] Qualcomm, San Diego, CA USA
[4] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[5] Linkoping Univ, Linkoping, Sweden
关键词
D O I
10.1109/ICCV48922.2021.00761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While the untargeted black-box transferability of adversarial perturbations has been extensively studied before, changing an unseen model's decisions to a specific 'targeted' class remains a challenging feat. In this paper, we propose a new generative approach for highly transferable targeted perturbations (TTP). We note that the existing methods are less suitable for this task due to their reliance on class-boundary information that changes from one model to another, thus reducing transferability. In contrast, our approach matches the perturbed image 'distribution' with that of the target class, leading to high targeted transferability rates. To this end, we propose a new objective function that not only aligns the global distributions of source and target images, but also matches the local neighbourhood structure between the two domains. Based on the proposed objective, we train a generator function that can adaptively synthesize perturbations specific to a given input. Our generative approach is independent of the source or target domain labels, while consistently performs well against state-of-the-art methods on a wide range of attack settings. As an example, we achieve 32.63% target transferability from (an adversarially weak) VGG19(BN) to (a strong) WideResNet on ImageNet val. set, which is 4x higher than the previous best generative attack and 16x better than instance-specific iterative attack. Code is available at: https://github.com/Muzammal-Naseer/TTP.
引用
收藏
页码:7688 / 7697
页数:10
相关论文
共 50 条
  • [21] DIVERSE GENERATIVE PERTURBATIONS ON ATTENTION SPACE FOR TRANSFERABLE ADVERSARIAL ATTACKS
    Kim, Woo Jae
    Hong, Seunghoon
    Yoon, Sung-Eui
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 281 - 285
  • [22] Generating Transferable Adversarial Examples From the Perspective of Ensemble and Distribution
    Zhang, Huangyi
    Liu, Ximeng
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 173 - 177
  • [23] GENERATING TRANSFERABLE TIGHT-BINDING PARAMETERS - APPLICATION TO SILICON
    GOODWIN, L
    SKINNER, AJ
    PETTIFOR, DG
    EUROPHYSICS LETTERS, 1989, 9 (07): : 701 - 706
  • [24] On Success and Simplicity: A Second Look at Transferable Targeted Attacks
    Zhao, Zhengyu
    Liu, Zhuoran
    Larson, Martha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] Enhancing the Self-Universality for Transferable Targeted Attacks
    Wei, Zhipeng
    Chen, Jingjing
    Wu, Zuxuan
    Jiang, Yu-Gang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12281 - 12290
  • [26] Generating mice with targeted mutations
    Mario R. Capecchi
    Nature Medicine, 2001, 7 : 1086 - 1090
  • [27] Generating curvature perturbations with or without MSSM at directions
    Matsuda, Tomohiro
    JOURNAL OF COSMOLOGY AND ASTROPARTICLE PHYSICS, 2007, (06):
  • [28] On the Effectiveness of Perturbations in Generating Evasive Malware Variants
    Jin, Beomjin
    Choi, Jusop
    Hong, Jin B.
    Kim, Hyoungshick
    IEEE ACCESS, 2023, 11 : 31062 - 31074
  • [29] Generating mice with targeted mutations
    Capecchi, MR
    NATURE MEDICINE, 2001, 7 (10) : 1086 - 1090
  • [30] Generating graph perturbations to enhance the generalization of GNNs
    Ennadir, Sofiane
    Nikolentzos, Giannis
    Vazirgiannis, Michalis
    Bostrom, Henrik
    AI OPEN, 2024, 5 : 216 - 223