Generating Semantic Adversarial Examples via Feature Manipulation in Latent Space

被引:0
|
作者
Wang, Shuo [1 ]
Chen, Shangyu [2 ]
Chen, Tianle [3 ]
Nepal, Surya [1 ]
Rudolph, Carsten [2 ]
Grobler, Marthie [1 ]
机构
[1] CSIRO, Data61 & Cybersecur CRC, Marsfield, NSW 2122, Australia
[2] Monash Univ, Fac Informat Technol, Melbourne, Vic 3800, Australia
[3] Univ Queensland, St Lucia, Qld 4072, Australia
关键词
Adversarial examples; feature manipulation; latent representation; neural networks; variational autoencoder (VAE);
D O I
10.1109/TNNLS.2023.3299408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The susceptibility of deep neural networks (DNNs) to adversarial intrusions, exemplified by adversarial examples, is well-documented. Conventional attacks implement unstructured, pixel-wise perturbations to mislead classifiers, which often results in a noticeable departure from natural samples and lacks human-perceptible interpretability. In this work, we present an adversarial attack strategy that implements fine-granularity, semantic-meaning-oriented structural perturbations. Our proposed methodology manipulates the semantic attributes of images through the use of disentangled latent codes. We engineer adversarial perturbations by manipulating either a single latent code or a combination thereof. To this end, we propose two unsupervised semantic manipulation strategies: one based on vector-disentangled representation and the other on feature map-disentangled representation, taking into consideration the complexity of the latent codes and the smoothness of the reconstructed images. Our empirical evaluations, conducted extensively on real-world image data, showcase the potency of our attacks, particularly against black-box classifiers. Furthermore, we establish the existence of a universal semantic adversarial example that is agnostic to specific images.
引用
收藏
页码:17070 / 17084
页数:15
相关论文
共 50 条
  • [1] Generating Adversarial Examples through Latent Space Exploration of Generative Adversarial Networks
    Clare, Luana
    Correia, Joao
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 1760 - 1767
  • [2] Generating traceable adversarial text examples by watermarking in the semantic space
    Li, Mingjie
    Wu, Hanzhou
    Zhang, Xinpeng
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [3] Defending Adversarial Attacks via Semantic Feature Manipulation
    Wang, Shuo
    Nepal, Surya
    Rudolph, Carsten
    Grobler, Marthie
    Chen, Shangyu
    Chen, Tianle
    An, Zike
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (06) : 3184 - 3197
  • [4] Generating Adversarial Attacks in the Latent Space
    Shukla, Nitish
    Banerjee, Sudipta
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 730 - 739
  • [5] Generating Adversarial Examples Against Remote Sensing Scene Classification via Feature Approximation
    Zhu, Rui
    Ma, Shiping
    Lian, Jiawei
    He, Linyuan
    Mei, Shaohui
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 10174 - 10187
  • [6] Traffic Flow Synthesis Using Generative Adversarial Networks via Semantic Latent Codes Manipulation
    Chen, Yuanyuan
    Lv, Yisheng
    Zhu, Fenghua
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1451 - 1456
  • [7] Generating unrestricted adversarial examples via three parameteres
    Hanieh Naderi
    Leili Goli
    Shohreh Kasaei
    Multimedia Tools and Applications, 2022, 81 : 21919 - 21938
  • [8] Generating unrestricted adversarial examples via three parameteres
    Naderi, Hanieh
    Goli, Leili
    Kasaei, Shohreh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 21919 - 21938
  • [9] Generating adversarial examples via enhancing latent spatial features of benign traffic and preserving malicious functions
    Zhang, Rongqian
    Luo, Senlin
    Pan, Limin
    Hao, Jingwei
    Zhang, Ji
    Neurocomputing, 2022, 490 : 413 - 430
  • [10] Generating adversarial examples via enhancing latent spatial features of benign traffic and preserving malicious functions
    Zhang, Rongqian
    Luo, Senlin
    Pan, Limin
    Hao, Jingwei
    Zhang, Ji
    NEUROCOMPUTING, 2022, 490 : 413 - 430