Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

被引:0
|
作者
Feng, Ji [1 ,2 ]
Cai, Qi-Zhi [2 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Sinovat Ventures AI Inst, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time when facing clean samples. To achieve this, we proposed to use an auto-encoder-like network to generate such adversarial perturbations on the training data together with one imaginary victim differentiable classifier. The perturbation generator will learn to update its weights so as to produce the most harmful noise, aiming to cause the lowest performance for the victim classifier during test time. This can be formulated into a non-linear equality constrained optimization problem. Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability. By teaching the perturbation generator to hijacking the training trajectory of the victim classifier, the generator can thus learn to move against the victim classifier step by step. The method proposed in this paper can be easily extended to the label specific setting where the attacker can manipulate the predictions of the victim classifier according to some predefined rules rather than only making wrong predictions. Experiments on various datasets including CIFAR-10 and a reduced version of ImageNet confirmed the effectiveness of the proposed method and empirical results showed that, such bounded perturbations have good transferability across different types of victim classifiers.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Generating adversarial samples by manipulating image features with auto-encoder
    Jianxin Yang
    Mingwen Shao
    Huan Liu
    Xinkai Zhuang
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 2499 - 2509
  • [2] Generating adversarial samples by manipulating image features with auto-encoder
    Yang, Jianxin
    Shao, Mingwen
    Liu, Huan
    Zhuang, Xinkai
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2499 - 2509
  • [3] Auto-encoder generative adversarial networks
    Zhai, Zhonghua
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (03) : 3043 - 3049
  • [4] Data expansion method and application of couple adversarial auto-encoder
    Xu, Xiaowei
    Ao, Jinyan
    Liu, Guanghua
    Wang, Yawei
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (12): : 29 - 36
  • [5] Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-Encoder
    Qiu, Yao
    Zhang, Jinchao
    Zhou, Jie
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1698 - 1707
  • [6] Data Fused Motor Fault Identification Based on Adversarial Auto-Encoder
    Wang, Botao
    Shen, Chuanwen
    Yu, Chenxi
    Yang, Yutao
    [J]. 2019 IEEE 10TH INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS FOR DISTRIBUTED GENERATION SYSTEMS (PEDG 2019), 2019, : 299 - 305
  • [7] Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning
    Ji, Zhong
    Dai, Guangwen
    Yu, Yunlong
    [J]. IEEE ACCESS, 2020, 8 : 9287 - 9295
  • [8] Coupled generative adversarial stacked Auto-encoder: CoGASA
    Kiasari, Mohammad Ahangar
    Moirangthem, Dennis Singh
    Lee, Minho
    [J]. NEURAL NETWORKS, 2018, 100 : 1 - 9
  • [9] Adversarial auto-encoder for rating prediction with ratings and reviews
    Yi, Jin
    Huang, Jiajin
    Qin, Jin
    [J]. WEB INTELLIGENCE, 2020, 18 (04) : 285 - 294
  • [10] Adversarial auto-encoder for unsupervised deep domain adaptation
    Shao, Rui
    Lan, Xiangyuan
    [J]. IET IMAGE PROCESSING, 2019, 13 (14) : 2772 - 2777