Variational Adversarial Defense: A Bayes Perspective for Adversarial Training

被引:2
|
作者
Zhao, Chenglong [1 ]
Mei, Shibin [1 ]
Ni, Bingbing [1 ]
Yuan, Shengchao [1 ]
Yu, Zhenbo [1 ]
Wang, Jun [2 ]
机构
[1] Shanghai Jiaotong Univ Shanghai, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] China Elect Technol Grp Corp, HIK Res Inst, Jiaxing 310052, Zhejiang, Peoples R China
基金
美国国家科学基金会;
关键词
Variational inference; adversarial defense; model robustness;
D O I
10.1109/TPAMI.2023.3341639
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various methods have been proposed to defend against adversarial attacks. However, there is a lack of enough theoretical guarantee of the performance, thus leading to two problems: First, deficiency of necessary adversarial training samples might attenuate the normal gradient's back-propagation, which leads to overfitting and gradient masking potentially. Second, point-wise adversarial sampling offers an insufficient support region for adversarial data and thus cannot form a robust decision-boundary. To solve these issues, we provide a theoretical analysis to reveal the relationship between robust accuracy and the complexity of the training set in adversarial training. As a result, we propose a novel training scheme called Variational Adversarial Defense. Based on the distribution of adversarial samples, this novel construction upgrades the defend scheme from local point-wise to distribution-wise, yielding an enlarged support region for safeguarding robust training, thus possessing a higher promising to defense attacks. The proposed method features the following advantages: 1) Instead of seeking adversarial examples point-by-point (in a sequential way), we draw diverse adversarial examples from the inferred distribution; and 2) Augmenting the training set by a larger support region consolidates the smoothness of the decision boundary. Finally, the proposed method is analyzed via the Taylor expansion technique, which casts our solution with natural interpretability.
引用
收藏
页码:3047 / 3063
页数:17
相关论文
共 50 条
  • [1] Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks
    Mescheder, Lars
    Nowozin, Sebastian
    Geiger, Andreas
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Importance Weighted Adversarial Variational Bayes
    Gomez-Sancho, Marta
    Hernandez-Lobato, Daniel
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 374 - 386
  • [3] Attack-less adversarial training for a robust adversarial defense
    Jiacang Ho
    Byung-Gook Lee
    Dae-Ki Kang
    [J]. Applied Intelligence, 2022, 52 : 4364 - 4381
  • [4] Attack-less adversarial training for a robust adversarial defense
    Ho, Jiacang
    Lee, Byung-Gook
    Kang, Dae-Ki
    [J]. APPLIED INTELLIGENCE, 2022, 52 (04) : 4364 - 4381
  • [5] Graph Representation Learning via Adversarial Variational Bayes
    Li, Yunhe
    Hu, Yaochen
    Zhang, Yingxue
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3237 - 3241
  • [6] ADVERSARIAL DEFENSE FOR DEEP SPEAKER RECOGNITION USING HYBRID ADVERSARIAL TRAINING
    Pal, Monisankha
    Jati, Arindam
    Peri, Raghuveer
    Hsu, Chin-Cheng
    AbdAlmageed, Wael
    Narayanan, Shrikanth
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6164 - 6168
  • [7] Efficient Adversarial Defense without Adversarial Training: A Batch Normalization Approach
    Zhu, Yao
    Wei, Xiao
    Zhu, Yue
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Adversarial Training Defense Based on Second-order Adversarial Examples
    Qian Yaguan
    Zhang Ximin
    Wang Bin
    Gu Zhaoquan
    Li Wei
    Yun Bensheng
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (11) : 3367 - 3373
  • [9] Defense Against Adversarial Attacks Using Topology Aligning Adversarial Training
    Kuang, Huafeng
    Liu, Hong
    Lin, Xianming
    Ji, Rongrong
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3659 - 3673
  • [10] Monge blunts Bayes: Hardness Results for Adversarial Training
    Cranko, Zac
    Menon, Aditya Krishna
    Nock, Richard
    Ong, Cheng Soon
    Shi, Zhan
    Walder, Christian
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97