Variational Adversarial Defense: A Bayes Perspective for Adversarial Training

被引：2

作者：

Zhao, Chenglong ^{[1
]}

Mei, Shibin ^{[1
]}

Ni, Bingbing ^{[1
]}

Yuan, Shengchao ^{[1
]}

Yu, Zhenbo ^{[1
]}

Wang, Jun ^{[2
]}

机构：

[1] Shanghai Jiaotong Univ Shanghai, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

[2] China Elect Technol Grp Corp, HIK Res Inst, Jiaxing 310052, Zhejiang, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 05期

基金：

美国国家科学基金会;

关键词：

Variational inference; adversarial defense; model robustness;

D O I：

10.1109/TPAMI.2023.3341639

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Various methods have been proposed to defend against adversarial attacks. However, there is a lack of enough theoretical guarantee of the performance, thus leading to two problems: First, deficiency of necessary adversarial training samples might attenuate the normal gradient's back-propagation, which leads to overfitting and gradient masking potentially. Second, point-wise adversarial sampling offers an insufficient support region for adversarial data and thus cannot form a robust decision-boundary. To solve these issues, we provide a theoretical analysis to reveal the relationship between robust accuracy and the complexity of the training set in adversarial training. As a result, we propose a novel training scheme called Variational Adversarial Defense. Based on the distribution of adversarial samples, this novel construction upgrades the defend scheme from local point-wise to distribution-wise, yielding an enlarged support region for safeguarding robust training, thus possessing a higher promising to defense attacks. The proposed method features the following advantages: 1) Instead of seeking adversarial examples point-by-point (in a sequential way), we draw diverse adversarial examples from the inferred distribution; and 2) Augmenting the training set by a larger support region consolidates the smoothness of the decision boundary. Finally, the proposed method is analyzed via the Taylor expansion technique, which casts our solution with natural interpretability.

引用

页码：3047 / 3063

页数：17

共 50 条

[1] Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks
Mescheder, Lars
Nowozin, Sebastian
Geiger, Andreas
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[2] Importance Weighted Adversarial Variational Bayes
Gomez-Sancho, Marta
Hernandez-Lobato, Daniel
[J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 374 - 386
[3] Attack-less adversarial training for a robust adversarial defense
Jiacang Ho
Byung-Gook Lee
Dae-Ki Kang
[J]. Applied Intelligence, 2022, 52 : 4364 - 4381
[4] Attack-less adversarial training for a robust adversarial defense
Ho, Jiacang
Lee, Byung-Gook
Kang, Dae-Ki
[J]. APPLIED INTELLIGENCE, 2022, 52 (04) : 4364 - 4381
[5] Graph Representation Learning via Adversarial Variational Bayes
Li, Yunhe
Hu, Yaochen
Zhang, Yingxue
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3237 - 3241
[6] ADVERSARIAL DEFENSE FOR DEEP SPEAKER RECOGNITION USING HYBRID ADVERSARIAL TRAINING
Pal, Monisankha
Jati, Arindam
Peri, Raghuveer
Hsu, Chin-Cheng
AbdAlmageed, Wael
Narayanan, Shrikanth
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6164 - 6168
[7] Efficient Adversarial Defense without Adversarial Training: A Batch Normalization Approach
Zhu, Yao
Wei, Xiao
Zhu, Yue
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[8] Adversarial Training Defense Based on Second-order Adversarial Examples
Qian Yaguan
Zhang Ximin
Wang Bin
Gu Zhaoquan
Li Wei
Yun Bensheng
[J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (11) : 3367 - 3373
[9] Defense Against Adversarial Attacks Using Topology Aligning Adversarial Training
Kuang, Huafeng
Liu, Hong
Lin, Xianming
Ji, Rongrong
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3659 - 3673
[10] Monge blunts Bayes: Hardness Results for Adversarial Training
Cranko, Zac
Menon, Aditya Krishna
Nock, Richard
Ong, Cheng Soon
Shi, Zhan
Walder, Christian
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97

← 1 2 3 4 5 →