Generalization Properties of Adversarial Training for l0 -Bounded Adversarial Attacks

被引:0
|
作者
Delgosha, Payam [1 ]
Hassani, Hamed [2 ]
Pedarsani, Ramtin [3 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Univ Penn, Philadelphia, PA USA
[3] Univ Calif Santa Barbara, Santa Barbara, CA USA
关键词
D O I
10.1109/ITW55543.2023.10161648
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We have widely observed that neural networks are vulnerable to small additive perturbations to the input causing misclassification. In this paper, we focus on the l(0) -bounded adversarial attacks, and aim to theoretically characterize the performance of adversarial training for an important class of truncated classifiers. Such classifiers are shown to have strong performance empirically, as well as theoretically in the Gaussian mixture model, in the l(0)-adversarial setting. The main contribution of this paper is to prove a novel generalization bound for the binary classification setting with l(0)-bounded adversarial perturbation that is distribution-independent. Deriving a generalization bound in this setting has two main challenges: (i) the truncated inner product which is highly non-linear; and (ii) maximization over the l(0) ball due to adversarial training is non-convex and highly non-smooth. To tackle these challenges, we develop new coding techniques for bounding the combinatorial dimension of the truncated hypothesis class.
引用
下载
收藏
页码:113 / 118
页数:6
相关论文
共 50 条
  • [21] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Desheng Wang
    Weidong Jin
    Yunpu Wu
    Aamir Khan
    Applied Intelligence, 2023, 53 : 24492 - 24508
  • [22] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Wang, Desheng
    Jin, Weidong
    Wu, Yunpu
    Khan, Aamir
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
  • [23] Training on Foveated Images Improves Robustness to Adversarial Attacks
    Shah, Muhammad A.
    Kashaf, Aqsa
    Raj, Bhiksha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] PD-Net: Point Dropping Network for Flexible Adversarial Example Generation with L0 Regularization
    Wang, Zhengyi
    Wang, Xupeng
    Sohel, Ferdous
    Liao, Yong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [25] PatchBreaker: defending against adversarial attacks by cutting-inpainting patches and joint adversarial training
    Huang, Shiyu
    Ye, Feng
    Huang, Zuchao
    Li, Wei
    Huang, Tianqiang
    Huang, Liqing
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10819 - 10832
  • [26] Adversarial Training Against Adversarial Attacks for Machine Learning-Based Intrusion Detection Systems
    Haroon, Muhammad Shahzad
    Ali, Husnain Mansoor
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3513 - 3527
  • [27] Improved OOD Generalization via Adversarial Training and Pre-training
    Yi, Mingyangi
    Hou, Lu
    Sun, Jiacheng
    Shang, Lifeng
    Jiang, Xin
    Liu, Qun
    Ma, Zhi-Ming
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [28] Improving Generalization in Deepfake Detection via Augmentation with Recurrent Adversarial Attacks
    Stanciu, Dan-Cristian
    Ionescu, Bogdan
    PROCEEDINGS OF THE 3RD ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA AI AGAINST DISINFORMATION, MAD 2024, 2024, : 46 - 54
  • [29] Global Wasserstein Margin maximization for boosting generalization in adversarial training
    Tingyue Yu
    Shen Wang
    Xiangzhan Yu
    Applied Intelligence, 2023, 53 : 11490 - 11504
  • [30] On Domain Generalization for Batched Prediction: the Benefit of Contextual Adversarial Training
    Li, Chune
    Mao, Yongyi
    Zhang, Richong
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 577 - 584