Between-Class Adversarial Training for Improving Adversarial Robustness of Image Classification

被引:0
|
作者
Wang, Desheng [1 ]
Jin, Weidong [1 ,2 ]
Wu, Yunpu [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Elect Engn, Chengdu 611756, Peoples R China
[2] Nanning Univ, China ASEAN Int Joint Lab Integrated Transportat, Nanning 541699, Peoples R China
[3] Xihua Univ, Sch Elect Engn & Elect Informat, Chengdu 610039, Peoples R China
基金
中国国家自然科学基金;
关键词
adversarial training; between-class learning; robustness; regularization;
D O I
10.3390/s23063252
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Deep neural networks (DNNs) have been known to be vulnerable to adversarial attacks. Adversarial training (AT) is, so far, the only method that can guarantee the robustness of DNNs to adversarial attacks. However, the robustness generalization accuracy gain of AT is still far lower than the standard generalization accuracy of an undefended model, and there is known to be a trade-off between the standard generalization accuracy and the robustness generalization accuracy of an adversarially trained model. In order to improve the robustness generalization and the standard generalization performance trade-off of AT, we propose a novel defense algorithm called Between-Class Adversarial Training (BCAT) that combines Between-Class learning (BC-learning) with standard AT. Specifically, BCAT mixes two adversarial examples from different classes and uses the mixed between-class adversarial examples to train a model instead of original adversarial examples during AT. We further propose BCAT+ which adopts a more powerful mixing method. BCAT and BCAT+ impose effective regularization on the feature distribution of adversarial examples to enlarge between-class distance, thus improving the robustness generalization and the standard generalization performance of AT. The proposed algorithms do not introduce any hyperparameters into standard AT; therefore, the process of hyperparameters searching can be avoided. We evaluate the proposed algorithms under both white-box attacks and black-box attacks using a spectrum of perturbation values on CIFAR-10, CIFAR-100, and SVHN datasets. The research findings indicate that our algorithms achieve better global robustness generalization performance than the state-of-the-art adversarial defense methods.
引用
下载
收藏
页数:23
相关论文
共 50 条
  • [1] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Desheng Wang
    Weidong Jin
    Yunpu Wu
    Aamir Khan
    Applied Intelligence, 2023, 53 : 24492 - 24508
  • [2] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Wang, Desheng
    Jin, Weidong
    Wu, Yunpu
    Khan, Aamir
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
  • [3] Sliced Wasserstein adversarial training for improving adversarial robustness
    Lee W.
    Lee S.
    Kim H.
    Lee J.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (08) : 3229 - 3242
  • [4] Benchmarking Adversarial Robustness on Image Classification
    Dong, Yinpeng
    Fu, Qi-An
    Yang, Xiao
    Pang, Tianyu
    Su, Hang
    Xiao, Zihao
    Zhu, Jun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 318 - 328
  • [5] Improving Adversarial Robustness With Adversarial Augmentations
    Chen, Chuanxi
    Ye, Dengpan
    He, Yiheng
    Tang, Long
    Xu, Yue
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 5105 - 5117
  • [6] A Review of Adversarial Robustness Evaluation for Image Classification
    Li, Zituo
    Sun, Jianbin
    Yang, Kewei
    Xiong, Dehui
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (10): : 2164 - 2189
  • [7] ENHANCING ADVERSARIAL ROBUSTNESS FOR IMAGE CLASSIFICATION BY REGULARIZING CLASS LEVEL FEATURE DISTRIBUTION
    Yu, Cheng
    Xue, Youze
    Chen, Jiansheng
    Wang, Yu
    Ma, Huimin
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 494 - 498
  • [8] Between-class Learning for Image Classification
    Tokozume, Yuji
    Ushiku, Yoshitaka
    Harada, Tatsuya
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5486 - 5494
  • [9] Recent Advances in Adversarial Training for Adversarial Robustness
    Bai, Tao
    Luo, Jinqi
    Zhao, Jun
    Wen, Bihan
    Wang, Qian
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4312 - 4321
  • [10] Weighted Adaptive Perturbations Adversarial Training for Improving Robustness
    Wang, Yan
    Zhang, Dongmei
    Zhang, Haiyang
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 402 - 415