Effective Targeted Attacks for Adversarial Self-Supervised Learning

被引:0
|
作者
Kim, Minseon [1 ]
Ha, Hyeonjeong [1 ]
Son, Sooel [1 ]
Hwang, Sung Ju [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Seoul, South Korea
[2] DeepAuto Ai, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, unsupervised adversarial training (AT) has been highlighted as a means of achieving robustness in models without any label information. Previous studies in unsupervised AT have mostly focused on implementing self-supervised learning (SSL) frameworks, which maximize the instance-wise classification loss to generate adversarial examples. However, we observe that simply maximizing the self-supervised training loss with an untargeted adversarial attack often results in generating ineffective adversaries that may not help improve the robustness of the trained model, especially for non-contrastive SSL frameworks without negative examples. To tackle this problem, we propose a novel positive mining for targeted adversarial attack to generate effective adversaries for adversarial SSL frameworks. Specifically, we introduce an algorithm that selects the most confusing yet similar target example for a given instance based on entropy and similarity, and subsequently perturbs the given instance towards the selected target. Our method demonstrates significant enhancements in robustness when applied to non-contrastive SSL frameworks, and less but consistent robustness improvements with contrastive SSL frameworks, on the benchmark datasets.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Self-Supervised Adversarial Variational Learning
    Ye, Fei
    Bors, Adrian. G.
    PATTERN RECOGNITION, 2024, 148
  • [2] Adversarial Masking for Self-Supervised Learning
    Shi, Yuge
    Siddharth, N.
    Torr, Philip H. S.
    Kosiorek, Adam R.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Graph Adversarial Self-Supervised Learning
    Yang, Longqi
    Zhang, Liangliang
    Yang, Wenjing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Adversarial Self-Supervised Contrastive Learning
    Kim, Minseon
    Tack, Jihoon
    Hwang, Sung Ju
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [5] Self-Supervised Adversarial Imitation Learning
    Monteiro, Juarez
    Gavenski, Nathan
    Meneguzzi, Felipe
    Barros, Rodrigo C.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Backdoor Attacks on Self-Supervised Learning
    Saha, Aniruddha
    Tejankar, Ajinkya
    Koohpayegani, Soroush Abbasi
    Pirsiavash, Hamed
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13327 - 13336
  • [7] Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness
    Zhang, Chaoning
    Zhang, Kang
    Zhang, Chenshuang
    Niu, Axi
    Feng, Jiu
    Yoo, Chang D.
    Kweon, In So
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 725 - 742
  • [8] Self-Supervised Effective Resolution Estimation with Adversarial Augmentations
    Kansy, Manuel
    Balletshofer, Julian
    Naruniec, Jacek
    Schroers, Christopher
    Mignone, Graziana
    Gross, Markus
    Weber, Romann M.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 573 - 582
  • [9] Self-Supervised Vessel Segmentation via Adversarial Learning
    Ma, Yuxin
    Hua, Yang
    Deng, Hanming
    Song, Tao
    Wang, Hao
    Xue, Zhengui
    Cao, Heng
    Ma, Ruhui
    Guan, Haibing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7516 - 7525
  • [10] Self-supervised graph representations with generative adversarial learning
    Sun, Xuecheng
    Wang, Zonghui
    Lu, Zheming
    Lu, Ziqian
    NEUROCOMPUTING, 2024, 592