Common knowledge learning for generating transferable adversarial examples

被引:0
|
作者
Yang, Ruijie [1 ]
Guo, Yuanfang [1 ]
Wang, Junfu [1 ]
Zhou, Jiantao [2 ,3 ]
Wang, Yunhong [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Lab Intelligent Recognit & Image Proc, Beijing 100191, Peoples R China
[2] Univ Macau, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China
[3] Univ Macau, Dept Comp & Sci, Macau 999078, Peoples R China
基金
中国国家自然科学基金;
关键词
black-box attack; adversarial transferability; deep neural networks;
D O I
10.1007/s11704-024-40533-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on an important type of black-box attacks, i.e., transfer-based adversarial attacks, where the adversary generates adversarial examples using a substitute (source) model and utilizes them to attack an unseen target model, without knowing its information. Existing methods tend to give unsatisfactory adversarial transferability when the source and target models are from different types of DNN architectures (e.g., ResNet-18 and Swin Transformer). In this paper, we observe that the above phenomenon is induced by the output inconsistency problem. To alleviate this problem while effectively utilizing the existing DNN models, we propose a common knowledge learning (CKL) framework to learn better network weights to generate adversarial examples with better transferability, under fixed network architectures. Specifically, to reduce the model-specific features and obtain better output distributions, we construct a multi-teacher framework, where the knowledge is distilled from different teacher architectures into one student network. By considering that the gradient of input is usually utilized to generate adversarial examples, we impose constraints on the gradients between the student and teacher models, to further alleviate the output inconsistency problem and enhance the adversarial transferability. Extensive experiments demonstrate that our proposed work can significantly improve the adversarial transferability.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Generating Transferable Adversarial Examples for Speech Classification
    Kim, Hoki
    Park, Jinseong
    Lee, Jaewook
    PATTERN RECOGNITION, 2023, 137
  • [2] Learning Indistinguishable and Transferable Adversarial Examples
    Zhang, Wu
    Zou, Junhua
    Duan, Yexin
    Zhou, Xingyu
    Pan, Zhisong
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 152 - 164
  • [3] Generating Transferable Adversarial Examples against Vision Transformers
    Wang, Yuxuan
    Wang, Jiakai
    Yin, Zinxin
    Gong, Ruihao
    Wang, Jingyi
    Liu, Aishan
    Liu, Xianglong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5181 - 5190
  • [4] Generating Transferable Adversarial Examples From the Perspective of Ensemble and Distribution
    Zhang, Huangyi
    Liu, Ximeng
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 173 - 177
  • [5] Towards Transferable Adversarial Examples Using Meta Learning
    Fan, Mingyuan
    Yin, Jia-Li
    Liu, Ximeng
    Guo, Wenzhong
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 178 - 192
  • [6] Generating transferable adversarial examples based on perceptually-aligned perturbation
    Chen, Hongqiao
    Lu, Keda
    Wang, Xianmin
    Li, Jin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3295 - 3307
  • [7] Learning Transferable Adversarial Examples via Ghost Networks
    Li, Yingwei
    Bai, Song
    Zhou, Yuyin
    Xie, Cihang
    Zhang, Zhishuai
    Yuille, Alan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11458 - 11465
  • [8] Generating transferable adversarial examples based on perceptually-aligned perturbation
    Hongqiao Chen
    Keda Lu
    Xianmin Wang
    Jin Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 3295 - 3307
  • [9] A hypothetical defenses-based training framework for generating transferable adversarial examples
    Hao, Lingguang
    Hao, Kuangrong
    Jin, Yaochu
    Zhao, Hongzhi
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [10] Efficient Adversarial Training with Transferable Adversarial Examples
    Zheng, Haizhong
    Zhang, Ziqi
    Gu, Juncheng
    Lee, Honglak
    Prakash, Atul
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1178 - 1187