Common knowledge learning for generating transferable adversarial examples

被引：0

作者：

Yang, Ruijie ^{[1
]}

Guo, Yuanfang ^{[1
]}

Wang, Junfu ^{[1
]}

Zhou, Jiantao ^{[2
,3
]}

Wang, Yunhong ^{[1
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, Lab Intelligent Recognit & Image Proc, Beijing 100191, Peoples R China

[2] Univ Macau, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China

[3] Univ Macau, Dept Comp & Sci, Macau 999078, Peoples R China

来源：

FRONTIERS OF COMPUTER SCIENCE | 2025年 / 19卷 / 10期

基金：

中国国家自然科学基金;

关键词：

black-box attack; adversarial transferability; deep neural networks;

D O I：

10.1007/s11704-024-40533-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper focuses on an important type of black-box attacks, i.e., transfer-based adversarial attacks, where the adversary generates adversarial examples using a substitute (source) model and utilizes them to attack an unseen target model, without knowing its information. Existing methods tend to give unsatisfactory adversarial transferability when the source and target models are from different types of DNN architectures (e.g., ResNet-18 and Swin Transformer). In this paper, we observe that the above phenomenon is induced by the output inconsistency problem. To alleviate this problem while effectively utilizing the existing DNN models, we propose a common knowledge learning (CKL) framework to learn better network weights to generate adversarial examples with better transferability, under fixed network architectures. Specifically, to reduce the model-specific features and obtain better output distributions, we construct a multi-teacher framework, where the knowledge is distilled from different teacher architectures into one student network. By considering that the gradient of input is usually utilized to generate adversarial examples, we impose constraints on the gradients between the student and teacher models, to further alleviate the output inconsistency problem and enhance the adversarial transferability. Extensive experiments demonstrate that our proposed work can significantly improve the adversarial transferability.

引用

页数：14

共 50 条

[1] Generating Transferable Adversarial Examples for Speech Classification
Kim, Hoki
Park, Jinseong
Lee, Jaewook
PATTERN RECOGNITION, 2023, 137
[2] Learning Indistinguishable and Transferable Adversarial Examples
Zhang, Wu
Zou, Junhua
Duan, Yexin
Zhou, Xingyu
Pan, Zhisong
PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 152 - 164
[3] Generating Transferable Adversarial Examples against Vision Transformers
Wang, Yuxuan
Wang, Jiakai
Yin, Zinxin
Gong, Ruihao
Wang, Jingyi
Liu, Aishan
Liu, Xianglong
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5181 - 5190
[4] Generating Transferable Adversarial Examples From the Perspective of Ensemble and Distribution
Zhang, Huangyi
Liu, Ximeng
PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 173 - 177
[5] Towards Transferable Adversarial Examples Using Meta Learning
Fan, Mingyuan
Yin, Jia-Li
Liu, Ximeng
Guo, Wenzhong
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 178 - 192
[6] Generating transferable adversarial examples based on perceptually-aligned perturbation
Chen, Hongqiao
Lu, Keda
Wang, Xianmin
Li, Jin
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3295 - 3307
[7] Learning Transferable Adversarial Examples via Ghost Networks
Li, Yingwei
Bai, Song
Zhou, Yuyin
Xie, Cihang
Zhang, Zhishuai
Yuille, Alan
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11458 - 11465
[8] Generating transferable adversarial examples based on perceptually-aligned perturbation
Hongqiao Chen
Keda Lu
Xianmin Wang
Jin Li
International Journal of Machine Learning and Cybernetics, 2021, 12 : 3295 - 3307
[9] A hypothetical defenses-based training framework for generating transferable adversarial examples
Hao, Lingguang
Hao, Kuangrong
Jin, Yaochu
Zhao, Hongzhi
KNOWLEDGE-BASED SYSTEMS, 2024, 305
[10] Efficient Adversarial Training with Transferable Adversarial Examples
Zheng, Haizhong
Zhang, Ziqi
Gu, Juncheng
Lee, Honglak
Prakash, Atul
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1178 - 1187

← 1 2 3 4 5 →