Backdoor Attacks via Machine Unlearning

被引：0

作者：

Liu, Zihao ^{[1
]}

Wang, Tianhao ^{[2
]}

Huai, Mengdi ^{[1
]}

Miao, Chenglin ^{[1
]}

机构：

[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA

[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13 | 2024年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a new paradigm to erase data from a model and protect user privacy, machine unlearning has drawn significant attention. However, existing studies on machine unlearning mainly focus on its effectiveness and efficiency, neglecting the security challenges introduced by this technique. In this paper, we aim to bridge this gap and study the possibility of conducting malicious attacks leveraging machine unlearning. Specifically, we consider the backdoor attack via machine unlearning, where an attacker seeks to inject a backdoor in the unlearned model by submitting malicious unlearning requests, so that the prediction made by the unlearned model can be changed when a particular trigger presents. In our study, we propose two attack approaches. The first attack approach does not require the attacker to poison any training data of the model. The attacker can achieve the attack goal only by requesting to unlearn a small subset of his contributed training data. The second approach allows the attacker to poison a few training instances with a pre-defined trigger upfront, and then activate the attack via submitting a malicious unlearning request. Both attack approaches are proposed with the goal of maximizing the attack utility while ensuring attack stealthiness. The effectiveness of the proposed attacks is demonstrated with different machine unlearning algorithms as well as different models on different datasets.

引用

页码：14115 / 14123

页数：9

共 50 条

[31] Backdoor Pony: Evaluating backdoor attacks and defenses in different domains
Mercier, Arthur
Smolin, Nikita
Sihlovec, Oliver
Koffas, Stefanos
Picek, Stjepan
SOFTWAREX, 2023, 22
[32] Backdoor smoothing: Demystifying backdoor attacks on deep neural networks
Grosse, Kathrin
Lee, Taesung
Biggio, Battista
Park, Youngja
Backes, Michael
Molloy, Ian
COMPUTERS & SECURITY, 2022, 120
[33] Backdoor smoothing: Demystifying backdoor attacks on deep neural networks
Grosse, Kathrin
Lee, Taesung
Biggio, Battista
Park, Youngja
Backes, Michael
Molloy, Ian
Computers and Security, 2022, 120
[34] Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation
Kim, Hyunjune
Lee, Sangyong
Woo, Simon S.
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21241 - 21248
[35] Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective
Qin, Zhen
Chen, Feiyi
Zhi, Chen
Yan, Xueqiang
Deng, Shuiguang
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14677 - 14685
[36] Label-only membership inference attacks on machine unlearning without dependence of posteriors
Lu, Zhaobo
Liang, Hai
Zhao, Minghao
Lv, Qingzhe
Liang, Tiancai
Wang, Yilei
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 9424 - 9441
[37] Coded Machine Unlearning
Aldaghri, Nasser
Mahdavifar, Hessam
Beirami, Ahmad
IEEE ACCESS, 2021, 9 : 88137 - 88150
[38] Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger
Yu, Yi
Wang, Yufei
Yang, Wenhan
Lu, Shijian
Tan, Yap-Peng
Kot, Alex C.
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12250 - 12259
[39] Toward Stealthy Backdoor Attacks Against Speech Recognition via Elements of Sound
Cai, Hanbo
Zhang, Pengcheng
Dong, Hai
Xiao, Yan
Koffas, Stefanos
Li, Yiming
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5852 - 5866
[40] BadSAM: Exploring Security Vulnerabilities of SAM via Backdoor Attacks (Student Abstract)
Guan, Zihan
Hu, Mengxuan
Zhou, Zhongliang
Zhang, Jielu
Li, Sheng
Liu, Ninghao
THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23506 - 23507

← 1 2 3 4 5 →