Backdoor Attacks via Machine Unlearning

被引:0
|
作者
Liu, Zihao [1 ]
Wang, Tianhao [2 ]
Huai, Mengdi [1 ]
Miao, Chenglin [1 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a new paradigm to erase data from a model and protect user privacy, machine unlearning has drawn significant attention. However, existing studies on machine unlearning mainly focus on its effectiveness and efficiency, neglecting the security challenges introduced by this technique. In this paper, we aim to bridge this gap and study the possibility of conducting malicious attacks leveraging machine unlearning. Specifically, we consider the backdoor attack via machine unlearning, where an attacker seeks to inject a backdoor in the unlearned model by submitting malicious unlearning requests, so that the prediction made by the unlearned model can be changed when a particular trigger presents. In our study, we propose two attack approaches. The first attack approach does not require the attacker to poison any training data of the model. The attacker can achieve the attack goal only by requesting to unlearn a small subset of his contributed training data. The second approach allows the attacker to poison a few training instances with a pre-defined trigger upfront, and then activate the attack via submitting a malicious unlearning request. Both attack approaches are proposed with the goal of maximizing the attack utility while ensuring attack stealthiness. The effectiveness of the proposed attacks is demonstrated with different machine unlearning algorithms as well as different models on different datasets.
引用
收藏
页码:14115 / 14123
页数:9
相关论文
共 50 条
  • [31] Backdoor Pony: Evaluating backdoor attacks and defenses in different domains
    Mercier, Arthur
    Smolin, Nikita
    Sihlovec, Oliver
    Koffas, Stefanos
    Picek, Stjepan
    SOFTWAREX, 2023, 22
  • [32] Backdoor smoothing: Demystifying backdoor attacks on deep neural networks
    Grosse, Kathrin
    Lee, Taesung
    Biggio, Battista
    Park, Youngja
    Backes, Michael
    Molloy, Ian
    COMPUTERS & SECURITY, 2022, 120
  • [33] Backdoor smoothing: Demystifying backdoor attacks on deep neural networks
    Grosse, Kathrin
    Lee, Taesung
    Biggio, Battista
    Park, Youngja
    Backes, Michael
    Molloy, Ian
    Computers and Security, 2022, 120
  • [34] Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation
    Kim, Hyunjune
    Lee, Sangyong
    Woo, Simon S.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21241 - 21248
  • [35] Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective
    Qin, Zhen
    Chen, Feiyi
    Zhi, Chen
    Yan, Xueqiang
    Deng, Shuiguang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14677 - 14685
  • [36] Label-only membership inference attacks on machine unlearning without dependence of posteriors
    Lu, Zhaobo
    Liang, Hai
    Zhao, Minghao
    Lv, Qingzhe
    Liang, Tiancai
    Wang, Yilei
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 9424 - 9441
  • [37] Coded Machine Unlearning
    Aldaghri, Nasser
    Mahdavifar, Hessam
    Beirami, Ahmad
    IEEE ACCESS, 2021, 9 : 88137 - 88150
  • [38] Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger
    Yu, Yi
    Wang, Yufei
    Yang, Wenhan
    Lu, Shijian
    Tan, Yap-Peng
    Kot, Alex C.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12250 - 12259
  • [39] Toward Stealthy Backdoor Attacks Against Speech Recognition via Elements of Sound
    Cai, Hanbo
    Zhang, Pengcheng
    Dong, Hai
    Xiao, Yan
    Koffas, Stefanos
    Li, Yiming
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5852 - 5866
  • [40] BadSAM: Exploring Security Vulnerabilities of SAM via Backdoor Attacks (Student Abstract)
    Guan, Zihan
    Hu, Mengxuan
    Zhou, Zhongliang
    Zhang, Jielu
    Li, Sheng
    Liu, Ninghao
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23506 - 23507