Knowledge-Driven Backdoor Removal in Deep Neural Networks via Reinforcement Learning

被引:0
|
作者
Song, Jiayin [1 ]
Li, Yike [1 ]
Tian, Yunzhe [1 ]
Wu, Xingyu [1 ]
Li, Qiong [1 ]
Tong, Endong [1 ,2 ]
Niu, Wenjia [1 ]
Zhang, Zhenguo [3 ]
Liu, Jiqiang [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Tangshan Res Inst, Tangshan 063000, Peoples R China
[3] Hebei Boshilin Technol Dev Co Ltd, Shijiazhuang, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Backdoor Removal; Reinforcement Learning; Neuron Activate; Backdoor Attack; Deep Learning;
D O I
10.1007/978-981-97-5498-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Backdoor attacks have become a major security threat to deep neural networks (DNNs), promoting significant studies in backdoor removal to mitigate these attacks. However, existing backdoor removal methods often work independently and struggle to generalize across various attacks, which limits their effectiveness when the specific methods used by attackers are unknown. To effectively defend against multiple backdoor attacks, in this paper, we propose the Reinforcement Learning-based Backdoor Removal (RLBR) framework, which integrates multiple defense strategies and dynamically switches various defense methods during the removal process. Driven by the knowledge we observed that a) neuron activation patterns vary significantly under different attacks, and b) these patterns dynamically change during the removal process, we take the neuron activation pattern of the poisoned models as the environment state in the RLBR framework. Besides, we evaluate the defense effectiveness as rewards to guide the selection of optimal defense strategy at each decision point. Through extensive experiments against six state-of-the-art backdoor attacks on two benchmark datasets, RLBR improved defensive performance by 6.91% while maintaining an accuracy of 92.63% on clean datasets, compared to seven baseline backdoor defense methods.
引用
收藏
页码:336 / 348
页数:13
相关论文
共 50 条
  • [1] Knowledge-Driven Interpretation of Convolutional Neural Networks
    Massidda, Riccardo
    Bacciu, Davide
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 13713 : 356 - 371
  • [2] BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning
    Chen, Xuan
    Guo, Wenbo
    Tao, Guanhong
    Zhang, Xiangyu
    Song, Dawn
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Crafting Binary Protocol Reversing via Deep Learning With Knowledge-Driven Augmentation
    Zhao, Sen
    Yang, Shouguo
    Wang, Zhen
    Liu, Yongji
    Zhu, Hongsong
    Sun, Limin
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (06) : 5399 - 5414
  • [4] Deep Learning for Knowledge-Driven Ontology Stream Prediction
    Deng, Shumin
    Pan, Jeff Z.
    Chen, Jiaoyan
    Chen, Huajun
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 52 - 64
  • [5] Knowledge-Driven Service Offloading Decision for Vehicular Edge Computing: A Deep Reinforcement Learning Approach
    Qi, Qi
    Wang, Jingyu
    Ma, Zhanyu
    Sun, Haifeng
    Cao, Yufei
    Zhang, Lingxin
    Liao, Jianxin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4192 - 4203
  • [6] Backdoor Attacks on Deep Neural Networks via Transfer Learning from Natural Images
    Matsuo, Yuki
    Takemoto, Kazuhiro
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [7] Natural Backdoor Attacks on Deep Neural Networks via Raindrops
    Zhao, Feng
    Zhou, Li
    Zhong, Qi
    Lan, Rushi
    Zhang, Leo Yu
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [8] Backdoor Mitigation in Deep Neural Networks via Strategic Retraining
    Dhonthi, Akshay
    Hahn, Ernst Moritz
    Hashemi, Vahid
    FORMAL METHODS, FM 2023, 2023, 14000 : 635 - 647
  • [9] Knowledge-Driven Active Learning
    Ciravegna, Gabriele
    Precioso, Frederic
    Betti, Alessandro
    Mottin, Kevin
    Gori, Marco
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 38 - 54
  • [10] Learning Bayesian Networks Structures with an Effective Knowledge-driven GA
    Zhang, Weijian
    Fang, Wei
    Sun, Jun
    Chen, Qidong
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,