Knowledge-Driven Backdoor Removal in Deep Neural Networks via Reinforcement Learning

被引:0
|
作者
Song, Jiayin [1 ]
Li, Yike [1 ]
Tian, Yunzhe [1 ]
Wu, Xingyu [1 ]
Li, Qiong [1 ]
Tong, Endong [1 ,2 ]
Niu, Wenjia [1 ]
Zhang, Zhenguo [3 ]
Liu, Jiqiang [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Tangshan Res Inst, Tangshan 063000, Peoples R China
[3] Hebei Boshilin Technol Dev Co Ltd, Shijiazhuang, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Backdoor Removal; Reinforcement Learning; Neuron Activate; Backdoor Attack; Deep Learning;
D O I
10.1007/978-981-97-5498-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Backdoor attacks have become a major security threat to deep neural networks (DNNs), promoting significant studies in backdoor removal to mitigate these attacks. However, existing backdoor removal methods often work independently and struggle to generalize across various attacks, which limits their effectiveness when the specific methods used by attackers are unknown. To effectively defend against multiple backdoor attacks, in this paper, we propose the Reinforcement Learning-based Backdoor Removal (RLBR) framework, which integrates multiple defense strategies and dynamically switches various defense methods during the removal process. Driven by the knowledge we observed that a) neuron activation patterns vary significantly under different attacks, and b) these patterns dynamically change during the removal process, we take the neuron activation pattern of the poisoned models as the environment state in the RLBR framework. Besides, we evaluate the defense effectiveness as rewards to guide the selection of optimal defense strategy at each decision point. Through extensive experiments against six state-of-the-art backdoor attacks on two benchmark datasets, RLBR improved defensive performance by 6.91% while maintaining an accuracy of 92.63% on clean datasets, compared to seven baseline backdoor defense methods.
引用
收藏
页码:336 / 348
页数:13
相关论文
共 50 条
  • [41] Adaptive Backdoor Attack against Deep Neural Networks
    He, Honglu
    Zhu, Zhiying
    Zhang, Xinpeng
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (03): : 2617 - 2633
  • [42] Knowledge-based recurrent neural networks in reinforcement learning
    Le, Tien Dung
    Komeda, Takashi
    Takagi, Motoki
    PROCEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174
  • [43] Reinforcement Learning and Deep Neural Networks for PI Controller Tuning
    Shipman, William J.
    Coetzee, Loutjie C.
    IFAC PAPERSONLINE, 2019, 52 (14): : 111 - 116
  • [44] Open-world electrocardiogram classification via domain knowledge-driven contrastive learning
    Zhou, Shuang
    Huang, Xiao
    Liu, Ninghao
    Zhang, Wen
    Zhang, Yuan-Ting
    Chung, Fu-Lai
    NEURAL NETWORKS, 2024, 179
  • [45] Deep Auto-Encoder Neural Networks in Reinforcement Learning
    Lange, Sascha
    Riedmiller, Martin
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [46] Knowledge-Driven Resource Allocation for Wireless Networks: A WMMSE Unrolled Graph Neural Network Approach
    Yang, Hao
    Cheng, Nan
    Sun, Ruijin
    Quan, Wei
    Chai, Rong
    Aldubaikhy, Khalid
    Alqasir, Abdullah
    Shen, Xuemin
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 18902 - 18916
  • [47] Benchmarking knowledge-driven zero-shot learning
    Geng, Yuxia
    Chen, Jiaoyan
    Zhuang, Xiang
    Chen, Zhuo
    Pan, Jeff Z.
    Li, Juan
    Yuan, Zonggang
    Chen, Huajun
    JOURNAL OF WEB SEMANTICS, 2023, 75
  • [48] Knowledge-Driven Transfer Learning for Tree Species Recognition
    Chattoraj, Joyjit
    Yang, Feng
    Lim, Chi Wan
    Gobeawan, Like
    Liu, Xuan
    Raghavan, Venugopalan S. G.
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 149 - 154
  • [49] Static Neural Compiler Optimization via Deep Reinforcement Learning
    Mammadli, Rahim
    Jannesari, Ali
    Wolf, Felix
    PROCEEDINGS OF SIXTH WORKSHOP ON THE LLVM COMPILER INFRASTRUCTURE IN HPC AND WORKSHOP ON HIERARCHICAL PARALLELISM FOR EXASCALE COMPUTING (LLVM-HPC2020 AND HIPAR 2020), 2020, : 1 - 11
  • [50] Knowledge-Driven Meta-Learning for CSI Feedback
    Xiao, Han
    Tian, Wenqiang
    Liu, Wendong
    Guo, Jiajia
    Zhang, Zhi
    Jin, Shi
    Shi, Zhihua
    Guo, Li
    Shen, Jia
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 5694 - 5709