Knowledge-Driven Backdoor Removal in Deep Neural Networks via Reinforcement Learning

被引：0

作者：

Song, Jiayin ^{[1
]}

Li, Yike ^{[1
]}

Tian, Yunzhe ^{[1
]}

Wu, Xingyu ^{[1
]}

Li, Qiong ^{[1
]}

Tong, Endong ^{[1
,2
]}

Niu, Wenjia ^{[1
]}

Zhang, Zhenguo ^{[3
]}

Liu, Jiqiang ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing 100044, Peoples R China

[2] Beijing Jiaotong Univ, Tangshan Res Inst, Tangshan 063000, Peoples R China

[3] Hebei Boshilin Technol Dev Co Ltd, Shijiazhuang, Hebei, Peoples R China

来源：

KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024 | 2024年 / 14886卷

基金：

中国国家自然科学基金;

关键词：

Backdoor Removal; Reinforcement Learning; Neuron Activate; Backdoor Attack; Deep Learning;

D O I：

10.1007/978-981-97-5498-4_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Backdoor attacks have become a major security threat to deep neural networks (DNNs), promoting significant studies in backdoor removal to mitigate these attacks. However, existing backdoor removal methods often work independently and struggle to generalize across various attacks, which limits their effectiveness when the specific methods used by attackers are unknown. To effectively defend against multiple backdoor attacks, in this paper, we propose the Reinforcement Learning-based Backdoor Removal (RLBR) framework, which integrates multiple defense strategies and dynamically switches various defense methods during the removal process. Driven by the knowledge we observed that a) neuron activation patterns vary significantly under different attacks, and b) these patterns dynamically change during the removal process, we take the neuron activation pattern of the poisoned models as the environment state in the RLBR framework. Besides, we evaluate the defense effectiveness as rewards to guide the selection of optimal defense strategy at each decision point. Through extensive experiments against six state-of-the-art backdoor attacks on two benchmark datasets, RLBR improved defensive performance by 6.91% while maintaining an accuracy of 92.63% on clean datasets, compared to seven baseline backdoor defense methods.

引用

页码：336 / 348

页数：13

共 50 条

[41] Adaptive Backdoor Attack against Deep Neural Networks
He, Honglu
Zhu, Zhiying
Zhang, Xinpeng
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (03): : 2617 - 2633
[42] Knowledge-based recurrent neural networks in reinforcement learning
Le, Tien Dung
Komeda, Takashi
Takagi, Motoki
PROCEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174
[43] Reinforcement Learning and Deep Neural Networks for PI Controller Tuning
Shipman, William J.
Coetzee, Loutjie C.
IFAC PAPERSONLINE, 2019, 52 (14): : 111 - 116
[44] Open-world electrocardiogram classification via domain knowledge-driven contrastive learning
Zhou, Shuang
Huang, Xiao
Liu, Ninghao
Zhang, Wen
Zhang, Yuan-Ting
Chung, Fu-Lai
NEURAL NETWORKS, 2024, 179
[45] Deep Auto-Encoder Neural Networks in Reinforcement Learning
Lange, Sascha
Riedmiller, Martin
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[46] Knowledge-Driven Resource Allocation for Wireless Networks: A WMMSE Unrolled Graph Neural Network Approach
Yang, Hao
Cheng, Nan
Sun, Ruijin
Quan, Wei
Chai, Rong
Aldubaikhy, Khalid
Alqasir, Abdullah
Shen, Xuemin
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 18902 - 18916
[47] Benchmarking knowledge-driven zero-shot learning
Geng, Yuxia
Chen, Jiaoyan
Zhuang, Xiang
Chen, Zhuo
Pan, Jeff Z.
Li, Juan
Yuan, Zonggang
Chen, Huajun
JOURNAL OF WEB SEMANTICS, 2023, 75
[48] Knowledge-Driven Transfer Learning for Tree Species Recognition
Chattoraj, Joyjit
Yang, Feng
Lim, Chi Wan
Gobeawan, Like
Liu, Xuan
Raghavan, Venugopalan S. G.
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 149 - 154
[49] Static Neural Compiler Optimization via Deep Reinforcement Learning
Mammadli, Rahim
Jannesari, Ali
Wolf, Felix
PROCEEDINGS OF SIXTH WORKSHOP ON THE LLVM COMPILER INFRASTRUCTURE IN HPC AND WORKSHOP ON HIERARCHICAL PARALLELISM FOR EXASCALE COMPUTING (LLVM-HPC2020 AND HIPAR 2020), 2020, : 1 - 11
[50] Knowledge-Driven Meta-Learning for CSI Feedback
Xiao, Han
Tian, Wenqiang
Liu, Wendong
Guo, Jiajia
Zhang, Zhi
Jin, Shi
Shi, Zhihua
Guo, Li
Shen, Jia
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 5694 - 5709

← 1 2 3 4 5 →