Knowledge-Driven Backdoor Removal in Deep Neural Networks via Reinforcement Learning

被引：0

作者：

Song, Jiayin ^{[1
]}

Li, Yike ^{[1
]}

Tian, Yunzhe ^{[1
]}

Wu, Xingyu ^{[1
]}

Li, Qiong ^{[1
]}

Tong, Endong ^{[1
,2
]}

Niu, Wenjia ^{[1
]}

Zhang, Zhenguo ^{[3
]}

Liu, Jiqiang ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, Beijing 100044, Peoples R China

[2] Beijing Jiaotong Univ, Tangshan Res Inst, Tangshan 063000, Peoples R China

[3] Hebei Boshilin Technol Dev Co Ltd, Shijiazhuang, Hebei, Peoples R China

来源：

KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024 | 2024年 / 14886卷

基金：

中国国家自然科学基金;

关键词：

Backdoor Removal; Reinforcement Learning; Neuron Activate; Backdoor Attack; Deep Learning;

D O I：

10.1007/978-981-97-5498-4_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Backdoor attacks have become a major security threat to deep neural networks (DNNs), promoting significant studies in backdoor removal to mitigate these attacks. However, existing backdoor removal methods often work independently and struggle to generalize across various attacks, which limits their effectiveness when the specific methods used by attackers are unknown. To effectively defend against multiple backdoor attacks, in this paper, we propose the Reinforcement Learning-based Backdoor Removal (RLBR) framework, which integrates multiple defense strategies and dynamically switches various defense methods during the removal process. Driven by the knowledge we observed that a) neuron activation patterns vary significantly under different attacks, and b) these patterns dynamically change during the removal process, we take the neuron activation pattern of the poisoned models as the environment state in the RLBR framework. Besides, we evaluate the defense effectiveness as rewards to guide the selection of optimal defense strategy at each decision point. Through extensive experiments against six state-of-the-art backdoor attacks on two benchmark datasets, RLBR improved defensive performance by 6.91% while maintaining an accuracy of 92.63% on clean datasets, compared to seven baseline backdoor defense methods.

引用

页码：336 / 348

页数：13

共 50 条

[1] Knowledge-Driven Interpretation of Convolutional Neural Networks
Massidda, Riccardo
Bacciu, Davide
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 13713 : 356 - 371
[2] BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning
Chen, Xuan
Guo, Wenbo
Tao, Guanhong
Zhang, Xiangyu
Song, Dawn
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[3] Crafting Binary Protocol Reversing via Deep Learning With Knowledge-Driven Augmentation
Zhao, Sen
Yang, Shouguo
Wang, Zhen
Liu, Yongji
Zhu, Hongsong
Sun, Limin
IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (06) : 5399 - 5414
[4] Deep Learning for Knowledge-Driven Ontology Stream Prediction
Deng, Shumin
Pan, Jeff Z.
Chen, Jiaoyan
Chen, Huajun
KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 52 - 64
[5] Knowledge-Driven Service Offloading Decision for Vehicular Edge Computing: A Deep Reinforcement Learning Approach
Qi, Qi
Wang, Jingyu
Ma, Zhanyu
Sun, Haifeng
Cao, Yufei
Zhang, Lingxin
Liao, Jianxin
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4192 - 4203
[6] Backdoor Attacks on Deep Neural Networks via Transfer Learning from Natural Images
Matsuo, Yuki
Takemoto, Kazuhiro
APPLIED SCIENCES-BASEL, 2022, 12 (24):
[7] Natural Backdoor Attacks on Deep Neural Networks via Raindrops
Zhao, Feng
Zhou, Li
Zhong, Qi
Lan, Rushi
Zhang, Leo Yu
SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
[8] Backdoor Mitigation in Deep Neural Networks via Strategic Retraining
Dhonthi, Akshay
Hahn, Ernst Moritz
Hashemi, Vahid
FORMAL METHODS, FM 2023, 2023, 14000 : 635 - 647
[9] Knowledge-Driven Active Learning
Ciravegna, Gabriele
Precioso, Frederic
Betti, Alessandro
Mottin, Kevin
Gori, Marco
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 38 - 54
[10] Learning Bayesian Networks Structures with an Effective Knowledge-driven GA
Zhang, Weijian
Fang, Wei
Sun, Jun
Chen, Qidong
2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,

← 1 2 3 4 5 →