A Textual Backdoor Defense Method Based on Deep Feature Classification

被引:1
|
作者
Shao, Kun [1 ]
Yang, Junan [1 ]
Hu, Pengjiang [1 ]
Li, Xiaoshuai [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
关键词
deep neural networks; natural language processing; adversarial machine learning; backdoor attacks; backdoor defenses; ATTACKS;
D O I
10.3390/e25020220
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Natural language processing (NLP) models based on deep neural networks (DNNs) are vulnerable to backdoor attacks. Existing backdoor defense methods have limited effectiveness and coverage scenarios. We propose a textual backdoor defense method based on deep feature classification. The method includes deep feature extraction and classifier construction. The method exploits the distinguishability of deep features of poisoned data and benign data. Backdoor defense is implemented in both offline and online scenarios. We conducted defense experiments on two datasets and two models for a variety of backdoor attacks. The experimental results demonstrate the effectiveness of this defense approach and outperform the baseline defense method.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A defense method against backdoor attacks on neural networks
    Kaviani, Sara
    Shamshiri, Samaneh
    Sohn, Insoo
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [22] Detecting Textual Backdoor Attacks via Class Difference for Text Classification System
    Kwon, Hyun
    Lee, Jun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2025, E108D (02) : 114 - 123
  • [23] Deep feature–based automatic classification of mammograms
    Ridhi Arora
    Prateek Kumar Rai
    Balasubramanian Raman
    Medical & Biological Engineering & Computing, 2020, 58 : 1199 - 1211
  • [24] An Invisible Backdoor Attack Based on Semantic Feature
    Chen, Yangming
    Xu, Xiaowei
    Wang, Xiaodong
    Li, Zewen
    Chen, Wenmin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2025,
  • [25] HSRRS Classification Method Based on Deep Transfer Learning And Multi-Feature Fusion
    Wang, Ziteng
    Li, Zhaojie
    Wang, Yu
    Li, Wenmei
    Yang, Jie
    Ohtsuki, Tomoaki
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [26] Autonomous deep feature extraction based method for epileptic EEG brain seizure classification
    Woodbright, Mitchell
    Verma, Brijesh
    Haidar, Ali
    NEUROCOMPUTING, 2021, 444 (444) : 30 - 37
  • [27] Brain Functional Connection Classification Method Based on Prototype Learning and Deep Feature Fusion
    Liang Y.-Z.
    Ji J.-Z.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (02): : 504 - 514
  • [28] Towards robustness evaluation of backdoor defense on quantized deep learning models
    Zhu, Yifan
    Peng, Huaibing
    Fu, Anmin
    Yang, Wei
    Ma, Hua
    Al-Sarawi, Said F.
    Abbott, Derek
    Gao, Yansong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [29] Federated Learning Backdoor Defense Based on Watermark Integrity
    Hou, Yinjian
    Zhao, Yancheng
    Yao, Kaiqi
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 288 - 294
  • [30] Rough set feature selection algorithms for textual case-based classification
    Gupta, Kalyan Moy
    Aha, David W.
    Moore, Philip
    ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2006, 4106 : 166 - 181