A Textual Backdoor Defense Method Based on Deep Feature Classification

被引:1
|
作者
Shao, Kun [1 ]
Yang, Junan [1 ]
Hu, Pengjiang [1 ]
Li, Xiaoshuai [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
关键词
deep neural networks; natural language processing; adversarial machine learning; backdoor attacks; backdoor defenses; ATTACKS;
D O I
10.3390/e25020220
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Natural language processing (NLP) models based on deep neural networks (DNNs) are vulnerable to backdoor attacks. Existing backdoor defense methods have limited effectiveness and coverage scenarios. We propose a textual backdoor defense method based on deep feature classification. The method includes deep feature extraction and classifier construction. The method exploits the distinguishability of deep features of poisoned data and benign data. Backdoor defense is implemented in both offline and online scenarios. We conducted defense experiments on two datasets and two models for a variety of backdoor attacks. The experimental results demonstrate the effectiveness of this defense approach and outperform the baseline defense method.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Malware classification method based on feature fusion
    Yan, Hao
    Zhang, Jian
    Tang, Zhangguo
    Long, Hancheng
    Zhu, Min
    Zhang, Tianyue
    Luo, Linglong
    Li, Huanzhou
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2025, 24 (02)
  • [42] A Similarity-based deep Feature extraction method using Divide and Conquer for image classification
    Jang, Seunghui
    Kim, Yanggon
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1132 - 1135
  • [43] Research on data classification and feature fusion method of cancer nuclei image based on deep learning
    Liu, Shanshan
    Hu, Ruo
    Wu, Jianfang
    Zhang, Xizheng
    He, Jun
    Zhao, Huimin
    Wang, Huajia
    Li, Xiangjun
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (03) : 969 - 981
  • [44] Hyperspectral Image Spectral-Spatial Classification Method Based on Deep Adaptive Feature Fusion
    Mu, Caihong
    Liu, Yijin
    Liu, Yi
    REMOTE SENSING, 2021, 13 (04) : 1 - 21
  • [45] Intelligent Hybrid Feature Selection for Textual Sentiment Classification
    Khan, Jawad
    Alam, Aftab
    Lee, Youngmoon
    IEEE ACCESS, 2021, 9 : 140590 - 140608
  • [46] Interpretability-Guided Defense Against Backdoor Attacks to Deep Neural Networks
    Jiang, Wei
    Wen, Xiangyu
    Zhan, Jinyu
    Wang, Xupeng
    Song, Ziwei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (08) : 2611 - 2624
  • [47] Survey of Backdoor Attack and Defense Algorithms Based on Federated Learning
    Liu, Jialang
    Guo, Yanming
    Lao, Mingrui
    Yu, Tianyuan
    Wu, Yulun
    Feng, Yunhao
    Wu, Jiazhuang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (10): : 2607 - 2626
  • [48] FederatedReverse: A Detection and Defense Method Against Backdoor Attacks in Federated Learning
    Zhao, Chen
    Wen, Yu
    Li, Shuailou
    Liu, Fucheng
    Meng, Dan
    PROCEEDINGS OF THE 2021 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2021, 2021, : 51 - 62
  • [49] Method of Feature Reduction in Short Text Classification Based on Feature Clustering
    Li, Fangfang
    Yin, Yao
    Shi, Jinjing
    Mao, Xingliang
    Shi, Ronghua
    APPLIED SCIENCES-BASEL, 2019, 9 (08):
  • [50] Accurate deep and direction classification model based on the antiprism graph pattern feature generator using underwater acoustic for defense system
    Orhan Yaman
    Turker Tuncer
    Multimedia Tools and Applications, 2023, 82 : 9961 - 9985