A Textual Backdoor Defense Method Based on Deep Feature Classification

Cited by: 1
Authors
Shao, Kun [1]
Yang, Junan [1]
Hu, Pengjiang [1]
Li, Xiaoshuai [1]
Affiliation
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
Keywords
deep neural networks; natural language processing; adversarial machine learning; backdoor attacks; backdoor defenses; ATTACKS;
DOI
10.3390/e25020220
Chinese Library Classification number
O4 [Physics]
Discipline classification code
0702
Abstract
Natural language processing (NLP) models based on deep neural networks (DNNs) are vulnerable to backdoor attacks. Existing backdoor defense methods are limited in both effectiveness and the scenarios they cover. We propose a textual backdoor defense method based on deep feature classification, which comprises deep feature extraction and classifier construction. The method exploits the distinguishability of the deep features of poisoned data and benign data, and implements backdoor defense in both offline and online scenarios. We conducted defense experiments on two datasets and two models against a variety of backdoor attacks. The experimental results demonstrate the effectiveness of this defense approach, which outperforms the baseline defense method.
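The idea summarized in the abstract (extract deep features, then train a classifier that separates poisoned from benign samples) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the "deep features" are synthetic Gaussian vectors standing in for a model's hidden-layer activations, the classifier is a plain logistic regression, and the names `train_detector`, `poison_score`, and `accept` are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_detector(feats, labels, lr=0.1, epochs=300):
    """Fit logistic regression by batch gradient descent.

    Labels: 1 = poisoned, 0 = benign. The classifier choice is an
    assumption; the abstract does not say which classifier is used.
    """
    dim, n = len(feats[0]), len(feats)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * dim, 0.0
        for x, y in zip(feats, labels):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            for j in range(dim):
                gw[j] += err * x[j]
            gb += err
        w = [wi - lr * g / n for wi, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def poison_score(model, x):
    """Return the estimated probability that feature vector x is poisoned."""
    w, b = model
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


# Synthetic stand-in for deep features (assumption): in practice these
# would be hidden-layer activations extracted from the NLP model.
rng = random.Random(0)
DIM = 8


def fake_features(center, count):
    return [[rng.gauss(center, 1.0) for _ in range(DIM)] for _ in range(count)]


feats = fake_features(0.0, 100) + fake_features(2.0, 100)
labels = [0] * 100 + [1] * 100

model = train_detector(feats, labels)
flags = [poison_score(model, x) >= 0.5 for x in feats]
accuracy = sum(f == bool(y) for f, y in zip(flags, labels)) / len(labels)

# Offline scenario: drop flagged samples from a dataset before training.
cleaned = [x for x, f in zip(feats, flags) if not f]


# Online scenario: decide at inference time whether to accept an input.
def accept(x, threshold=0.5):
    return poison_score(model, x) < threshold
```

The same trained detector serves both scenarios in this sketch: filtering a stored dataset offline, and gating individual inputs online. How the paper actually obtains labeled poisoned features and which layer it reads them from is not stated in the abstract.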
Pages: 13
Related papers
50 in total
  • [31] A lightweight backdoor defense framework based on image inpainting
    Wei, Yier
    Gao, Haichang
    Wang, Yufei
    Gao, Yipeng
    Liu, Huan
    NEUROCOMPUTING, 2023, 537 : 22 - 36
  • [32] DB-COVIDNet: A Defense Method against Backdoor Attacks
    Shamshiri, Samaneh
    Han, Ki Jin
    Sohn, Insoo
    MATHEMATICS, 2023, 11 (20)
  • [33] Feature Attention Distillation Defense for Backdoor Attack in Artificial-Neural-Network-Based Electricity Theft Detection
    Li, Shizhong
    Meng, Wenchao
    Liu, Chen
    He, Shibo
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24) : 39880 - 39889
  • [34] Backdoor Attacks on Image Classification Models in Deep Neural Networks
    Zhang, Quanxin
    Ma, Wencong
    Wang, Yajie
    Zhang, Yaoyuan
    Shi, Zhiwei
    Li, Yuanzhang
    CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (02) : 199 - 212
  • [35] A deep feature manifold embedding method for hyperspectral image classification
    Liu, Jiamin
    Yang, Song
    Huang, Hong
    Li, Zhengying
    Shi, Guangyao
    REMOTE SENSING LETTERS, 2020, 11 (07) : 620 - 629
  • [36] Deep feature-based automatic classification of mammograms
    Arora, Ridhi
    Rai, Prateek Kumar
    Raman, Balasubramanian
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (06) : 1199 - 1211
  • [37] IMAGE CLASSIFICATION BASED ON DEEP LOCAL FEATURE CODING
    Wang, Qian
    Zhu, Jianqing
    Shao, Wei
    Wang, Lei
    Zhu, Xiaobin
    2017 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2017), 2017 : 480 - 485
  • [38] A deep feature based framework for breast masses classification
    Jiao, Zhicheng
    Gao, Xinbo
    Wang, Ying
    Li, Jie
    NEUROCOMPUTING, 2016, 197 : 221 - 231
  • [40] Diffense: Defense Against Backdoor Attacks on Deep Neural Networks With Latent Diffusion
    Hu, Bowen
    Chang, Chip-Hong
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2024, 14 (04) : 729 - 742