Defending Against Backdoor Attacks by Layer-wise Feature Analysis (Extended Abstract)

被引:0
|
作者
Jebreel, Najeeb Moharram [1 ]
Domingo-Ferrer, Josep [1 ]
Li, Yiming [2 ]
机构
[1] Univ Rovira Virgili, Tarragona, Spain
[2] Zhejiang Univ, State Key Lab Blockchain & Data Secur, Hangzhou, Zhejiang, Peoples R China
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training deep neural networks (DNNs) usually requires massive training data and computational resources. Users who cannot afford this may prefer to outsource training to a third party or resort to publicly available pre-trained models. Unfortunately, doing so facilitates a new training-time attack (i.e., backdoor attack) against DNNs. This attack aims to induce misclassification of input samples containing adversary-specified trigger patterns. In this paper, we first conduct a layer-wise feature analysis of poisoned and benign samples from the target class. We find out that the feature difference between benign and poisoned samples tends to be maximum at a critical layer, which is not always the one typically used in existing defenses, namely the layer before fully-connected layers. We also demonstrate how to locate this critical layer based on the behaviors of benign samples. We then propose a simple yet effective method to filter poisoned samples by analyzing the feature differences between suspicious and benign samples at the critical layer. Extensive experiments on two benchmark datasets are reported which confirm the effectiveness of our defense.
引用
收藏
页码:8416 / 8420
页数:5
相关论文
共 50 条
  • [31] ANALYSIS OF LAMINATED BEAMS WITH A LAYER-WISE CONSTANT SHEAR THEORY
    DAVALOS, JF
    KIM, YC
    BARBERO, EJ
    COMPOSITE STRUCTURES, 1994, 28 (03) : 241 - 253
  • [32] Defending against attacks tailored to transfer learning via feature distancing
    Ji, Sangwoo
    Park, Namgyu
    Na, Dongbin
    Zhu, Bin
    Kim, Jong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 223
  • [33] Exploring Fine-Grained Feature Analysis for Bird Species Classification using Layer-wise Relevance Propagation
    Arquilla, Kyle
    Gajera, Ishan Dilipbhai
    Darling, Melanie
    Bhati, Deepshikha
    Singh, Aditi
    Guercio, Angela
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0625 - 0631
  • [34] Personalized Federated Learning with Layer-Wise Feature Transformation via Meta-Learning
    Tu, Jingke
    Huang, Jiaming
    Yang, Lei
    Lin, Wanyu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [35] Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks
    Xi, Zhaohan
    Du, Tianyu
    Li, Changjiang
    Pang, Ren
    Ji, Shouling
    Chen, Jinghui
    Ma, Fenglong
    Wang, Ting
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] High-dimensional neural feature design for layer-wise reduction of training cost
    Javid, Alireza M.
    Venkitaraman, Arun
    Skoglund, Mikael
    Chatterjee, Saikat
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
  • [37] Unsupervised layer-wise feature extraction algorithm for surface electromyography based on information theory
    Li, Mingqiang
    Liu, Ziwen
    Tang, Siqi
    Ge, Jianjun
    Zhang, Feng
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [38] High-dimensional neural feature design for layer-wise reduction of training cost
    Alireza M. Javid
    Arun Venkitaraman
    Mikael Skoglund
    Saikat Chatterjee
    EURASIP Journal on Advances in Signal Processing, 2020
  • [39] Layer-wise analysis for free vibrations of thick composite spherical panels
    Univ of Maryland, Coll Park, United States
    J Compos Mater, 7 (658-671):
  • [40] Layer-wise analysis for free vibration of thick composite cylindrical shells
    Huang, K.H.
    Dasgupta, A.
    Journal of Sound and Vibration, 1995, 186 (02):