Defending Against Backdoor Attacks by Layer-wise Feature Analysis (Extended Abstract)

被引:0
|
作者
Jebreel, Najeeb Moharram [1 ]
Domingo-Ferrer, Josep [1 ]
Li, Yiming [2 ]
机构
[1] Univ Rovira Virgili, Tarragona, Spain
[2] Zhejiang Univ, State Key Lab Blockchain & Data Secur, Hangzhou, Zhejiang, Peoples R China
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training deep neural networks (DNNs) usually requires massive training data and computational resources. Users who cannot afford this may prefer to outsource training to a third party or resort to publicly available pre-trained models. Unfortunately, doing so facilitates a new training-time attack (i.e., backdoor attack) against DNNs. This attack aims to induce misclassification of input samples containing adversary-specified trigger patterns. In this paper, we first conduct a layer-wise feature analysis of poisoned and benign samples from the target class. We find out that the feature difference between benign and poisoned samples tends to be maximum at a critical layer, which is not always the one typically used in existing defenses, namely the layer before fully-connected layers. We also demonstrate how to locate this critical layer based on the behaviors of benign samples. We then propose a simple yet effective method to filter poisoned samples by analyzing the feature differences between suspicious and benign samples at the critical layer. Extensive experiments on two benchmark datasets are reported which confirm the effectiveness of our defense.
引用
收藏
页码:8416 / 8420
页数:5
相关论文
共 50 条
  • [41] A layer-wise analysis of Mandarin and English suprasegmentals in SSL speech models
    de la Fuente, Anton
    Jurafsky, Dan
    INTERSPEECH 2024, 2024, : 1290 - 1294
  • [42] Defending Federated Learning from Backdoor Attacks: Anomaly-Aware FedAVG with Layer-Based Aggregation
    Manzoor, Habib Ullah
    Khan, Ahsan Raza
    Sher, Tahir
    Ahmad, Wasim
    Zoha, Ahmed
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [43] Layer-wise mixed models for accurate vibrations analysis of multilayered plates
    Carrera, E
    JOURNAL OF APPLIED MECHANICS-TRANSACTIONS OF THE ASME, 1998, 65 (04): : 820 - 828
  • [44] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
    Zhou, Yefan
    Pang, Tianyu
    Liu, Keqin
    Martin, Charles H.
    Mahoney, Michael W.
    Yang, Yaoqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] Layer-wise mixed models for accurate vibrations analysis of multilayered plate
    Department of Aeronautics and Aerospace Engineering, Politecnico di Torino, Corso Duca degli Abruzzi, 24, Torino, 10129, Italy
    J Appl Mech Trans ASME, 4 (820-828):
  • [46] LAYER-WISE ANALYSIS OF A SELF-SUPERVISED SPEECH REPRESENTATION MODEL
    Pasad, Ankita
    Chou, Ju-Chieh
    Livescu, Karen
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 914 - 921
  • [47] ANALYSIS OF PIEZOELECTRICALLY ACTUATED BEAMS USING A LAYER-WISE DISPLACEMENT THEORY
    ROBBINS, DH
    REDDY, JN
    COMPUTERS & STRUCTURES, 1991, 41 (02) : 265 - 279
  • [48] A layer-wise analysis for free vibrations of thick composite spherical panels
    Dasgupta, A
    Huang, KH
    JOURNAL OF COMPOSITE MATERIALS, 1997, 31 (07) : 658 - 671
  • [49] Efficient layer-wise feature incremental approach for content-based image retrieval system
    Chauhan, Sachendra Singh
    Batra, Shalini
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
  • [50] Feature Selection Based on Layer-Wise Relevance Propagation for EEG-based MI classification
    Nam, Hyeonyeong
    Kim, Jun-Mo
    Kam, Tae-Eui
    2023 11TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI, 2023,