Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

被引:2
|
作者
Wojcik, Bartosz [1 ]
Morawiecki, Pawel [2 ]
Smieja, Marek [1 ]
Krzyzek, Tomasz [1 ]
Spurek, Przemyslaw [1 ]
Tabor, Jacek [1 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, Lojasiewicza 6, PL-30348 Krakow, Poland
[2] Polish Acad Sci, Inst Comp Sci, Jana Kazimierza 5, PL-01248 Warsaw, Poland
关键词
adversarial examples; adversarial attack detection; adversarial noise; robustness; neural networks safety; trustworthy machine learning;
D O I
10.1109/ICTAI52525.2021.00209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. Individual autoencoders at intermediate layers of the target network are trained for this purpose. This describes the manifold of true data and, in consequence, can be used to classify whether a given example has the same characteristics as true data. It also gives insight into the behavior of adversarial examples and their flow through the layers of a deep neural network. Experimental results show that our method outperforms the state of the art in supervised and unsupervised settings.
引用
收藏
页码:1322 / 1326
页数:5
相关论文
共 50 条
  • [1] Evaluation of the Explanatory Power Of Layer-wise Relevance Propagation using Adversarial Examples
    Tamara R. Dieter
    Horst Zisgen
    [J]. Neural Processing Letters, 2023, 55 : 8531 - 8550
  • [2] Evaluation of the Explanatory Power Of Layer-wise Relevance Propagation using Adversarial Examples
    Dieter, Tamara R.
    Zisgen, Horst
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8531 - 8550
  • [3] Layer-wise Adversarial Training Approach to Improve Adversarial Robustness
    Chen, Xiaoyi
    Zhang, Ni
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Layer-wise regularized adversarial training using layers sustainability analysis framework
    Khalooei, Mohammad
    Homayounpour, Mohammad Mehdi
    Amirmazlaghani, Maryam
    [J]. NEUROCOMPUTING, 2023, 540
  • [5] Resilience of Bayesian Layer-Wise Explanations under Adversarial Attacks
    Carbone, Ginevra
    Bortolussi, Luca
    Sanguinetti, Guido
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Adversarial attacks on text classification models using layer-wise relevance propagation
    Xu, Jincheng
    Du, Qingfeng
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (09) : 1397 - 1415
  • [7] Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
    Ma, Haotian
    Zhang, Hao
    Zhou, Fan
    Zhang, Yinqing
    Zhang, Quanshi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [8] Mixed layer-wise models for multilayered plates analysis
    Carrera, E
    [J]. COMPOSITE STRUCTURES, 1998, 43 (01) : 57 - 70
  • [9] FedScrap: Layer-Wise Personalized Federated Learning for Scrap Detection
    Zhang, Weidong
    Deng, Dongshang
    Wang, Lidong
    [J]. ELECTRONICS, 2024, 13 (03)
  • [10] Layer-wise powder deposition defect detection in additive manufacturing
    Hendriks, Attie
    Ramokolo, R.
    Ngobeni, Chris
    Moroko, M.
    Naidoo, Darryl
    [J]. LASER 3D MANUFACTURING VI, 2019, 10909