Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

被引：2

作者：

Wojcik, Bartosz ^{[1
]}

Morawiecki, Pawel ^{[2
]}

Smieja, Marek ^{[1
]}

Krzyzek, Tomasz ^{[1
]}

Spurek, Przemyslaw ^{[1
]}

Tabor, Jacek ^{[1
]}

机构：

[1] Jagiellonian Univ, Fac Math & Comp Sci, Lojasiewicza 6, PL-30348 Krakow, Poland

[2] Polish Acad Sci, Inst Comp Sci, Jana Kazimierza 5, PL-01248 Warsaw, Poland

来源：

2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021) | 2021年

关键词：

adversarial examples; adversarial attack detection; adversarial noise; robustness; neural networks safety; trustworthy machine learning;

D O I：

10.1109/ICTAI52525.2021.00209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. Individual autoencoders at intermediate layers of the target network are trained for this purpose. This describes the manifold of true data and, in consequence, can be used to classify whether a given example has the same characteristics as true data. It also gives insight into the behavior of adversarial examples and their flow through the layers of a deep neural network. Experimental results show that our method outperforms the state of the art in supervised and unsupervised settings.

引用

页码：1322 / 1326

页数：5

共 50 条

[1] Evaluation of the Explanatory Power Of Layer-wise Relevance Propagation using Adversarial Examples
Tamara R. Dieter
Horst Zisgen
[J]. Neural Processing Letters, 2023, 55 : 8531 - 8550
[2] Evaluation of the Explanatory Power Of Layer-wise Relevance Propagation using Adversarial Examples
Dieter, Tamara R.
Zisgen, Horst
[J]. NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8531 - 8550
[3] Layer-wise Adversarial Training Approach to Improve Adversarial Robustness
Chen, Xiaoyi
Zhang, Ni
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[4] Layer-wise regularized adversarial training using layers sustainability analysis framework
Khalooei, Mohammad
Homayounpour, Mohammad Mehdi
Amirmazlaghani, Maryam
[J]. NEUROCOMPUTING, 2023, 540
[5] Resilience of Bayesian Layer-Wise Explanations under Adversarial Attacks
Carbone, Ginevra
Bortolussi, Luca
Sanguinetti, Guido
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[6] Adversarial attacks on text classification models using layer-wise relevance propagation
Xu, Jincheng
Du, Qingfeng
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (09) : 1397 - 1415
[7] Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
Ma, Haotian
Zhang, Hao
Zhou, Fan
Zhang, Yinqing
Zhang, Quanshi
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[8] Mixed layer-wise models for multilayered plates analysis
Carrera, E
[J]. COMPOSITE STRUCTURES, 1998, 43 (01) : 57 - 70
[9] FedScrap: Layer-Wise Personalized Federated Learning for Scrap Detection
Zhang, Weidong
Deng, Dongshang
Wang, Lidong
[J]. ELECTRONICS, 2024, 13 (03)
[10] Layer-wise powder deposition defect detection in additive manufacturing
Hendriks, Attie
Ramokolo, R.
Ngobeni, Chris
Moroko, M.
Naidoo, Darryl
[J]. LASER 3D MANUFACTURING VI, 2019, 10909

← 1 2 3 4 5 →