Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

被引:2
|
作者
Wojcik, Bartosz [1 ]
Morawiecki, Pawel [2 ]
Smieja, Marek [1 ]
Krzyzek, Tomasz [1 ]
Spurek, Przemyslaw [1 ]
Tabor, Jacek [1 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, Lojasiewicza 6, PL-30348 Krakow, Poland
[2] Polish Acad Sci, Inst Comp Sci, Jana Kazimierza 5, PL-01248 Warsaw, Poland
关键词
adversarial examples; adversarial attack detection; adversarial noise; robustness; neural networks safety; trustworthy machine learning;
D O I
10.1109/ICTAI52525.2021.00209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. Individual autoencoders at intermediate layers of the target network are trained for this purpose. This describes the manifold of true data and, in consequence, can be used to classify whether a given example has the same characteristics as true data. It also gives insight into the behavior of adversarial examples and their flow through the layers of a deep neural network. Experimental results show that our method outperforms the state of the art in supervised and unsupervised settings.
引用
收藏
页码:1322 / 1326
页数:5
相关论文
共 50 条
  • [11] Unsupervised Layer-Wise Score Aggregation for Textual OOD Detection
    Darrin, Maxime
    Staerman, Guillaume
    Gomes, Eduardo Dadalto Camara
    Gomes, Camara
    Cheung, Jackie Ck
    Piantanida, Pablo
    Colombo, Pierre
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17880 - 17888
  • [12] WAYS OF IMPROVING LAYER-WISE CARBONISATION
    SYSKOV, KI
    RAKHANSK.PD
    [J]. COKE & CHEMISTRY USSR, 1970, (07): : 13 - &
  • [13] Defending Against Backdoor Attacks by Layer-wise Feature Analysis
    Jebreel, Najeeb Moharram
    Domingo-Ferrer, Josep
    Li, Yiming
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 428 - 440
  • [14] A layer-wise triangle for analysis of laminated composite plates and shells
    Botello, S
    Oñate, E
    Canet, JM
    [J]. COMPUTERS & STRUCTURES, 1999, 70 (06) : 635 - 646
  • [15] Failure analysis of reinforeced glulam beams by a layer-wise theory
    Davalos, JF
    Kim, YC
    Qiao, PZ
    [J]. 5TH WORLD CONFERENCE ON TIMBER ENGINEERING, VOL 2, PROCEEDINGS, 1998, : 182 - 189
  • [16] Towards Layer-wise Image Vectorization
    Ma, Xu
    Zhou, Yuqian
    Xu, Xingqian
    Sun, Bin
    Filev, Valerii
    Orlov, Nikita
    Fu, Yun
    Shi, Humphrey
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16293 - 16302
  • [17] ANALYSIS OF LAMINATED BEAMS WITH A LAYER-WISE CONSTANT SHEAR THEORY
    DAVALOS, JF
    KIM, YC
    BARBERO, EJ
    [J]. COMPOSITE STRUCTURES, 1994, 28 (03) : 241 - 253
  • [18] Adversarial Optimization-Based Knowledge Transfer of Layer-Wise Dense Flow for Image Classification
    Yeo, Doyeob
    Kim, Min-Suk
    Bae, Ji-Hoon
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [19] ULAN: A Universal Local Adversarial Network for SAR Target Recognition Based on Layer-Wise Relevance Propagation
    Du, Meng
    Bi, Daping
    Du, Mingyang
    Xu, Xinsong
    Wu, Zilong
    [J]. REMOTE SENSING, 2023, 15 (01)
  • [20] Layer-Wise Modeling and Anomaly Detection for Laser-Based Additive Manufacturing
    Seifi, Seyyed Hadi
    Tian, Wenmeng
    Doude, Haley
    Tschopp, Mark A.
    Bian, Linkan
    [J]. JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2019, 141 (08):