Visualizing and Analyzing the Topology of Neuron Activations in Deep Adversarial Training

被引:0
|
作者
Zhou, Youjia [1 ]
Zhou, Yi [1 ]
Ding, Jie [2 ]
Wang, Bei [1 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] Univ Minnesota Twin Cities, Minneapolis, MN USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep models are known to be vulnerable to data adversarial attacks, and many adversarial training techniques have been developed to improve their adversarial robustness. While data adversaries attack model predictions through modifying data, little is known about their impact on the neuron activations produced by the model, which play a crucial role in determining the model's predictions and interpretability. In this work, we aim to develop a topological understanding of adversarial training to enhance its interpretability. We analyze the topological structure-in particular, mapper graphs-of neuron activations of data samples produced by deep adversarial training. Each node of a mapper graph represents a cluster of activations, and two nodes are connected by an edge if their corresponding clusters have a nonempty intersection. We provide an interactive visualization tool that demonstrates the utility of our topological framework in exploring the activation space. We found that stronger attacks make the data samples more indistinguishable in the neuron activation space that leads to a lower accuracy. Our tool also provides a natural way to identify the vulnerable data samples that may be useful in improving model robustness.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples
    Gwonsang Ryu
    Daeseon Choi
    Applied Intelligence, 2023, 53 : 9174 - 9187
  • [32] Enhancing adversarial robustness for deep metric learning via neural discrete adversarial training
    Li, Chaofei
    Zhu, Ziyuan
    Niu, Ruicheng
    Zhao, Yuting
    COMPUTERS & SECURITY, 2024, 143
  • [33] A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples
    Ryu, Gwonsang
    Choi, Daeseon
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9174 - 9187
  • [34] VARIANCE PRESERVING INITIALIZATION FOR TRAINING DEEP NEUROMORPHIC PHOTONIC NETWORKS WITH SINUSOIDAL ACTIVATIONS
    Passalis, Nikolaos
    Mourgias-Alexandris, George
    Tsakyridis, Apostolos
    Pleros, Nikos
    Tefas, Anastasios
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1483 - 1487
  • [35] An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
    Such, Felipe Petroski
    Madhavan, Vashisht
    Liu, Rosanne
    Wang, Rui
    Castro, Pablo Samuel
    Li, Yulun
    Zhi, Jiale
    Schubert, Ludwig
    Bellemare, Marc G.
    Clune, Jeff
    Lehman, Joel
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3260 - 3267
  • [36] A New Approach to Descriptors Generation for Image Retrieval by Analyzing Activations of Deep Neural Network Layers
    Staszewski, Pawel
    Jaworski, Maciej
    Cao, Jinde
    Rutkowski, Leszek
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7913 - 7920
  • [37] Interpreting Deep Neural Networks with Relative Sectional Propagation by Analyzing Comparative Gradients and Hostile Activations
    Nam, Woo-Jeoung
    Choi, Jaesik
    Lee, Seong-Whan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11604 - 11612
  • [38] Analyzing and Accelerating the Bottlenecks of Training Deep SNNs With Backpropagation
    Chen, Ruizhi
    Li, Ling
    NEURAL COMPUTATION, 2020, 32 (12) : 2507 - 2550
  • [39] Distributed Adversarial Training to Robustify Deep Neural Networks at Scale
    Zhang, Gaoyuan
    Lu, Songtao
    Zhang, Yihua
    Chen, Xiangyi
    Chen, Pin-Yu
    Fan, Quanfu
    Martie, Lee
    Horesh, Lior
    Hong, Mingyi
    Liu, Sijia
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2353 - 2363
  • [40] Music Generation System for Adversarial Training Based on Deep Learning
    Min, Jun
    Liu, Zhaoqi
    Wang, Lei
    Li, Dongyang
    Zhang, Maoqing
    Huang, Yantai
    PROCESSES, 2022, 10 (12)