Visualizing and Analyzing the Topology of Neuron Activations in Deep Adversarial Training

被引：0

作者：

Zhou, Youjia ^{[1
]}

Zhou, Yi ^{[1
]}

Ding, Jie ^{[2
]}

Wang, Bei ^{[1
]}

机构：

[1] Univ Utah, Salt Lake City, UT 84112 USA

[2] Univ Minnesota Twin Cities, Minneapolis, MN USA

来源：

TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221 | 2023年 / 221卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep models are known to be vulnerable to data adversarial attacks, and many adversarial training techniques have been developed to improve their adversarial robustness. While data adversaries attack model predictions through modifying data, little is known about their impact on the neuron activations produced by the model, which play a crucial role in determining the model's predictions and interpretability. In this work, we aim to develop a topological understanding of adversarial training to enhance its interpretability. We analyze the topological structure-in particular, mapper graphs-of neuron activations of data samples produced by deep adversarial training. Each node of a mapper graph represents a cluster of activations, and two nodes are connected by an edge if their corresponding clusters have a nonempty intersection. We provide an interactive visualization tool that demonstrates the utility of our topological framework in exploring the activation space. We found that stronger attacks make the data samples more indistinguishable in the neuron activation space that leads to a lower accuracy. Our tool also provides a natural way to identify the vulnerable data samples that may be useful in improving model robustness.

引用

页数：11

共 50 条

[31] A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples
Gwonsang Ryu
Daeseon Choi
Applied Intelligence, 2023, 53 : 9174 - 9187
[32] Enhancing adversarial robustness for deep metric learning via neural discrete adversarial training
Li, Chaofei
Zhu, Ziyuan
Niu, Ruicheng
Zhao, Yuting
COMPUTERS & SECURITY, 2024, 143
[33] A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples
Ryu, Gwonsang
Choi, Daeseon
APPLIED INTELLIGENCE, 2023, 53 (08) : 9174 - 9187
[34] VARIANCE PRESERVING INITIALIZATION FOR TRAINING DEEP NEUROMORPHIC PHOTONIC NETWORKS WITH SINUSOIDAL ACTIVATIONS
Passalis, Nikolaos
Mourgias-Alexandris, George
Tsakyridis, Apostolos
Pleros, Nikos
Tefas, Anastasios
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1483 - 1487
[35] An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
Such, Felipe Petroski
Madhavan, Vashisht
Liu, Rosanne
Wang, Rui
Castro, Pablo Samuel
Li, Yulun
Zhi, Jiale
Schubert, Ludwig
Bellemare, Marc G.
Clune, Jeff
Lehman, Joel
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3260 - 3267
[36] A New Approach to Descriptors Generation for Image Retrieval by Analyzing Activations of Deep Neural Network Layers
Staszewski, Pawel
Jaworski, Maciej
Cao, Jinde
Rutkowski, Leszek
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7913 - 7920
[37] Interpreting Deep Neural Networks with Relative Sectional Propagation by Analyzing Comparative Gradients and Hostile Activations
Nam, Woo-Jeoung
Choi, Jaesik
Lee, Seong-Whan
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11604 - 11612
[38] Analyzing and Accelerating the Bottlenecks of Training Deep SNNs With Backpropagation
Chen, Ruizhi
Li, Ling
NEURAL COMPUTATION, 2020, 32 (12) : 2507 - 2550
[39] Distributed Adversarial Training to Robustify Deep Neural Networks at Scale
Zhang, Gaoyuan
Lu, Songtao
Zhang, Yihua
Chen, Xiangyi
Chen, Pin-Yu
Fan, Quanfu
Martie, Lee
Horesh, Lior
Hong, Mingyi
Liu, Sijia
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2353 - 2363
[40] Music Generation System for Adversarial Training Based on Deep Learning
Min, Jun
Liu, Zhaoqi
Wang, Lei
Li, Dongyang
Zhang, Maoqing
Huang, Yantai
PROCESSES, 2022, 10 (12)

← 1 2 3 4 5 →