Visualizing and Analyzing the Topology of Neuron Activations in Deep Adversarial Training

被引：0

作者：

Zhou, Youjia ^{[1
]}

Zhou, Yi ^{[1
]}

Ding, Jie ^{[2
]}

Wang, Bei ^{[1
]}

机构：

[1] Univ Utah, Salt Lake City, UT 84112 USA

[2] Univ Minnesota Twin Cities, Minneapolis, MN USA

来源：

TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221 | 2023年 / 221卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep models are known to be vulnerable to data adversarial attacks, and many adversarial training techniques have been developed to improve their adversarial robustness. While data adversaries attack model predictions through modifying data, little is known about their impact on the neuron activations produced by the model, which play a crucial role in determining the model's predictions and interpretability. In this work, we aim to develop a topological understanding of adversarial training to enhance its interpretability. We analyze the topological structure-in particular, mapper graphs-of neuron activations of data samples produced by deep adversarial training. Each node of a mapper graph represents a cluster of activations, and two nodes are connected by an edge if their corresponding clusters have a nonempty intersection. We provide an interactive visualization tool that demonstrates the utility of our topological framework in exploring the activation space. We found that stronger attacks make the data samples more indistinguishable in the neuron activation space that leads to a lower accuracy. Our tool also provides a natural way to identify the vulnerable data samples that may be useful in improving model robustness.

引用

页数：11

共 50 条

[41] SINGLE IMAGE DEPTH ESTIMATION USING DEEP ADVERSARIAL TRAINING
Hambarde, Praful
Dudhane, Akshay
Murala, Subrahmanyam
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 989 - 993
[42] An active learning framework for adversarial training of deep neural networks
Susmita Ghosh
Abhiroop Chatterjee
Lance Fiondella
Neural Computing and Applications, 2025, 37 (9) : 6849 - 6876
[43] Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers
Liu, Hong
Long, Mingsheng
Wang, Jianmin
Jordan, Michael I.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[44] Regularizing Deep Networks Using Efficient Layerwise Adversarial Training
Sankaranarayanan, Swami
Jain, Arpit
Chellappa, Rama
Lim, Ser Nam
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4008 - 4015
[45] Fast Training of Deep Neural Networks Robust to Adversarial Perturbations
Goodwin, Justin
Brown, Olivia
Helus, Victoria
2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
[46] Training Adversarial Agents to Exploit Weaknesses in Deep Control Policies
Kuutti, Sampo
Fallah, Saber
Bowden, Richard
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 108 - 114
[47] Melanoma detection using adversarial training and deep transfer learning
Zunair, Hasib
Ben Hamza, A.
PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (13):
[48] Disentangling factors of variation in deep representations using adversarial training
Mathieu, Michael
Zhao, Junbo
Sprechmann, Pablo
Ramesh, Aditya
Lecun, Yann
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[49] Analyzing Adversarial Attacks Against Deep Learning for Intrusion Detection in IoT Networks
Ibitoye, Olakunle
Shafiq, Omair
Matrawy, Ashraf
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[50] Adversarial Vulnerability of Deep Learning Models in Analyzing Next Generation Sequencing Data
Meiseles, Amiel
Rosenberg, Ishai
Motro, Yair
Rokach, Lior
Moran-Gilad, Jacob
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 464 - 468

← 1 2 3 4 5 →