Visualizing and Analyzing the Topology of Neuron Activations in Deep Adversarial Training

被引:0
|
作者
Zhou, Youjia [1 ]
Zhou, Yi [1 ]
Ding, Jie [2 ]
Wang, Bei [1 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] Univ Minnesota Twin Cities, Minneapolis, MN USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep models are known to be vulnerable to data adversarial attacks, and many adversarial training techniques have been developed to improve their adversarial robustness. While data adversaries attack model predictions through modifying data, little is known about their impact on the neuron activations produced by the model, which play a crucial role in determining the model's predictions and interpretability. In this work, we aim to develop a topological understanding of adversarial training to enhance its interpretability. We analyze the topological structure-in particular, mapper graphs-of neuron activations of data samples produced by deep adversarial training. Each node of a mapper graph represents a cluster of activations, and two nodes are connected by an edge if their corresponding clusters have a nonempty intersection. We provide an interactive visualization tool that demonstrates the utility of our topological framework in exploring the activation space. We found that stronger attacks make the data samples more indistinguishable in the neuron activation space that leads to a lower accuracy. Our tool also provides a natural way to identify the vulnerable data samples that may be useful in improving model robustness.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] SINGLE IMAGE DEPTH ESTIMATION USING DEEP ADVERSARIAL TRAINING
    Hambarde, Praful
    Dudhane, Akshay
    Murala, Subrahmanyam
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 989 - 993
  • [42] An active learning framework for adversarial training of deep neural networks
    Susmita Ghosh
    Abhiroop Chatterjee
    Lance Fiondella
    Neural Computing and Applications, 2025, 37 (9) : 6849 - 6876
  • [43] Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers
    Liu, Hong
    Long, Mingsheng
    Wang, Jianmin
    Jordan, Michael I.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [44] Regularizing Deep Networks Using Efficient Layerwise Adversarial Training
    Sankaranarayanan, Swami
    Jain, Arpit
    Chellappa, Rama
    Lim, Ser Nam
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4008 - 4015
  • [45] Fast Training of Deep Neural Networks Robust to Adversarial Perturbations
    Goodwin, Justin
    Brown, Olivia
    Helus, Victoria
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [46] Training Adversarial Agents to Exploit Weaknesses in Deep Control Policies
    Kuutti, Sampo
    Fallah, Saber
    Bowden, Richard
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 108 - 114
  • [47] Melanoma detection using adversarial training and deep transfer learning
    Zunair, Hasib
    Ben Hamza, A.
    PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (13):
  • [48] Disentangling factors of variation in deep representations using adversarial training
    Mathieu, Michael
    Zhao, Junbo
    Sprechmann, Pablo
    Ramesh, Aditya
    Lecun, Yann
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [49] Analyzing Adversarial Attacks Against Deep Learning for Intrusion Detection in IoT Networks
    Ibitoye, Olakunle
    Shafiq, Omair
    Matrawy, Ashraf
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [50] Adversarial Vulnerability of Deep Learning Models in Analyzing Next Generation Sequencing Data
    Meiseles, Amiel
    Rosenberg, Ishai
    Motro, Yair
    Rokach, Lior
    Moran-Gilad, Jacob
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 464 - 468