Metrics and methods for robustness evaluation of neural networks with generative models

Cited by: 0
Authors
Igor Buzhinsky
Arseny Nerinovsky
Stavros Tripakis
Affiliations
[1] ITMO University, Computer Technologies Laboratory
[2] Aalto University, Department of Electrical Engineering and Automation
[3] Northeastern University
Source
Machine Learning, 2023, Vol. 112
Keywords
Reliable machine learning; Adversarial examples; Natural adversarial examples; Generative models
DOI
Not available
Abstract
Recent studies have shown that modern deep neural network classifiers are easy to fool, assuming that an adversary is able to slightly modify their inputs. Many papers have proposed adversarial attacks, defenses, and methods to measure robustness to such adversarial perturbations. However, the most commonly considered adversarial examples are based on perturbations in the input space of the neural network that are unlikely to arise naturally. Recently, especially in computer vision, researchers have discovered “natural” perturbations, such as rotations, changes of brightness, or higher-level changes, but these perturbations have not yet been systematically used to measure the performance of classifiers. In this paper, we propose several metrics to measure the robustness of classifiers to natural adversarial examples, and methods to evaluate them. These metrics, called latent space performance metrics, are based on the ability of generative models to capture probability distributions. On four image classification case studies, we evaluate the proposed metrics for several classifiers, including ones trained in conventional and robust ways. We find that the latent counterparts of adversarial robustness are associated with the accuracy of the classifier rather than its conventional adversarial robustness, but the latter is still reflected in the properties of the found latent perturbations. In addition, our novel method of finding latent adversarial perturbations demonstrates that these perturbations are often perceptually small.
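To illustrate the general idea behind latent adversarial perturbations described in the abstract, the minimal sketch below performs a gradient-based search in the latent space of a generative model. It is not the authors' algorithm; generator, classifier, z0, and true_label are hypothetical placeholders for a pretrained generator, the classifier under evaluation, a batched latent code, and its ground-truth label.

# Minimal sketch (not the paper's implementation): search for a small latent
# perturbation delta such that the decoded image G(z0 + delta) is misclassified,
# while a norm penalty keeps delta perceptually small.
import torch

def find_latent_perturbation(generator, classifier, z0, true_label,
                             steps=200, lr=0.05, weight=0.1):
    # z0: latent code of shape (1, latent_dim); true_label: tensor of shape (1,)
    delta = torch.zeros_like(z0, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        image = generator(z0 + delta)        # decode the perturbed latent code
        logits = classifier(image)
        # Encourage misclassification (negative cross-entropy w.r.t. the true
        # label) while penalizing the size of the latent perturbation.
        loss = (-torch.nn.functional.cross_entropy(logits, true_label)
                + weight * delta.norm())
        opt.zero_grad()
        loss.backward()
        opt.step()
        if logits.argmax(dim=1).item() != true_label.item():
            break                             # prediction flipped: attack found
    return delta.detach()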
Pages: 3977–4012 (36 pages)