Robustness between the worst and average case

被引:0
|
作者
Rice, Leslie [1 ]
Bair, Anna [1 ]
Zhang, Huan [1 ]
Kolter, J. Zico [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Bosch Ctr Artificial Intelligence, Pittsburgh, PA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷
关键词
NORMALIZING CONSTANTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several recent works in machine learning have focused on evaluating the test-time robustness of a classifier: how well the classifier performs not just on the target domain it was trained upon, but upon perturbed examples. In these settings, the focus has largely been on two extremes of robustness: the robustness to perturbations drawn at random from within some distribution (i.e., robustness to random perturbations), and the robustness to the worst case perturbation in some set (i.e., adversarial robustness). In this paper, we argue that a sliding scale between these two extremes provides a valuable additional metric by which to gauge robustness. Specifically, we illustrate that each of these two extremes is naturally characterized by a (functional) q-norm over perturbation space, with q = 1 corresponding to robustness to random perturbations and q = infinity corresponding to adversarial perturbations. We then present the main technical contribution of our paper: a method for efficiently estimating the value of these norms by interpreting them as the partition function of a particular distribution, then using path sampling with MCMC methods to estimate this partition function (either traditional Metropolis-Hastings for non-differentiable perturbations, or Hamiltonian Monte Carlo for differentiable perturbations). We show that our approach provides substantially better estimates than simple random sampling of the actual "intermediate-q" robustness of standard, data-augmented, and adversarially-trained classifiers, illustrating a clear tradeoff between classifiers that optimize different metrics. Code for reproducing experiments can be found at https://github.com/locuslab/intermediate_robustness.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Minimizing worst-case and average-case makespan over scenarios
    Esteban Feuerstein
    Alberto Marchetti-Spaccamela
    Frans Schalekamp
    René Sitters
    Suzanne van der Ster
    Leen Stougie
    Anke van Zuylen
    Journal of Scheduling, 2017, 20 : 545 - 555
  • [32] Worst-Case to Average-Case Reductions via Additive Combinatorics
    Asadi, Vahid R.
    Golovnev, Alexander
    Gur, Tom
    Shinkar, Igor
    PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 1566 - 1574
  • [33] Average-case intractability vs. worst-case intractability
    Köbler, J
    Schuler, R
    INFORMATION AND COMPUTATION, 2004, 190 (01) : 1 - 17
  • [34] Average-case intractability vs. worst-case intractability
    Köbler, J
    Schuler, R
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 1998, 1998, 1450 : 493 - 502
  • [35] Worst-case to average-case reductions based on Gaussian measures
    Micciancio, D
    Regev, O
    45TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2004, : 372 - 381
  • [36] Bidirectional PageRank Estimation: From Average-Case to Worst-Case
    Lofgren, Peter
    Banerjee, Siddhartha
    Goel, Ashish
    ALGORITHMS AND MODELS FOR THE WEB GRAPH, (WAW 2015), 2015, 9479 : 164 - 176
  • [37] Minimizing worst-case and average-case makespan over scenarios
    Feuerstein, Esteban
    Marchetti-Spaccamela, Alberto
    Schalekamp, Frans
    Sitters, Rene
    van der Ster, Suzanne
    Stougie, Leen
    van Zuylen, Anke
    JOURNAL OF SCHEDULING, 2017, 20 (06) : 545 - 555
  • [38] Worst-case versus average case complexity of ray-shooting
    L. Szirmay-Kalos
    G. Márton
    Computing, 1998, 61 : 103 - 131
  • [39] Worst-case to average-case reductions based on Gaussian measures
    Micciancio, Daniele
    Regev, Oded
    SIAM JOURNAL ON COMPUTING, 2007, 37 (01) : 267 - 302
  • [40] Worst-case versus average case complexity of ray-shooting
    Szirmay-Kalos, L
    Marton, G
    COMPUTING, 1998, 61 (02) : 103 - 131