Robustness between the worst and average case

被引:0
|
作者
Rice, Leslie [1 ]
Bair, Anna [1 ]
Zhang, Huan [1 ]
Kolter, J. Zico [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Bosch Ctr Artificial Intelligence, Pittsburgh, PA USA
关键词
NORMALIZING CONSTANTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several recent works in machine learning have focused on evaluating the test-time robustness of a classifier: how well the classifier performs not just on the target domain it was trained upon, but upon perturbed examples. In these settings, the focus has largely been on two extremes of robustness: the robustness to perturbations drawn at random from within some distribution (i.e., robustness to random perturbations), and the robustness to the worst case perturbation in some set (i.e., adversarial robustness). In this paper, we argue that a sliding scale between these two extremes provides a valuable additional metric by which to gauge robustness. Specifically, we illustrate that each of these two extremes is naturally characterized by a (functional) q-norm over perturbation space, with q = 1 corresponding to robustness to random perturbations and q = infinity corresponding to adversarial perturbations. We then present the main technical contribution of our paper: a method for efficiently estimating the value of these norms by interpreting them as the partition function of a particular distribution, then using path sampling with MCMC methods to estimate this partition function (either traditional Metropolis-Hastings for non-differentiable perturbations, or Hamiltonian Monte Carlo for differentiable perturbations). We show that our approach provides substantially better estimates than simple random sampling of the actual "intermediate-q" robustness of standard, data-augmented, and adversarially-trained classifiers, illustrating a clear tradeoff between classifiers that optimize different metrics. Code for reproducing experiments can be found at https://github.com/locuslab/intermediate_robustness.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] On Proper Learnability between Average- and Worst-case Robustness
    Raman, Vinod
    Subedi, Unique
    Tewari, Ambuj
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Relations between average-case and worst-case complexity
    Pavan, A
    Vinodchandran, NV
    FUNDAMENTALS OF COMPUTATIONAL THEORY, PROCEEDINGS, 2005, 3623 : 422 - 432
  • [3] Relations between average-case and worst-case complexity
    Pavan, A.
    Vinodchandran, N. V.
    THEORY OF COMPUTING SYSTEMS, 2008, 42 (04) : 596 - 607
  • [4] Relations between Average-Case and Worst-Case Complexity
    A. Pavan
    N. V. Vinodchandran
    Theory of Computing Systems, 2008, 42 : 596 - 607
  • [5] Codes for Adversaries: Between Worst-Case and Average-Case Jamming
    Dey, Bikash Kumar
    Jaggi, Sidharth
    Langb, Michael
    Sarwate, Anand D.
    Zhang, Yihan
    FOUNDATIONS AND TRENDS IN COMMUNICATIONS AND INFORMATION THEORY, 2024, 21 (3-4): : 300 - 588
  • [6] Worst Case to Average Case Reductions for Polynomials
    Kaufman, Tali
    Lovett, Shachar
    PROCEEDINGS OF THE 49TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2008, : 166 - +
  • [7] Finding the Trade-off between Robustness and Worst-case Quality
    Branke, Juergen
    Lu, Ke
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 623 - 630
  • [8] ROBUSTNESS OF DISCRETE DECISIONS TO WORST CASE DISTRIBUTION
    SCHENKERMAN, S
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1978, 29 (02) : 159 - 165
  • [9] Abstract interpretation for worst and average case analysis
    Di Pierro, Alessandra
    Hankin, Chris
    Wiklicky, Herbert
    PROGRAM ANALYSIS AND COMPILATION, THEORY AND PRACTICE: ESSAYS DEDICATED TO REINHARD WILHELM ON THE OCCASION OF HIS 60TH BIRTHDAY, 2007, 4444 : 160 - +
  • [10] Worst-case to average-case reductions revisited
    Gutfreund, Dan
    Ta-Shma, Amnon
    APPROXIMATION, RANDOMIZATION, AND COMBINATORIAL OPTIMIZATION: ALGORITHMS AND TECHNIQUES, 2007, 4627 : 569 - +