Robustness between the worst and average case

Cited by: 0
Authors
Rice, Leslie [1]
Bair, Anna [1]
Zhang, Huan [1]
Kolter, J. Zico [1, 2]
Affiliations
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Bosch Ctr Artificial Intelligence, Pittsburgh, PA USA
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021, Vol. 34
Keywords
NORMALIZING CONSTANTS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Several recent works in machine learning have focused on evaluating the test-time robustness of a classifier: how well the classifier performs not just on the target domain it was trained upon, but upon perturbed examples. In these settings, the focus has largely been on two extremes of robustness: the robustness to perturbations drawn at random from within some distribution (i.e., robustness to random perturbations), and the robustness to the worst case perturbation in some set (i.e., adversarial robustness). In this paper, we argue that a sliding scale between these two extremes provides a valuable additional metric by which to gauge robustness. Specifically, we illustrate that each of these two extremes is naturally characterized by a (functional) q-norm over perturbation space, with q = 1 corresponding to robustness to random perturbations and q = infinity corresponding to adversarial perturbations. We then present the main technical contribution of our paper: a method for efficiently estimating the value of these norms by interpreting them as the partition function of a particular distribution, then using path sampling with MCMC methods to estimate this partition function (either traditional Metropolis-Hastings for non-differentiable perturbations, or Hamiltonian Monte Carlo for differentiable perturbations). We show that our approach provides substantially better estimates than simple random sampling of the actual "intermediate-q" robustness of standard, data-augmented, and adversarially-trained classifiers, illustrating a clear tradeoff between classifiers that optimize different metrics. Code for reproducing experiments can be found at https://github.com/locuslab/intermediate_robustness.
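The sliding scale described in the abstract can be illustrated with a minimal sketch. It assumes the q-norm takes the form (E[loss(delta)^q])^(1/q) over perturbations delta drawn from some distribution; this record does not give the paper's exact definition, so treat that form as an assumption. The function name `q_norm_robustness` and the sample values are hypothetical. The code is the simple random-sampling estimate that the abstract uses as a point of comparison, not the path-sampling MCMC estimator the paper proposes.

```python
import numpy as np

def q_norm_robustness(losses, q):
    """Random-sampling estimate of (E[loss^q])^(1/q).

    `losses` holds non-negative loss values, one per sampled perturbation.
    q = 1 gives the average-case loss; q = inf gives the largest sampled loss.
    """
    losses = np.asarray(losses, dtype=float)
    peak = losses.max()
    if np.isinf(q) or peak == 0.0:
        return float(peak)
    # Factor out the largest loss so that large q does not overflow.
    return float(peak * np.mean((losses / peak) ** q) ** (1.0 / q))

# Hypothetical losses observed on a handful of sampled perturbations.
sampled_losses = [0.1, 0.2, 0.1, 2.5, 0.3]
for q in (1, 2, 10, 100, np.inf):
    print(q, q_norm_robustness(sampled_losses, q))
```

As q grows, the estimate moves from the mean of the sampled losses toward their maximum. With plain random sampling, the large-q values can only be as large as the worst perturbation that happened to be drawn, which is one way to read the abstract's claim that a dedicated estimator does better for intermediate q.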
Pages: 12
Related papers
50 items in total
  • [41] Worst- and average-case privacy breaches in randomization mechanisms
    Boreale, Michele
    Paolini, Michela
    THEORETICAL COMPUTER SCIENCE, 2015, 597 : 40 - 61
  • [42] AVERAGE AND WORST-CASE ANALYSIS OF HEURISTICS FOR THE MAXIMUM TARDINESS PROBLEM
    HALL, NG
    RHEE, WST
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1986, 26 (02) : 272 - 277
  • [43] Worst-Case Optimal Average Consensus Estimators for Robot Swarms
    Elwin, Matthew L.
    Freeman, Randy A.
    Lynch, Kevin M.
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 3814 - 3819
  • [44] Relativized Worlds without Worst-Case to Average-Case Reductions for NP
    Watson, Thomas
    APPROXIMATION, RANDOMIZATION, AND COMBINATORIAL OPTIMIZATION: ALGORITHMS AND TECHNIQUES, 2010, 6302 : 752 - 765
  • [45] Quantum Worst-Case to Average-Case Reductions for All Linear Problems
    Asadi, Vahid R.
    Golovnev, Alexander
    Gur, Tom
    Shinkar, Igor
    Subramanian, Sathyawageeswar
    PROCEEDINGS OF THE 2024 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2024, : 2535 - 2567
  • [46] Truncation in average and worst case settings for special classes of ∞-variate functions
    Kritzer, Peter
    Pillichshammer, Friedrich
    Wasilkowski, G. W.
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2019, 161 : 52 - 65
  • [47] Average and worst-case techniques in convex optimization with stochastic uncertainty
    Calafiore, Giuseppe
    Dabbene, Fabrizio
    2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 6614 - 6619
  • [48] The stretch-length tradeoff in geometric networks: average case and worst case study
    Aldous, David
    Lando, Tamar
    MATHEMATICAL PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY, 2015, 159 (01) : 125 - 151
  • [49] Lattices that Admit Logarithmic Worst-Case to Average-Case Connection Factors
    Peikert, Chris
    Rosen, Alon
    STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, 2007, : 478 - 487
  • [50] Relativized Worlds without Worst-Case to Average-Case Reductions for NP
    Watson, Thomas
    ACM TRANSACTIONS ON COMPUTATION THEORY, 2012, 4 (03) : 1 - 30