Adversarial Robustness Via Fisher-Rao Regularization

被引:5
|
作者
Picot, Marine [1 ,2 ]
Messina, Francisco [1 ,2 ]
Boudiaf, Malik [3 ]
Labeau, Fabrice [1 ,2 ]
Ayed, Ismail Ben [3 ]
Piantanida, Pablo [4 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0G4, Canada
[2] Univ Paris Saclay CNRS, CentraleSupelec, CNRS, Lab Signaux & Syst L2S, F-91190 Gif Sur Yvette, France
[3] ETS, Montreal, PQ H3C 1K3, Canada
[4] Univ Paris Saclay, Int Lab Learning Syst ILLS, McGill, CNRS,CentraleSupelec,ETS,Mila, Montreal, PQ H3H 2T2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Robustness; Manifolds; Training; Perturbation methods; Standards; Neural networks; Adversarial machine learning; Adversarial regularization; adversarial training; computer vision; deep learning; fisher-rao distance; information geometry; neural networks; safety AI;
D O I
10.1109/TPAMI.2022.3174724
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adversarial robustness has become a topic of growing interest in machine learning since it was observed that neural networks tend to be brittle. We propose an information-geometric formulation of adversarial defense and introduce Fire, a new Fisher-Rao regularization for the categorical cross-entropy loss, which is based on the geodesic distance between the softmax outputs corresponding to natural and perturbed input features. Based on the information-geometric properties of the class of softmax distributions, we derive an explicit characterization of the Fisher-Rao Distance (FRD) for the binary and multiclass cases, and draw some interesting properties as well as connections with standard regularization metrics. Furthermore, we verify on a simple linear and Gaussian model, that all Pareto-optimal points in the accuracy-robustness region can be reached by Fire while other state-of-the-art methods fail. Empirically, we evaluate the performance of various classifiers trained with the proposed loss on standard datasets, showing up to a simultaneous 1% of improvement in terms of clean and robust performances while reducing the training time by 20% over the best-performing methods.
引用
收藏
页码:2698 / 2710
页数:13
相关论文
共 50 条
  • [1] Fisher-Rao geometry of Dirichlet distributions
    Brigant, Alice Le
    Preston, Stephen C.
    Puechmorel, Stephane
    [J]. DIFFERENTIAL GEOMETRY AND ITS APPLICATIONS, 2021, 74
  • [2] The Fisher-Rao Metric in Computer Vision
    Maybank, Steve
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 3 - 3
  • [3] Bayesian sensitivity analysis with the Fisher-Rao metric
    Kurtek, Sebastian
    Bharath, Karthik
    [J]. BIOMETRIKA, 2015, 102 (03) : 601 - 616
  • [4] A Fisher-Rao Metric for Paracatadioptric Images of Lines
    Maybank, Stephen J.
    Ieng, Siohoi
    Benosman, Ryad
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 99 (02) : 147 - 165
  • [5] The Fisher-Rao metric for projective transformations of the line
    Maybank, SJ
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (03) : 191 - 206
  • [6] Application of the Fisher-Rao Metric to Structure Detection
    Stephen J. Maybank
    [J]. Journal of Mathematical Imaging and Vision, 2006, 25 : 49 - 62
  • [7] Approximation to the Fisher-Rao metric for the focus of expansion
    Maybank, S. J.
    [J]. NEUROCOMPUTING, 2008, 71 (10-12) : 2037 - 2045
  • [8] Application of the Fisher-Rao metric to structure detection
    Maybank, Stephen J.
    [J]. JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2006, 25 (01) : 49 - 62
  • [9] The Fisher-Rao Metric for Projective Transformations of the Line
    Stephen J. Maybank
    [J]. International Journal of Computer Vision, 2005, 63 : 191 - 206
  • [10] Application of the Fisher-Rao Metric to Ellipse Detection
    Stephen J. Maybank
    [J]. International Journal of Computer Vision, 2007, 72 : 287 - 307