An Information-Theoretic Explanation for the Adversarial Fragility of AI Classifiers

Cited by: 0
Authors
Xie, Hui [1]
Yi, Jirong [1]
Xu, Weiyu [1]
Mudumbai, Raghu [1]
Affiliations
[1] Univ Iowa, Dept Elect & Comp Engn, Iowa City, IA 52242 USA
DOI
10.1109/isit.2019.8849757
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We present a simple hypothesis about a compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. We also propose a new method for detecting when small input perturbations cause classifier errors, and show theoretical guarantees for the performance of this detection method. We present experimental results with a voice recognition system to demonstrate this method. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system.
Pages: 1977-1981
Number of pages: 5