An Information-Theoretic Explanation for the Adversarial Fragility of AI Classifiers

被引:0
|
作者
Xie, Hui [1 ]
Yi, Jirong [1 ]
Xu, Weiyu [1 ]
Mudumbai, Raghu [1 ]
机构
[1] Univ Iowa, Dept Elect & Comp Engn, Iowa City, IA 52242 USA
关键词
D O I
10.1109/isit.2019.8849757
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a simple hypothesis about a compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. We also propose a new method for detecting when small input perturbations cause classifier errors, and show theoretical guarantees for the performance of this detection method. We present experimental results with a voice recognition system to demonstrate this method. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system.
引用
收藏
页码:1977 / 1981
页数:5
相关论文
共 50 条
  • [41] Information-theoretic software clustering
    Andritsos, P
    Tzerpos, V
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2005, 31 (02) : 150 - 165
  • [42] Information-theoretic image formation
    O'Sullivan, JA
    Blahut, RE
    Snyder, DL
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) : 2094 - 2123
  • [43] Information-theoretic analysis of watermarking
    Moulin, P
    O'Sullivan, JA
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3630 - 3633
  • [44] Information-Theoretic Statistical Linearization
    Chernyshov, K. R.
    IFAC PAPERSONLINE, 2016, 49 (12): : 1797 - 1802
  • [45] INFORMATION-THEORETIC COMPUTATIONAL COMPLEXITY
    CHAITIN, GJ
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1974, 20 (01) : 10 - 15
  • [46] Intrinsic Information-Theoretic Models
    Bernal-Casas, D.
    Oller, J. M.
    ENTROPY, 2024, 26 (05)
  • [47] An information-theoretic view on spacetime
    Saueressig, Frank
    Khosravi, Amir
    MODERN PHYSICS LETTERS A, 2021, 36 (10)
  • [48] Information-theoretic competitive learning
    Kamimura, R
    IASTED: PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, 2003, : 359 - 365
  • [49] An Information-Theoretic Analysis of Deduplication
    Niesen, Urs
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (09) : 5688 - 5704
  • [50] Interference as an information-theoretic game
    Horvat, Sebastian
    Dakic, Borivoje
    QUANTUM, 2021, 5