An Information-Theoretic Explanation for the Adversarial Fragility of AI Classifiers

被引:0
|
作者
Xie, Hui [1 ]
Yi, Jirong [1 ]
Xu, Weiyu [1 ]
Mudumbai, Raghu [1 ]
机构
[1] Univ Iowa, Dept Elect & Comp Engn, Iowa City, IA 52242 USA
关键词
D O I
10.1109/isit.2019.8849757
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a simple hypothesis about a compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. We also propose a new method for detecting when small input perturbations cause classifier errors, and show theoretical guarantees for the performance of this detection method. We present experimental results with a voice recognition system to demonstrate this method. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system.
引用
收藏
页码:1977 / 1981
页数:5
相关论文
共 50 条
  • [1] ON AN INFORMATION-THEORETIC MODEL OF EXPLANATION
    WOODWARD, J
    PHILOSOPHY OF SCIENCE, 1987, 54 (01) : 21 - 44
  • [2] Selection of classifiers using information-theoretic criteria
    Kang, HJ
    PATTERN RECOGNITION AND DATA MINING, PT 1, PROCEEDINGS, 2005, 3686 : 478 - 487
  • [3] An Adversarial Interpretation of Information-Theoretic Bounded Rationality
    Ortega, Pedro A.
    Lee, Daniel D.
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2483 - 2489
  • [4] An information-theoretic perspective of physical adversarial patches
    Tarchoun, Bilel
    Ben Khalifa, Anouar
    Mahjoub, Mohamed Ali
    Abu-Ghazaleh, Nael
    Alouani, Ihsen
    NEURAL NETWORKS, 2024, 179
  • [5] INFORMATION-THEORETIC CRITERIA FOR THE DESIGN OF COMPRESSIVE SUBSPACE CLASSIFIERS
    Nokleby, Matthew
    Rodrigues, Miguel
    Calderbank, Robert
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Transfer Learning for Quantum Classifiers: An Information-Theoretic Generalization Analysis
    Jose, Sharu Theresa
    Simeone, Osvaldo
    2023 IEEE INFORMATION THEORY WORKSHOP, ITW, 2023, : 532 - 537
  • [7] Information-theoretic selection of classifiers for building multiple classifier systems
    Kang, HJ
    Choo, M
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 909 - 918
  • [8] An information-theoretic measure to evaluate data partitions in multiple classifiers
    Dara, RA
    Makrehchi, M
    Kamel, M
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 4826 - 4831
  • [9] Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness
    Zhao, Jiahao
    Mao, Wenji
    Zeng, Daniel Dajun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1237 - 1247
  • [10] Two information-theoretic tools to assess the performance of multi-class classifiers
    Valverde-Albacete, Francisco J.
    Pelaez-Moreno, Carmen
    PATTERN RECOGNITION LETTERS, 2010, 31 (12) : 1665 - 1671