An Information-Theoretic Explanation for the Adversarial Fragility of AI Classifiers

Cited by: 0

Authors:

Xie, Hui [1]

Yi, Jirong [1]

Xu, Weiyu [1]

Mudumbai, Raghu [1]

Affiliations:

[1] Univ Iowa, Dept Elect & Comp Engn, Iowa City, IA 52242 USA

Source:

2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT) | 2019

Keywords:

DOI:

10.1109/isit.2019.8849757

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

We present a simple hypothesis about a compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. We also propose a new method for detecting when small input perturbations cause classifier errors, and show theoretical guarantees for the performance of this detection method. We present experimental results with a voice recognition system to demonstrate this method. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system.


Pages: 1977-1981

Page count: 5
