Searching for explanations of black-box classifiers in the space of semantic queries

被引：0

作者：

Liartis, Jason ^{[1
]}

Dervakos, Edmund ^{[1
]}

Menis-Mastromichalakis, Orfeas ^{[1
]}

Chortaras, Alexandros ^{[1
]}

Stamou, Giorgos ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece

来源：

SEMANTIC WEB | 2024年 / 15卷 / 04期

关键词：

Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;

D O I：

10.3233/SW-233469Press

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regards to their inner complex operation, obfuscating the reasons for which they make decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as in medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe the behaviour with rules that are expressed in the terminology of the knowledge graph, that is understandable by humans. We first theoretically investigate the problem to provide guarantees for the extracted rules and then we investigate the relation of "explanation rules for a specific class" with "semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class". Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms for solving this inverse problem as a heuristic search in the space of semantic queries and we evaluate the proposed algorithms on four simulated use-cases and discuss the results.

引用

页码：1085 / 1126

页数：42

共 50 条

[1] Generative causal explanations of black-box classifiers
O'Shaughnessy, Matthew
Canal, Gregory
Connor, Marissa
Davenport, Mark
Rozell, Christopher
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[2] Experimental Study on Generating Multi-modal Explanations of Black-box Classifiers in terms of Gray-box Classifiers
Alonso, Jose M.
Toja-Alamancos, J.
Bugarin, A.
2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
[3] A Generic Framework for Black-box Explanations
Henin, Clement
Le Metayer, Daniel
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3667 - 3676
[4] Learning Groupwise Explanations for Black-Box Models
Gao, Jingyue
Wang, Xiting
Wang, Yasha
Yan, Yulan
Xie, Xing
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2396 - 2402
[5] Active Bayesian Assessment of Black-Box Classifiers
Ji, Disi
Logan, Robert L.
Smyth, Padhraic
Steyvers, Mark
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7935 - 7944
[6] Explaining black-box classifiers: Properties and functions
Amgoud, Leila
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 155 : 40 - 65
[7] Rule-based approximation of black-box classifiers for tabular data to generate global and local explanations
Maszczyk, Cezary
Kozielski, Michal
Sikora, Marek
PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 89 - 92
[8] Feature Importance Explanations for Temporal Black-Box Models
Sood, Akshay
Craven, Mark
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8351 - 8360
[9] Black-box Adversarial Attacks with Limited Queries and Information
Ilyas, Andrew
Engstrom, Logan
Athalye, Anish
Lin, Jessy
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[10] Evading Black-box Classifiers Without Breaking Eggs
Debenedetti, Edoardo
Carlini, Nicholas
Tramer, Florian
IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 408 - 424

← 1 2 3 4 5 →