Searching for explanations of black-box classifiers in the space of semantic queries

被引：0

作者：

Liartis, Jason ^{[1
]}

Dervakos, Edmund ^{[1
]}

Menis-Mastromichalakis, Orfeas ^{[1
]}

Chortaras, Alexandros ^{[1
]}

Stamou, Giorgos ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece

来源：

SEMANTIC WEB | 2024年 / 15卷 / 04期

关键词：

Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;

D O I：

10.3233/SW-233469Press

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regards to their inner complex operation, obfuscating the reasons for which they make decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as in medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe the behaviour with rules that are expressed in the terminology of the knowledge graph, that is understandable by humans. We first theoretically investigate the problem to provide guarantees for the extracted rules and then we investigate the relation of "explanation rules for a specific class" with "semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class". Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms for solving this inverse problem as a heuristic search in the space of semantic queries and we evaluate the proposed algorithms on four simulated use-cases and discuss the results.

引用

页码：1085 / 1126

页数：42

共 50 条

[21] Explanations of Black-Box Model Predictions by Contextual Importance and Utility
Anjomshoae, Sule
Framling, Kary
Najjar, Amro
EXPLAINABLE, TRANSPARENT AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2019, 2019, 11763 : 95 - 109
[22] Uncertainty-Based Rejection Wrappers for Black-Box Classifiers
Mena, Jose
Pujol, Oriol
Vitria, Jordi
IEEE ACCESS, 2020, 8 : 101721 - 101746
[23] Explaining black-box classifiers using post-hoc explanations-by-example: The effect of explanations and error-rates in XAI user studies
Kenny, Eoin M.
Ford, Courtney
Quinn, Molly
Keane, Mark T.
ARTIFICIAL INTELLIGENCE, 2021, 294
[24] MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model Explanations
Yang, Qing
Zhu, Xia
Fwu, Jong-Kae
Ye, Yun
You, Ganmei
Zhu, Yuan
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1376 - 1383
[25] Iterative and Adaptive Sampling with Spatial Attention for Black-Box Model Explanations
Vasu, Bhavan
Long, Chengjiang
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2949 - 2958
[26] Unrestricted Black-box Adversarial Attack Using GAN with Limited Queries
Na, Dongbin
Ji, Sangwoo
Kim, Jong
arXiv, 2022,
[27] EXPLAN: Explaining Black-box Classifiers using Adaptive Neighborhood Generation
Rasouli, Peyman
Yu, Ingrid Chieh
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[28] A Practical Black-Box Attack on Source Code Authorship Identification Classifiers
Liu, Qianjun
Ji, Shouling
Liu, Changchang
Wu, Chunming
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 3620 - 3633
[29] Simple Black-Box Adversarial Examples Generation with Very Few Queries
Senzaki, Yuya
Ohata, Satsuya
Matsuura, Kanta
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (02) : 212 - 221
[30] THE BLACK-BOX
KYLE, SA
NEW SCIENTIST, 1986, 110 (1512) : 61 - 61

← 1 2 3 4 5 →