Searching for explanations of black-box classifiers in the space of semantic queries

Cited by: 0

Authors:

Liartis, Jason [1]

Dervakos, Edmund [1]

Menis-Mastromichalakis, Orfeas [1]

Chortaras, Alexandros [1]

Stamou, Giorgos [1]

Affiliations:

[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece

Source:

SEMANTIC WEB | 2024 / Vol. 15 / No. 04

Keywords:

Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;

DOI:

10.3233/SW-233469

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regard to their complex inner operation, obscuring the reasons for their decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe its behaviour with rules expressed in the terminology of the knowledge graph, which is understandable by humans. We first investigate the problem theoretically to provide guarantees for the extracted rules, and then we investigate the relation of "explanation rules for a specific class" with "semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class". Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms that solve this inverse problem as a heuristic search in the space of semantic queries, evaluate the proposed algorithms on four simulated use-cases, and discuss the results.


Pages: 1085-1126

Number of pages: 42
