Searching for explanations of black-box classifiers in the space of semantic queries

Times cited: 0
Authors
Liartis, Jason [1]
Dervakos, Edmund [1]
Menis-Mastromichalakis, Orfeas [1]
Chortaras, Alexandros [1]
Stamou, Giorgos [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece
Keywords
Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE
DOI
10.3233/SW-233469
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning models have achieved impressive performance in various tasks, but their complex inner operation is usually opaque, which obscures the reasons behind their decisions. This opacity raises ethical and legal concerns about the real-life use of such models, especially in critical domains such as medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe its behaviour with rules expressed in the terminology of the knowledge graph, which is understandable by humans. We first investigate the problem theoretically to provide guarantees for the extracted rules, and then we investigate the relation between "explanation rules for a specific class" and "semantic queries that collect from the knowledge graph the instances the black-box classifier assigns to that class". We thus approach the extraction of explanation rules as a semantic query reverse engineering problem. We develop algorithms that solve this inverse problem as a heuristic search in the space of semantic queries, evaluate them on four simulated use-cases, and discuss the results.
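The link between explanation rules and semantic queries described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the abstract describes a heuristic search over description-logic queries, whereas the sketch below enumerates conjunctions of concept names exhaustively and ignores roles. The data in `KG` and `PREDICTED` and the function names `answers` and `explanation_rules` are hypothetical, introduced only for illustration.

```python
from itertools import combinations

# Toy knowledge graph (hypothetical data): each item maps to the set of
# concepts asserted for it.
KG = {
    "img1": {"Animal", "Striped", "Outdoor"},
    "img2": {"Animal", "Striped", "Indoor"},
    "img3": {"Animal", "Outdoor"},
    "img4": {"Vehicle", "Striped", "Outdoor"},
}

# Hypothetical black-box predictions for the same items.
PREDICTED = {"img1": "zebra", "img2": "zebra", "img3": "horse", "img4": "car"}


def answers(query):
    """Return the items whose descriptions contain every concept in the query."""
    return {item for item, concepts in KG.items() if query <= concepts}


def explanation_rules(target):
    """Find minimal conjunctive queries whose answers are all classified as target."""
    positives = {item for item, label in PREDICTED.items() if label == target}
    vocabulary = sorted(set().union(*KG.values()))
    rules = []
    # Exhaustive enumeration by increasing size (an assumption of this sketch,
    # standing in for the heuristic search mentioned in the abstract).
    for size in range(1, len(vocabulary) + 1):
        for combo in combinations(vocabulary, size):
            query = frozenset(combo)
            # Skip queries that only specialise an already accepted rule.
            if any(accepted < query for accepted, _ in rules):
                continue
            found = answers(query)
            # A query yields a rule "query -> target" when it has answers
            # and every answer is predicted as the target class.
            if found and found <= positives:
                rules.append((query, found))
    return rules


if __name__ == "__main__":
    for query, covered in explanation_rules("zebra"):
        print(sorted(query), "->", "zebra", "covers", sorted(covered))
```

Each returned pair is a query together with the items it retrieves, so a rule can be read as "anything matching this query is classified as the target class" on the given data. Exhaustive enumeration is exponential in the vocabulary size, which is one reason a heuristic search is attractive for realistic knowledge graphs.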
Pages: 1085-1126
Page count: 42