Searching for explanations of black-box classifiers in the space of semantic queries

被引:0
|
作者
Liartis, Jason [1 ]
Dervakos, Edmund [1 ]
Menis-Mastromichalakis, Orfeas [1 ]
Chortaras, Alexandros [1 ]
Stamou, Giorgos [1 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece
关键词
Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;
D O I
10.3233/SW-233469Press
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regards to their inner complex operation, obfuscating the reasons for which they make decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as in medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe the behaviour with rules that are expressed in the terminology of the knowledge graph, that is understandable by humans. We first theoretically investigate the problem to provide guarantees for the extracted rules and then we investigate the relation of "explanation rules for a specific class" with "semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class". Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms for solving this inverse problem as a heuristic search in the space of semantic queries and we evaluate the proposed algorithms on four simulated use-cases and discuss the results.
引用
收藏
页码:1085 / 1126
页数:42
相关论文
共 50 条
  • [1] Generative causal explanations of black-box classifiers
    O'Shaughnessy, Matthew
    Canal, Gregory
    Connor, Marissa
    Davenport, Mark
    Rozell, Christopher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Experimental Study on Generating Multi-modal Explanations of Black-box Classifiers in terms of Gray-box Classifiers
    Alonso, Jose M.
    Toja-Alamancos, J.
    Bugarin, A.
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [3] A Generic Framework for Black-box Explanations
    Henin, Clement
    Le Metayer, Daniel
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3667 - 3676
  • [4] Learning Groupwise Explanations for Black-Box Models
    Gao, Jingyue
    Wang, Xiting
    Wang, Yasha
    Yan, Yulan
    Xie, Xing
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2396 - 2402
  • [5] Active Bayesian Assessment of Black-Box Classifiers
    Ji, Disi
    Logan, Robert L.
    Smyth, Padhraic
    Steyvers, Mark
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7935 - 7944
  • [6] Explaining black-box classifiers: Properties and functions
    Amgoud, Leila
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 155 : 40 - 65
  • [7] Rule-based approximation of black-box classifiers for tabular data to generate global and local explanations
    Maszczyk, Cezary
    Kozielski, Michal
    Sikora, Marek
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 89 - 92
  • [8] Feature Importance Explanations for Temporal Black-Box Models
    Sood, Akshay
    Craven, Mark
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8351 - 8360
  • [9] Black-box Adversarial Attacks with Limited Queries and Information
    Ilyas, Andrew
    Engstrom, Logan
    Athalye, Anish
    Lin, Jessy
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] Evading Black-box Classifiers Without Breaking Eggs
    Debenedetti, Edoardo
    Carlini, Nicholas
    Tramer, Florian
    IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 408 - 424