Searching for explanations of black-box classifiers in the space of semantic queries

被引:0
|
作者
Liartis, Jason [1 ]
Dervakos, Edmund [1 ]
Menis-Mastromichalakis, Orfeas [1 ]
Chortaras, Alexandros [1 ]
Stamou, Giorgos [1 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece
关键词
Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;
D O I
10.3233/SW-233469Press
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regards to their inner complex operation, obfuscating the reasons for which they make decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as in medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe the behaviour with rules that are expressed in the terminology of the knowledge graph, that is understandable by humans. We first theoretically investigate the problem to provide guarantees for the extracted rules and then we investigate the relation of "explanation rules for a specific class" with "semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class". Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms for solving this inverse problem as a heuristic search in the space of semantic queries and we evaluate the proposed algorithms on four simulated use-cases and discuss the results.
引用
收藏
页码:1085 / 1126
页数:42
相关论文
共 50 条
  • [41] DDImage: an image reduction based approach for automatically explaining black-box classifiers
    Jiang, Mingyue
    Tang, Chengjian
    Zhang, Xiao-Yi
    Zhao, Yangyang
    Ding, Zuohua
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (05)
  • [42] An Evolutionary-Based Black-Box Attack to Deep Neural Network Classifiers
    Zhou, Yutian
    Tan, Yu-an
    Zhang, Quanxin
    Kuang, Xiaohui
    Han, Yahong
    Hu, Jingjing
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (04): : 1616 - 1629
  • [43] Stable and actionable explanations of black-box models through factual and counterfactual rules
    Guidotti, Riccardo
    Monreale, Anna
    Ruggieri, Salvatore
    Naretto, Francesca
    Turini, Franco
    Pedreschi, Dino
    Giannotti, Fosca
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (05) : 2825 - 2862
  • [44] BLACK-BOX ATTACKS ON IMAGE ACTIVITY PREDICTION AND ITS NATURAL LANGUAGE EXPLANATIONS
    Baia, Alina Elena
    Poggioni, Valentina
    Cavallaro, Andrea
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3688 - 3697
  • [45] Post-hoc explanation of black-box classifiers using confident itemsets
    Moradi, Milad
    Samwald, Matthias
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
  • [46] An Evolutionary-Based Black-Box Attack to Deep Neural Network Classifiers
    Yutian Zhou
    Yu-an Tan
    Quanxin Zhang
    Xiaohui Kuang
    Yahong Han
    Jingjing Hu
    Mobile Networks and Applications, 2021, 26 : 1616 - 1629
  • [47] Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers
    Gao, Ji
    Lanchantin, Jack
    Soffa, Mary Lou
    Qi, Yanjun
    2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, : 50 - 56
  • [48] THE MATHEMATICAL WORLD IN THE BLACK-BOX - SIGNIFICANCE OF THE BLACK-BOX AS A MEDIUM OF MATHEMATIZING
    MAASS, J
    SCHLOGLMANN, W
    CYBERNETICS AND SYSTEMS, 1988, 19 (04) : 295 - 309
  • [49] RobustCheck: A Python']Python package for black-box robustness assessment of image classifiers
    Ilie, Andrei
    Stefanescu, Alin
    SOFTWAREX, 2024, 27
  • [50] INSIDE THE BLACK-BOX
    HORGAN, J
    IEEE SPECTRUM, 1986, 23 (11) : 65 - 65