Searching for explanations of black-box classifiers in the space of semantic queries

Cited by: 0
Authors
Liartis, Jason [1]
Dervakos, Edmund [1]
Menis-Mastromichalakis, Orfeas [1]
Chortaras, Alexandros [1]
Stamou, Giorgos [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Artificial Intelligence & Learning Syst Lab, Zografos, Greece
Keywords
Explainable AI (XAI); opaque machine learning classifiers; knowledge graphs; description logics; semantic query answering; reverse query answering; post-hoc explainability; explanation rules; ISOMORPHISM; EXAMPLES; DATABASE;
DOI
10.3233/SW-233469
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regard to their complex inner operation, which obscures the reasons for their decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In this work, we feed the classifier with data that are part of a knowledge graph, and describe its behaviour with rules expressed in the terminology of the knowledge graph, which is understandable by humans. We first investigate the problem theoretically to provide guarantees for the extracted rules, and then we investigate the relation between "explanation rules for a specific class" and "semantic queries collecting from the knowledge graph the instances that the black-box classifier assigns to this specific class". Thus, we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms that solve this inverse problem as a heuristic search in the space of semantic queries, evaluate the proposed algorithms on four simulated use-cases, and discuss the results.
Pages: 1085-1126
Page count: 42
Related papers
50 in total
  • [31] THE BLACK-BOX
    WISEMAN, J
    ECONOMIC JOURNAL, 1991, 101 (404): : 149 - 155
  • [32] Best-Effort Adversarial Approximation of Black-Box Malware Classifiers
    Ali, Abdullah
    Eshete, Birhanu
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT I, 2020, 335 : 318 - 338
  • [33] Black-Box Adversarial Attack for Deep Learning Classifiers in IoT Applications
    Singh, Abhijit
    Sikdar, Biplab
    2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,
  • [34] Defending Black-Box Skeleton-Based Human Activity Classifiers
    Wang, He
    Diao, Yunfeng
    Tan, Zichang
    Guo, Guodong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2546 - 2554
  • [35] Boosting Physical Layer Black-Box Attacks with Semantic Adversaries in Semantic Communications
    Li, Zeju
    Liu, Xinghan
    Nan, Guoshun
    Zhou, Jinfei
    Lyu, Xinchen
    Cui, Qimei
    Tao, Xiaofeng
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5614 - 5619
  • [36] Comparing Explanations from Glass-Box and Black-Box Machine-Learning Models
    Kuk, Michal
    Bobek, Szymon
    Nalepa, Grzegorz J.
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 668 - 675
  • [37] DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
    Moreira, Ricardo
    Bono, Jacopo
    Cardoso, Mario
    Saleiro, Pedro
    Figueiredo, Mario
    Bizarro, Pedro
    CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 740 - 768
  • [38] Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure
    Novello, Paul
    Fel, Thomas
    Vigouroux, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [39] Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks
    Ardis, Paul
    Flenner, Arjuna
    ASSURANCE AND SECURITY FOR AI-ENABLED SYSTEMS, 2024, 13054
  • [40] Hybrid Batch Attacks: Finding Black-box Adversarial Examples with Limited Queries
    Suya, Fnu
    Chi, Jianfeng
    Evans, David
    Tian, Yuan
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 1327 - 1344