Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models

Cited by: 2

Authors:

Barkan, Oren [1]

Asher, Yuval [2]

Eshel, Amit [2]

Elisha, Yehonatan [1]

Koenigstein, Noam [2]

Affiliations:

[1] Open Univ, Milton Keynes, England

[2] Tel Aviv Univ, Tel Aviv, Israel

Source:

23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023 | 2023

Funding:

Israel Science Foundation;

Keywords:

Explainable AI; computer vision; transformers;

DOI:

10.1109/ICDM58522.2023.00105

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present Learning to Explain (LTX), a model-agnostic framework designed for providing post-hoc explanations for vision models. The LTX framework introduces an "explainer" model that generates explanation maps, highlighting the crucial regions that justify the predictions made by the model being explained. To train the explainer, we employ a two-stage process consisting of initial pretraining followed by per-instance finetuning. During both stages of training, we utilize a unique configuration where we compare the explained model's prediction for a masked input with its original prediction for the unmasked input. This approach enables the use of a novel counterfactual objective, which aims to anticipate the model's output using masked versions of the input image. Importantly, the LTX framework is not restricted to a specific model architecture and can provide explanations for both Transformer-based and convolutional models. Through our evaluations, we demonstrate that LTX significantly outperforms the current state-of-the-art in explainability across various metrics. Our code is available at: https://github.com/LTX-Code/LTX
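
The comparison described in the abstract, between the explained model's prediction for a masked input and its prediction for the unmasked input, can be illustrated with a small sketch. This is not the authors' implementation: the linear softmax classifier `toy_model`, the function `masked_prediction_loss`, the cross-entropy form of the comparison, and the `sparsity_weight` penalty on mask area are all hypothetical choices made only to show the general idea of scoring a mask by how well the masked input reproduces the original prediction.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D array of logits."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def toy_model(x, W):
    """Hypothetical stand-in for the explained model: a linear softmax classifier."""
    return softmax(W @ x.ravel())

def masked_prediction_loss(x, mask, W, sparsity_weight=0.1):
    """Score a mask by comparing masked and unmasked predictions.

    Assumed form (not taken from the paper): cross-entropy of the
    masked-input prediction against the class predicted on the unmasked
    input, plus a penalty proportional to the mean mask value.
    """
    original = toy_model(x, W)            # prediction on the unmasked input
    target = int(np.argmax(original))     # class the model originally predicts
    masked = toy_model(x * mask, W)       # prediction on the masked input
    cross_entropy = -np.log(masked[target] + 1e-12)
    return cross_entropy + sparsity_weight * mask.mean(), target

# Toy data: a 4x4 "image" and a 3-class linear model with random weights.
rng = np.random.default_rng(0)
x = rng.random((4, 4))
W = rng.normal(size=(3, 16))

full_mask = np.ones_like(x)    # keeps every pixel
empty_mask = np.zeros_like(x)  # removes every pixel

loss_full, predicted_class = masked_prediction_loss(x, full_mask, W)
loss_empty, _ = masked_prediction_loss(x, empty_mask, W)
print(predicted_class, loss_full, loss_empty)
```

In this sketch an all-zero mask drives the toy model to a uniform prediction, so its loss equals the log of the number of classes, while an all-ones mask reproduces the original prediction and pays only the area penalty on top of the original cross-entropy. A learned explainer would search for masks between these two extremes.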


Pages: 944-949

Number of pages: 6

