Learning to Explain: A Model -Agnostic Framework for Explaining Black Box Models

被引:2
|
作者
Barkan, Oren [1 ]
Asher, Yuval [2 ]
Eshel, Amit [2 ]
Elisha, Yelionatan [1 ]
Koenigstein, Noam [2 ]
机构
[1] Open Univ, Milton Keynes, England
[2] Tel Aviv Univ, Tel Aviv, Israel
来源
23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023 | 2023年
基金
以色列科学基金会;
关键词
Explainable AI; computer vision; transformers;
D O I
10.1109/ICDM58522.2023.00105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Learning to Explain (LTX), a model-agnostic framework designed for providing post -hoc explanations for vision models. The LTX framework introduces an "explainer" model that generates explanation maps, highlighting the crucial regions that justify the predictions made by the model being explained. To train the explainer, we employ a two -stage process consisting of initial pretraining followed by per-instance finetuning. During both stages of training, we utilize a unique configuration where we compare the explained model's prediction for a masked input with its original prediction for the unmasked input. This approach enables the use of a novel counterfactual objective, which aims to anticipate the model's output using masked versions of the input image. Importantly, the LTX framework is not restricted to a specific model architecture and can provide explanations for both Transformer-based and convolutional models. Through our evaluations, we demonstrate that LTX significantly outperforms the current state-of-the-art in explainability across various metrics. Our code is available at: https://githab.cian/LTX-CodelLTX
引用
收藏
页码:944 / 949
页数:6
相关论文
共 50 条
  • [1] Explaining Black Box Drug Target Prediction Through Model Agnostic Counterfactual Samples
    Nguyen, Tri Minh
    Quinn, Thomas P.
    Nguyen, Thin
    Tran, Truyen
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1020 - 1029
  • [2] Explaining the Performance of Black Box Regression Models
    Areosa, Ines
    Torgo, Luis
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 110 - 118
  • [3] A Survey of Methods for Explaining Black Box Models
    Guidotti, Riccardo
    Monreale, Anna
    Ruggieri, Salvatore
    Turin, Franco
    Giannotti, Fosca
    Pedreschi, Dino
    ACM COMPUTING SURVEYS, 2019, 51 (05)
  • [4] Multi-criteria Approaches to Explaining Black Box Machine Learning Models
    Stefanowski, Jerzy
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT II, 2023, 13996 : 195 - 208
  • [5] A Rate-Distortion Framework for Explaining Black-Box Model Decisions
    Kolek, Stefan
    Nguyen, Duc Anh
    Levie, Ron
    Bruna, Joan
    Kutyniok, Gitta
    XXAI - BEYOND EXPLAINABLE AI: International Workshop, Held in Conjunction with ICML 2020, July 18, 2020, Vienna, Austria, Revised and Extended Papers, 2022, 13200 : 91 - 115
  • [6] Explaining Black Box Models Through Twin Systems
    Cau, Federico Maria
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES COMPANION (IUI'20), 2020, : 27 - 28
  • [7] Explaining Black Box Models by means of Local Rules
    Pastor, Eliana
    Baralis, Elena
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 510 - 517
  • [8] Explaining Black-box Classification Models with Arguments
    Amgoud, Leila
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 791 - 795
  • [9] Alibi explain: Algorithms for explaining machine learning models
    Klaise, Janis
    Van Looveren, Arnaud
    Vacanti, Giovanni
    Coca, Alexandru
    Journal of Machine Learning Research, 2021, 22
  • [10] Alibi Explain: Algorithms for Explaining Machine Learning Models
    Klaise, Janis
    Van Looveren, Arnaud
    Vacanti, Giovanni
    Coca, Alexandru
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22