Explaining machine learning models using entropic variable projection

被引:1
|
作者
Bachoc, Francois [1 ]
Gamboa, Fabrice [1 ,2 ]
Halford, Max [3 ]
Loubes, Jean-Michel [1 ,2 ]
Risser, Laurent [1 ,2 ]
机构
[1] Inst Math Toulouse, Toulouse, France
[2] Artificial & Nat Intelligence Toulouse Inst 3IA AN, Toulouse, France
[3] Inst Rech Informat Toulouse, Toulouse, France
关键词
Explainability; Black-box decision rules; Kullback-Leibler divergence; Wasserstein distance;
D O I
10.1093/imaiai/iaad010
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we present a new explainability formalism designed to shed light on how each input variable of a test set impacts the predictions of machine learning models. Hence, we propose a group explainability formalism for trained machine learning decision rules, based on their response to the variability of the input variables distribution. In order to emphasize the impact of each input variable, this formalism uses an information theory framework that quantifies the influence of all input-output observations based on entropic projections. This is thus the first unified and model agnostic formalism enabling data scientists to interpret the dependence between the input variables, their impact on the prediction errors and their influence on the output predictions. Convergence rates of the entropic projections are provided in the large sample case. Most importantly, we prove that computing an explanation in our framework has a low algorithmic complexity, making it scalable to real-life large datasets. We illustrate our strategy by explaining complex decision rules learned using XGBoost, Random Forest or Deep Neural Network classifiers on various datasets such as Adult Income, MNIST, CelebA, Boston Housing, Iris, as well as synthetic ones. We finally make clear its differences with the explainability strategies LIME and SHAP, which are based on single observations. Results can be reproduced using the freely distributed Python toolbox .
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Explaining Mispredictions of Machine Learning Models using Rule Induction
    Cito, Juergen
    Dillig, Isil
    Kim, Seohyun
    Murali, Vijayaraghavan
    Chandra, Satish
    PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 716 - 727
  • [2] Explaining machine learning models in sales predictions
    Bohanec, Marko
    Borstnar, Mirjana Kljajic
    Robnik-Sikonja, Marko
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 : 416 - 428
  • [3] Explaining machine learning models with interactive natural language conversations using TalkToModel
    Slack, Dylan
    Krishna, Satyapriya
    Lakkaraju, Himabindu
    Singh, Sameer
    NATURE MACHINE INTELLIGENCE, 2023, 5 (08) : 873 - +
  • [4] Explaining machine learning models with interactive natural language conversations using TalkToModel
    Dylan Slack
    Satyapriya Krishna
    Himabindu Lakkaraju
    Sameer Singh
    Nature Machine Intelligence, 2023, 5 : 873 - 883
  • [5] Alibi explain: Algorithms for explaining machine learning models
    Klaise, Janis
    Van Looveren, Arnaud
    Vacanti, Giovanni
    Coca, Alexandru
    Journal of Machine Learning Research, 2021, 22
  • [6] Explaining and Integrating Machine Learning Models with Rigorous Simulation
    Schoeneberger, Jan C.
    Aker, Burcu
    Fricke, Armin
    CHEMIE INGENIEUR TECHNIK, 2021, 93 (12) : 1998 - 2009
  • [7] Explaining Machine Learning Models for Clinical Gait Analysis
    Slijepcevic D.
    Horst F.
    Lapuschkin S.
    Horsak B.
    Raberger A.-M.
    Kranzl A.
    Samek W.
    Breiteneder C.
    Schöllhorn W.I.
    Zeppelzauer M.
    ACM Transactions on Computing for Healthcare, 2022, 3 (02):
  • [8] SurvLIME: A method for explaining machine learning survival models
    Kovalev, Maxim S.
    Utkin, Lev, V
    Kasimov, Ernest M.
    KNOWLEDGE-BASED SYSTEMS, 2020, 203
  • [9] Alibi Explain: Algorithms for Explaining Machine Learning Models
    Klaise, Janis
    Van Looveren, Arnaud
    Vacanti, Giovanni
    Coca, Alexandru
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [10] Explaining customer churn prediction in telecom industry using tabular machine learning models
    Poudel, Sumana Sharma
    Pokharel, Suresh
    Timilsina, Mohan
    MACHINE LEARNING WITH APPLICATIONS, 2024, 17