Counterfactual Models for Fair and Adequate Explanations

被引:6
|
作者
Asher, Nicholas [1 ]
De Lara, Lucas [2 ]
Paul, Soumya [3 ]
Russell, Chris [4 ]
机构
[1] Univ Paul Sabatier, Inst Rech Informat Toulouse, F-31062 Toulouse, France
[2] Univ Paul Sabatier, Inst Math Toulouse, F-31062 Toulouse, France
[3] Telindus, 18 Rue Puits Romain, L-8070 Luxembourg, Luxembourg
[4] Amazon Res, D-72072 Tubingen, Germany
来源
关键词
explainability; counterfactual models; transport theories; 1ST-ORDER LOGICS; COMPLEXITY;
D O I
10.3390/make4020014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent efforts have uncovered various methods for providing explanations that can help interpret the behavior of machine learning programs. Exact explanations with a rigorous logical foundation provide valid and complete explanations, but they have an epistemological problem: they are often too complex for humans to understand and too expensive to compute even with automated reasoning methods. Interpretability requires good explanations that humans can grasp and can compute. We take an important step toward specifying what good explanations are by analyzing the epistemically accessible and pragmatic aspects of explanations. We characterize sufficiently good, or fair and adequate, explanations in terms of counterfactuals and what we call the conundra of the explainee, the agent that requested the explanation. We provide a correspondence between logical and mathematical formulations for counterfactuals to examine the partiality of counterfactual explanations that can hide biases; we define fair and adequate explanations in such a setting. We provide formal results about the algorithmic complexity of fair and adequate explanations. We then detail two sophisticated counterfactual models, one based on causal graphs, and one based on transport theories. We show transport based models have several theoretical advantages over the competition as explanation frameworks for machine learning algorithms.
引用
收藏
页码:371 / 396
页数:26
相关论文
共 50 条
  • [1] Fair and Adequate Explanations
    Asher, Nicholas
    Paul, Soumya
    Russell, Chris
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION (CD-MAKE 2021), 2021, 12844 : 79 - 97
  • [2] Counterfactual Explanations for Models of Code
    Cito, Juergen
    Dillig, Isil
    Murali, Vijayaraghavan
    Chandra, Satish
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2022), 2022, : 125 - 134
  • [3] Diffusion Models for Counterfactual Explanations
    Jeanneret, Guillaume
    Simon, Loic
    Jurie, Fredric
    [J]. COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 219 - 237
  • [4] Understanding Survival Models Through Counterfactual Explanations
    Alabdallah, Abdallah
    Jakubowski, Jakub
    Pashami, Sepideh
    Bobek, Szymon
    Ohlsson, Mattias
    Rognvaldsson, Thorsteinn
    Nalepa, Grzegorz J.
    [J]. COMPUTATIONAL SCIENCE, ICCS 2024, PT IV, 2024, 14835 : 310 - 324
  • [5] The Skyline of Counterfactual Explanations for Machine Learning Decision Models
    Wang, Yongjie
    Ding, Qinxu
    Wang, Ke
    Liu, Yue
    Wu, Xingyu
    Wang, Jinglong
    Liu, Yong
    Miao, Chunyan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2030 - 2039
  • [6] ViCE: Visual Counterfactual Explanations for Machine Learning Models
    Gomez, Oscar
    Holter, Steffen
    Yuan, Jun
    Bertini, Enrico
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2020, 2020, : 531 - 535
  • [7] DECE: Decision Explorer with Counterfactual Explanations for Machine Learning Models
    Cheng, Furui
    Ming, Yao
    Qu, Huamin
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1438 - 1447
  • [8] Counterfactual Visual Explanations
    Goyal, Yash
    Wu, Ziyan
    Ernst, Jan
    Batra, Dhruv
    Parikh, Devi
    Lee, Stefan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [9] CSSE - An agnostic method of counterfactual, selected, and social explanations for classification models
    Balbino, Marcelo de Sousa
    Galvez, Luis Enrique Zarate
    Nobre, Cristiane Neri
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [10] PreCoF: counterfactual explanations for fairness
    Sofie Goethals
    David Martens
    Toon Calders
    [J]. Machine Learning, 2024, 113 : 3111 - 3142