"Better" Counterfactuals, Ones People Can Understand: Psychologically-Plausible Case-Based Counterfactuals Using Categorical Features for Explainable AI (XAI)
A recent surge of research has focused on counterfactual explanations as a promising solution to the eXplainable AI (XAI) problem. Over 100 counterfactual XAI methods have been proposed, many emphasising the key role of features that are "important" or "causal" or "actionable" in making explanations comprehensible to human users. However, these proposals rest on intuition rather than psychological evidence. Indeed, recent psychological evidence [22] shows that it is abstract feature-types that impact people's understanding of explanations; categorical features better support people's learning of an AI model's predictions than continuous features. This paper proposes a more psychologically-valid counterfactual method, one that extends case-based techniques with additional functionality to transform feature-differences into categorical versions of themselves. This enhanced case-based counterfactual method still generates good counterfactuals relative to baseline methods on coverage and distance metrics. This is the first counterfactual method specifically designed to meet identified psychological requirements of end-users, rather than merely reflecting the intuitions of algorithm designers.
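The abstract only gestures at how continuous feature-differences are transformed into categorical form. As a minimal, hypothetical sketch of the general idea, the snippet below bins a continuous feature's values into coarse levels (here, quantile-based "low"/"medium"/"high" cut-points, an assumption not taken from the paper) so that a counterfactual difference can be presented categorically rather than numerically; it is an illustration of the concept, not the paper's actual algorithm.

```python
# Illustrative sketch only: the quantile binning, level labels, and helper
# name below are assumptions for illustration, not the paper's method.
import numpy as np

def to_categorical_difference(feature_values, query_value, counterfactual_value,
                              labels=("low", "medium", "high")):
    """Map a continuous feature change onto coarse categorical levels,
    using tercile cut-points estimated from the feature's training values."""
    cuts = np.quantile(feature_values, [1 / 3, 2 / 3])  # assumed tercile bins

    def level(v):
        # searchsorted returns 0, 1, or 2 for two cut-points,
        # indexing directly into the three labels
        return labels[int(np.searchsorted(cuts, v))]

    return level(query_value), level(counterfactual_value)

# Example: instead of presenting "income: 41,200 -> 58,700", the explanation
# could read "income: medium -> high", the kind of categorical presentation
# the paper argues better supports people's understanding.
incomes = np.random.default_rng(0).normal(50_000, 15_000, 1_000)
print(to_categorical_difference(incomes, 41_200, 58_700))
```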