Causality-based counterfactual explanation for classification models

被引：1

作者：

Duong, Tri Dung ^{[1
]}

Li, Qian ^{[2
]}

Xu, Guandong ^{[3
]}

机构：

[1] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW, Australia

[2] Curtin Univ, Sch Elect Engn Comp & Math Sci, Perth, WA, Australia

[3] Educ Univ Hong Kong, Ctr Learning Teaching & Technol, Hong Kong, HK, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 300卷

基金：

澳大利亚研究理事会; 美国国家科学基金会;

关键词：

Counterfactual explanation; Interpretable machine learning; Structural causal model; GENETIC ALGORITHM;

D O I：

10.1016/j.knosys.2024.112200

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Counterfactual explanation is one branch of interpretable machine learning that produces a perturbation sample to change the model's original decision. The generated samples can act as a recommendation for end-users to achieve their desired outputs. Most of the current counterfactual explanation approaches are the gradient-based method, which can only optimize the differentiable loss functions with continuous variables. Accordingly, the gradient-free methods are proposed to handle the categorical variables, which however have several major limitations: (1) causal relationships among features are typically ignored when generating the counterfactuals, possibly resulting in impractical guidelines for decision-makers; (2) the counterfactual explanation algorithm requires a great deal of effort into parameter tuning for determining the optimal weight for each loss functions which must be conducted repeatedly for different datasets and settings. In this work, to address the above limitations, we propose a prototype-based counterfactual explanation framework (ProCE). ProCE is capable of preserving the causal relationship underlying the features of the counterfactual data. In addition, we design a novel gradient-free optimization based on the multi-objective genetic algorithm that generates the counterfactual explanations for the mixed-type of continuous and categorical features. Numerical experiments demonstrate that our method compares favorably with state-of-the-art methods and therefore is applicable to existing prediction models. All the source codes and data are available at https: //github.com/tridungduong16/multiobj-scm-cf.

引用

页数：12

共 50 条

[41] PC-Fairness: A Unified Framework for Measuring Causality-based Fairness
Wu, Yongkai
Zhang, Lu
Wu, Xintao
Tong, Hanghang
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[42] CausaLM: Causal Model Explanation Through Counterfactual Language Models
Feder, Amir
Oved, Nadav
Shalit, Uri
Reichart, Roi
[J]. COMPUTATIONAL LINGUISTICS, 2021, 47 (02) : 333 - 386
[43] CAUSALITY-BASED FAILURE-DRIVEN LEARNING IN DIAGNOSTIC EXPERT SYSTEMS
RICH, SH
VENKATASUBRAMANIAN, V
[J]. AICHE JOURNAL, 1989, 35 (06) : 943 - 950
[44] A Causality-Based Approach to Assessing Inconsistency for Multi-context Systems
Mu, Kedian
[J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 817 - 828
[45] Granger causality-based synaptic weights estimation for analyzing neuronal networks
Shao, Pei-Chiang
Huang, Jian-Jia
Shann, Wei-Chang
Yen, Chen-Tung
Tsai, Meng-Li
Yen, Chien-Chang
[J]. JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2015, 38 (03) : 483 - 497
[46] Causality-based method for determining the time origin in terahertz emission spectroscopy
Unuma, Takeya
Ino, Yusuke
Peiponen, Kai-Erik
Vartiainen, Erik M.
Kuwata-Gonokami, Makoto
Hirakawa, Kazuhiko
[J]. OPTICS EXPRESS, 2011, 19 (13): : 12759 - 12765
[47] Causality-Based Model For User Profile Construction From Behavior Sequences
Chikhaoui, Belkacem
Wang, Shengrui
Pigot, Helene
[J]. 2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2013, : 461 - 468
[48] Granger causality-based synaptic weights estimation for analyzing neuronal networks
Pei-Chiang Shao
Jian-Jia Huang
Wei-Chang Shann
Chen-Tung Yen
Meng-Li Tsai
Chien-Chang Yen
[J]. Journal of Computational Neuroscience, 2015, 38 : 483 - 497
[49] Counterfactual Narrative Explanation
Dohrn, Daniel
[J]. JOURNAL OF AESTHETICS AND ART CRITICISM, 2009, 67 (01): : 37 - 47
[50] A Consistent Causality-Based View on a Timed Process Algebra Including Urgent Interactions
Joost-Pieter Katoen
Rom Langerak
Ed Brinksma
Diego Latella
Tommaso Bolognesi
[J]. Formal Methods in System Design, 1998, 12 : 189 - 216

← 1 2 3 4 5 →