Causality-based counterfactual explanation for classification models

被引:1
|
作者
Duong, Tri Dung [1 ]
Li, Qian [2 ]
Xu, Guandong [3 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW, Australia
[2] Curtin Univ, Sch Elect Engn Comp & Math Sci, Perth, WA, Australia
[3] Educ Univ Hong Kong, Ctr Learning Teaching & Technol, Hong Kong, HK, Peoples R China
基金
澳大利亚研究理事会; 美国国家科学基金会;
关键词
Counterfactual explanation; Interpretable machine learning; Structural causal model; GENETIC ALGORITHM;
D O I
10.1016/j.knosys.2024.112200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counterfactual explanation is one branch of interpretable machine learning that produces a perturbation sample to change the model's original decision. The generated samples can act as a recommendation for end-users to achieve their desired outputs. Most of the current counterfactual explanation approaches are the gradient-based method, which can only optimize the differentiable loss functions with continuous variables. Accordingly, the gradient-free methods are proposed to handle the categorical variables, which however have several major limitations: (1) causal relationships among features are typically ignored when generating the counterfactuals, possibly resulting in impractical guidelines for decision-makers; (2) the counterfactual explanation algorithm requires a great deal of effort into parameter tuning for determining the optimal weight for each loss functions which must be conducted repeatedly for different datasets and settings. In this work, to address the above limitations, we propose a prototype-based counterfactual explanation framework (ProCE). ProCE is capable of preserving the causal relationship underlying the features of the counterfactual data. In addition, we design a novel gradient-free optimization based on the multi-objective genetic algorithm that generates the counterfactual explanations for the mixed-type of continuous and categorical features. Numerical experiments demonstrate that our method compares favorably with state-of-the-art methods and therefore is applicable to existing prediction models. All the source codes and data are available at https: //github.com/tridungduong16/multiobj-scm-cf.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] PC-Fairness: A Unified Framework for Measuring Causality-based Fairness
    Wu, Yongkai
    Zhang, Lu
    Wu, Xintao
    Tong, Hanghang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [42] CausaLM: Causal Model Explanation Through Counterfactual Language Models
    Feder, Amir
    Oved, Nadav
    Shalit, Uri
    Reichart, Roi
    [J]. COMPUTATIONAL LINGUISTICS, 2021, 47 (02) : 333 - 386
  • [43] CAUSALITY-BASED FAILURE-DRIVEN LEARNING IN DIAGNOSTIC EXPERT SYSTEMS
    RICH, SH
    VENKATASUBRAMANIAN, V
    [J]. AICHE JOURNAL, 1989, 35 (06) : 943 - 950
  • [44] A Causality-Based Approach to Assessing Inconsistency for Multi-context Systems
    Mu, Kedian
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 817 - 828
  • [45] Granger causality-based synaptic weights estimation for analyzing neuronal networks
    Shao, Pei-Chiang
    Huang, Jian-Jia
    Shann, Wei-Chang
    Yen, Chen-Tung
    Tsai, Meng-Li
    Yen, Chien-Chang
    [J]. JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2015, 38 (03) : 483 - 497
  • [46] Causality-based method for determining the time origin in terahertz emission spectroscopy
    Unuma, Takeya
    Ino, Yusuke
    Peiponen, Kai-Erik
    Vartiainen, Erik M.
    Kuwata-Gonokami, Makoto
    Hirakawa, Kazuhiko
    [J]. OPTICS EXPRESS, 2011, 19 (13): : 12759 - 12765
  • [47] Causality-Based Model For User Profile Construction From Behavior Sequences
    Chikhaoui, Belkacem
    Wang, Shengrui
    Pigot, Helene
    [J]. 2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2013, : 461 - 468
  • [48] Granger causality-based synaptic weights estimation for analyzing neuronal networks
    Pei-Chiang Shao
    Jian-Jia Huang
    Wei-Chang Shann
    Chen-Tung Yen
    Meng-Li Tsai
    Chien-Chang Yen
    [J]. Journal of Computational Neuroscience, 2015, 38 : 483 - 497
  • [49] Counterfactual Narrative Explanation
    Dohrn, Daniel
    [J]. JOURNAL OF AESTHETICS AND ART CRITICISM, 2009, 67 (01): : 37 - 47
  • [50] A Consistent Causality-Based View on a Timed Process Algebra Including Urgent Interactions
    Joost-Pieter Katoen
    Rom Langerak
    Ed Brinksma
    Diego Latella
    Tommaso Bolognesi
    [J]. Formal Methods in System Design, 1998, 12 : 189 - 216