Achievable Minimally-Contrastive Counterfactual Explanations

Cited by: 0
Authors
Barzekar, Hosein [1]
McRoy, Susan [1]
Affiliations
[1] Univ Wisconsin Milwaukee, Dept Comp Sci, Milwaukee, WI 53211 USA
Source
Keywords
machine learning; interpretability; feasibility; counterfactual and contrastive explanation; self-determination
DOI
10.3390/make5030048
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Decision support systems based on machine learning models should be able to help users identify opportunities and threats. Popular model-agnostic explanation models can identify factors that support various predictions, answering questions such as "What factors affect sales?" or "Why did sales decline?", but do not highlight what a person should or could do to get a more desirable outcome. Counterfactual explanation approaches address intervention, and some even consider feasibility, but none consider their suitability for real-time applications, such as question answering. Here, we address this gap by introducing a novel model-agnostic method that provides specific, feasible changes that would impact the outcomes of a complex Black Box AI model for a given instance and assess its real-world utility by measuring its real-time performance and ability to find achievable changes. The method uses the instance of concern to generate high-precision explanations and then applies a secondary method to find achievable minimally-contrastive counterfactual explanations (AMCC) while limiting the search to modifications that satisfy domain-specific constraints. Using a widely recognized dataset, we evaluated the classification task to ascertain the frequency and time required to identify successful counterfactuals. For a 90% accurate classifier, our algorithm identified AMCC explanations in 47% of cases (38 of 81), with an average discovery time of 80 ms. These findings verify the algorithm's efficiency in swiftly producing AMCC explanations, suitable for real-time systems. The AMCC method enhances the transparency of Black Box AI models, aiding individuals in evaluating remedial strategies or assessing potential outcomes.
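The search step described in the abstract (look for a small set of feature changes that flips the prediction while respecting domain constraints) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the toy classifier `predict`, the feature names, and the constraint rules in `allowed_values` are hypothetical, "minimal" is taken to mean the fewest changed features, and the high-precision explanation stage that precedes the search in the paper is omitted.

```python
from itertools import combinations, product


def predict(x):
    """Hypothetical stand-in for a black-box classifier (not the paper's model)."""
    score = 2 * x["education_years"] + x["hours_per_week"] // 2
    score += 10 if x["has_degree"] else 0
    return "high" if score >= 60 else "low"


def allowed_values(feature, current):
    """Hypothetical domain constraints: values a person could realistically reach."""
    if feature == "education_years":
        # Education can only increase, by at most four years.
        return list(range(current + 1, current + 5))
    if feature == "hours_per_week":
        # Working hours may move in steps of five between 20 and 60.
        return [v for v in range(20, 61, 5) if v != current]
    if feature == "has_degree":
        # A degree can be gained but not lost.
        return [] if current else [True]
    # Anything else (for example, age) is treated as immutable.
    return []


def find_counterfactual(instance, target, max_changes=2):
    """Return the first constraint-satisfying change set with the fewest
    changed features that yields `target`, or None if none is found."""
    features = list(instance)
    for k in range(1, max_changes + 1):  # try smaller change sets first
        for subset in combinations(features, k):
            options = [allowed_values(f, instance[f]) for f in subset]
            for values in product(*options):
                candidate = dict(instance)
                candidate.update(zip(subset, values))
                if predict(candidate) == target:
                    return {f: (instance[f], candidate[f]) for f in subset}
    return None


instance = {"age": 30, "education_years": 12,
            "hours_per_week": 40, "has_degree": False}
changes = find_counterfactual(instance, target="high")
print(predict(instance), changes)
```

The exhaustive search is exponential in `max_changes`, so a sketch like this only makes sense for a handful of mutable features; a real-time system would need the kind of pruning that the abstract attributes to the explanation stage.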
Pages: 922-936
Page count: 15