A Survey of Counterfactual Explanations: Definition, Evaluation, Algorithms, and Applications

被引:0
|
作者
Zhang, Xuezhong [1 ]
Dai, Libin [1 ]
Peng, Qingming [1 ]
Tang, Ruizhi [1 ]
Li, Xinwei [2 ]
机构
[1] PetroChina, South Sulige Operat Branch Changqing Oilfield Co, Xian 710018, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Explainable AI; Counterfactual explanations; Causability; Interpretable machine learning;
D O I
10.1007/978-3-031-20738-9_99
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable machine learning aims to reveal the reasons why black-box models make decisions. Counterfactual explanation is an example-based post-hoc explanation method. The counterfactual explanations aims to find the minimum perturbation that changes the model output with respect to the original instance. This study's goal is to review the literature that has already been written about counterfactual explanations and topics that are relevant to it. We provide a formal definition of counterfactual explanations and counterfactual explainer, and a summary and formulaic description of the properties of the generated counterfactual instances. In addition, we investigate the application of counterfactual explanations in two areas: model robustness, and generating feature importance. The findings demonstrate that the qualities necessary for counterfactual instances cannot be simultaneously satisfied by present methodologies. Finally, we go over potential future research directions.
引用
收藏
页码:905 / 912
页数:8
相关论文
共 50 条
  • [1] A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges
    Prado-Romero, Mario Alfonso
    Prenkaj, Bardh
    Stilo, Giovanni
    Giannotti, Fosca
    [J]. ACM COMPUTING SURVEYS, 2024, 56 (07)
  • [2] Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms
    Carreira-Perpinan, Miguel A.
    Hada, Suryabhan Singh
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6903 - 6911
  • [3] Counterfactual Visual Explanations
    Goyal, Yash
    Wu, Ziyan
    Ernst, Jan
    Batra, Dhruv
    Parikh, Devi
    Lee, Stefan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] PreCoF: counterfactual explanations for fairness
    Sofie Goethals
    David Martens
    Toon Calders
    [J]. Machine Learning, 2024, 113 : 3111 - 3142
  • [5] Counterfactual Explanations for Models of Code
    Cito, Juergen
    Dillig, Isil
    Murali, Vijayaraghavan
    Chandra, Satish
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2022), 2022, : 125 - 134
  • [6] Counterfactual Causality and Historical Explanations
    Gerber, Doris
    [J]. EXPLANATION IN ACTION THEORY AND HISTORIOGRAPHY: CAUSAL AND TELEOLOGICAL APPROACHES, 2019, : 167 - 178
  • [7] On generating trustworthy counterfactual explanations
    Del Ser, Javier
    Barredo-Arrieta, Alejandro
    Diaz-Rodriguez, Natalia
    Herrera, Francisco
    Saranti, Anna
    Holzinger, Andreas
    [J]. INFORMATION SCIENCES, 2024, 655
  • [8] Diffusion Models for Counterfactual Explanations
    Jeanneret, Guillaume
    Simon, Loic
    Jurie, Fredric
    [J]. COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 219 - 237
  • [9] PreCoF: counterfactual explanations for fairness
    Goethals, Sofie
    Martens, David
    Calders, Toon
    [J]. MACHINE LEARNING, 2024, 113 (05) : 3111 - 3142
  • [10] Adversarial Counterfactual Visual Explanations
    Jeanneret, Guillaume
    Simon, Loic
    Jurie, Frederic
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 16425 - 16435