FASTER-CE: Fast, Sparse, Transparent, and Robust Counterfactual Explanations

被引:1
|
作者
Sharma, Shubham [1 ]
Gee, Alan [2 ]
Henderson, Jette [3 ]
Ghosh, Joydeep [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Amira Learning, Seattle, WA USA
[3] TecnoTree, Espoo, Finland
关键词
D O I
10.1007/978-3-031-63223-5_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counterfactual explanations were first introduced as a human-centric way to understand model behavior. While validity remains core to the counterfactual explanation definition, researchers have also identified other desirable properties that make counterfactual explanations more usable on the deployment and the end-user sides: speed of explanation generation, robustness/sensitivity, and succinctness of explanations. Motivated by the need to make counterfactual explanations practically viable for large-scale datasets, we introduce a novel set of algorithms called FASTER-CE, which generate sparse and robust counterfactual explanations efficiently at test time by finding promising search directions for counterfactuals in a latent space that is specified via an autoencoder. These directions are determined based on gradients with respect to each of the original input features as well as of the target, as estimated in the latent space. The ability to quickly examine combinations of the most promising gradient directions as well as to incorporate additional user-defined constraints enables FASTER-CE to generate multiple counterfactual explanations that are sparse, realistic, and robust to input manipulations. Through experiments on multiple datasets of varied complexities, we show that FASTER-CE is not only much faster at test time, but also capable of considering a larger set of desirable and often conflicting properties for counterfactual explanations.
引用
收藏
页码:183 / 196
页数:14
相关论文
共 50 条
  • [1] Generating Robust Counterfactual Explanations
    Guyomard, Victor
    Fessant, Francoise
    Guyet, Thomas
    Bouadi, Tassadit
    Termier, Alexandre
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 394 - 409
  • [2] On the robustness of sparse counterfactual explanations to adverse perturbations
    Virgolin, Marco
    Fracaros, Saverio
    ARTIFICIAL INTELLIGENCE, 2023, 316
  • [3] Interval abstractions for robust counterfactual explanations
    Jiang, Junqi
    Leofante, Francesco
    Rago, Antonio
    Toni, Francesca
    ARTIFICIAL INTELLIGENCE, 2024, 336
  • [4] Sparse Visual Counterfactual Explanations in Image Space
    Boreiko, Valentyn
    Augustin, Maximilian
    Croce, Francesco
    Berens, Philipp
    Hein, Matthias
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 133 - 148
  • [5] Gradient-based Counterfactual Generation for Sparse and Diverse Counterfactual Explanations
    Han, Chan Sik
    Lee, Keon Myung
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1240 - 1247
  • [6] Transparent Latent Space Counterfactual Explanations for Tabular Data
    Bodria, Francesco
    Guidotti, Riccardo
    Giannotti, Fosca
    Pedreschi, Dino
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 445 - 454
  • [7] Robust Counterfactual Explanations in Machine Learning: A Survey
    Jiang, Junqi
    Leofante, Francesco
    Rago, Antonio
    Toni, Francesca
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8086 - 8094
  • [8] Robust Stochastic Graph Generator for Counterfactual Explanations
    Prado-Romero, Mario Alfonso
    Prenkaj, Bardh
    Stilo, Giovanni
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21518 - 21526
  • [9] Generating Sparse Counterfactual Explanations for Multivariate Time Series
    Lang, Jana
    Giese, Martin A.
    Ilg, Winfried
    Otte, Sebastian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 180 - 193
  • [10] Robust Counterfactual Explanations on Graph Neural Networks
    Bajaj, Mohit
    Chu, Lingyang
    Xue, Zi Yu
    Pei, Jian
    Wang, Lanjun
    Lam, Peter Cho-Ho
    Zhang, Yong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34