Interval abstractions for robust counterfactual explanations

被引:0
|
作者
Jiang, Junqi [1 ]
Leofante, Francesco [1 ]
Rago, Antonio [1 ]
Toni, Francesca [1 ]
机构
[1] Imperial Coll London, Dept Comp, 180 Queens Gate, London SW7 2AZ, England
基金
欧洲研究理事会;
关键词
Explainable AI; Counterfactual explanations; Algorithmic recourse; Robustness of explanations;
D O I
10.1016/j.artint.2024.104218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counterfactual Explanations (CEs) have emerged as a major paradigm in explainable AI research, providing recourse recommendations for users affected by the decisions of machine learning models. However, CEs found by existing methods often become invalid when slight changes occur in the parameters of the model they were generated for. The literature lacks a way to provide exhaustive robustness guarantees for CEs under model changes, in that existing methods to improve CEs' robustness are mostly heuristic, and the robustness performances are evaluated empirically using only a limited number of retrained models. To bridge this gap, we propose a novel interval abstraction technique for parametric machine learning models, which allows us to obtain provable robustness guarantees for CEs under a possibly infinite set of plausible model changes Delta. Based on this idea, we formalise a robustness notion for CEs, which we call Delta-robustness, in both binary and multi-class classification settings. We present procedures to verify Delta-robustness based on Mixed Integer Linear Programming, using which we further propose algorithms to generate CEs that are Delta-robust. In an extensive empirical study involving neural networks and logistic regression models, we demonstrate the practical applicability of our approach. We discuss two strategies for determining the appropriate hyperparameters in our method, and we quantitatively benchmark CEs generated by eleven methods, highlighting the effectiveness of our algorithms in finding robust CEs.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Generating Robust Counterfactual Explanations
    Guyomard, Victor
    Fessant, Francoise
    Guyet, Thomas
    Bouadi, Tassadit
    Termier, Alexandre
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 394 - 409
  • [2] Robust Counterfactual Explanations in Machine Learning: A Survey
    Jiang, Junqi
    Leofante, Francesco
    Rago, Antonio
    Toni, Francesca
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8086 - 8094
  • [3] Robust Stochastic Graph Generator for Counterfactual Explanations
    Prado-Romero, Mario Alfonso
    Prenkaj, Bardh
    Stilo, Giovanni
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21518 - 21526
  • [4] Robust Counterfactual Explanations on Graph Neural Networks
    Bajaj, Mohit
    Chu, Lingyang
    Xue, Zi Yu
    Pei, Jian
    Wang, Lanjun
    Lam, Peter Cho-Ho
    Zhang, Yong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Finding Regions of Counterfactual Explanations via Robust Optimization
    Maragno, Donato
    Kurtz, Jannis
    Rober, Tabea E.
    Goedhart, Rob
    Birbil, S. Ilker
    den Hertog, Dick
    INFORMS JOURNAL ON COMPUTING, 2024, 36 (05) : 1316 - 1334
  • [6] Flexible and Robust Counterfactual Explanations with Minimal Satisfiable Perturbations
    Wang, Yongjie
    Qian, Hangwei
    Liu, Yongjie
    Guo, Wei
    Miao, Chunyan
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2596 - 2605
  • [7] Robust Counterfactual Explanations for Tree-Based Ensembles
    Dutta, Sanghamitra
    Long, Jason
    Mishra, Saumitra
    Tilli, Cecilia
    Magazzeni, Daniele
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [8] Provably Robust and Plausible Counterfactual Explanations for Neural Networks via Robust Optimisation
    Jiang, Junqi
    Lan, Jianglin
    Leofante, Francesco
    Rago, Antonio
    Toni, Francesca
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [9] Generally-Occurring Model Change for Robust Counterfactual Explanations
    Xu, Ao
    Wu, Tieru
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IV, 2024, 15019 : 215 - 229
  • [10] FASTER-CE: Fast, Sparse, Transparent, and Robust Counterfactual Explanations
    Sharma, Shubham
    Gee, Alan
    Henderson, Jette
    Ghosh, Joydeep
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 183 - 196