CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations

被引:2
|
作者
Guo, Hangzhi [1 ]
Nguyen, Thanh H. [2 ]
Yadav, Amulya [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Univ Oregon, Eugene, OR 97403 USA
关键词
Counterfactual Explanation; Algorithmic Recourse; Explainable Artificial Intelligence; Interpretability;
D O I
10.1145/3580305.3599290
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work presents CounterNet, a novel end-to-end learning framework which integrates Machine Learning (ML) model training and the generation of corresponding counterfactual (CF) explanations into a single end-to-end pipeline. Counterfactual explanations offer a contrastive case, i.e., they attempt to find the smallest modification to the feature values of an instance that changes the prediction of the ML model on that instance to a predefined output. Prior techniques for generating CF explanations suffer from two major limitations: (i) all of them are post-hoc methods designed for use with proprietary ML models - as a result, their procedure for generating CF explanations is uninformed by the training of the ML model, which leads to misalignment between model predictions and explanations; and (ii) most of them rely on solving separate time-intensive optimization problems to find CF explanations for each input data point (which negatively impacts their runtime). This work makes a novel departure from the prevalent post-hoc paradigm (of generating CF explanations) by presenting CounterNet, an end-to-end learning framework which integrates predictive model training and the generation of counterfactual (CF) explanations into a single pipeline. Unlike post-hoc methods, CounterNet enables the optimization of the CF explanation generation only once together with the predictive model. We adopt a block-wise coordinate descent procedure which helps in effectively training CounterNet's network. Our extensive experiments on multiple real-world datasets show that CounterNet generates high-quality predictions, and consistently achieves 100% CF validity and low proximity scores (thereby achieving a well-balanced cost-invalidity trade-off) for any new input instance, and runs 3X faster than existing state-of-the-art baselines.
引用
收藏
页码:577 / 589
页数:13
相关论文
共 50 条
  • [31] Training Traffic Light Behavior with End-to-End Learning
    Wildi, Mael
    Alahi, Alexandre
    Visser, Arnoud
    INTELLIGENT AUTONOMOUS SYSTEMS 17, IAS-17, 2023, 577 : 753 - 764
  • [32] Training neural networks with end-to-end optical backpropagation
    James Spall
    Xianxin Guo
    Alexander ILvovsky
    Advanced Photonics, 2025, 7 (01) : 35 - 44
  • [33] Gigapixel end-to-end training using streaming and attention
    Dooper, Stephan
    Pinckaers, Hans
    Aswolinskiy, Witali
    Hebeda, Konnie
    Jarkman, Sofia
    van der Laak, Jeroen
    Litjens, Geert
    BIGPICTURE Consortium
    MEDICAL IMAGE ANALYSIS, 2023, 88
  • [34] End-to-end Optimization of Machine Learning Prediction Queries
    Park, Kwanghyun
    Saur, Karla
    Banda, Dalitso
    Interlandi, Rathijit Sen Matteo
    Karanasos, Konstantinos
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 587 - 601
  • [35] Risk Guarantees for End-to-End Prediction and Optimization Processes
    Nam Ho-Nguyen
    Kilinc-Karzan, Fatma
    MANAGEMENT SCIENCE, 2022, 68 (12) : 8680 - 8698
  • [36] End-to-End Learning for Prediction and Optimization with Gradient Boosting
    Konishi, Takuya
    Fukunaga, Takuro
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT III, 2021, 12459 : 191 - 207
  • [37] End-to-End Prediction of EGFR Mutation Status With Denseformer
    Zhao, Shijie
    Li, Wenyuan
    Liu, Zhuoyan
    Pang, Tianji
    Yang, Yang
    Qiang, Ning
    Zhao, Jingyi
    Li, Bangguo
    Lei, Baiying
    Han, Junwei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (01) : 54 - 65
  • [38] Prediction, scenarios and insight: The uses of an end-to-end model
    Steele, John H.
    PROGRESS IN OCEANOGRAPHY, 2012, 102 : 67 - 73
  • [39] HazDesNet: An End-to-End Network for Haze Density Prediction
    Zhang, Jiahe
    Min, Xiongkuo
    Zhu, Yucheng
    Zhai, Guangtao
    Zhou, Jiantao
    Yang, Xiaokang
    Zhang, Wenjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3087 - 3102
  • [40] End-to-end Contextual Perception and Prediction with Interaction Transformer
    Li, Lingyun Luke
    Yang, Bin
    Liang, Ming
    Zeng, Wenyuan
    Ren, Mengye
    Segal, Sean
    Urtasun, Raquel
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5784 - 5791