CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations

被引：2

作者：

Guo, Hangzhi ^{[1
]}

Nguyen, Thanh H. ^{[2
]}

Yadav, Amulya ^{[1
]}

机构：

[1] Penn State Univ, University Pk, PA 16802 USA

[2] Univ Oregon, Eugene, OR 97403 USA

来源：

PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年

关键词：

Counterfactual Explanation; Algorithmic Recourse; Explainable Artificial Intelligence; Interpretability;

D O I：

10.1145/3580305.3599290

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work presents CounterNet, a novel end-to-end learning framework which integrates Machine Learning (ML) model training and the generation of corresponding counterfactual (CF) explanations into a single end-to-end pipeline. Counterfactual explanations offer a contrastive case, i.e., they attempt to find the smallest modification to the feature values of an instance that changes the prediction of the ML model on that instance to a predefined output. Prior techniques for generating CF explanations suffer from two major limitations: (i) all of them are post-hoc methods designed for use with proprietary ML models - as a result, their procedure for generating CF explanations is uninformed by the training of the ML model, which leads to misalignment between model predictions and explanations; and (ii) most of them rely on solving separate time-intensive optimization problems to find CF explanations for each input data point (which negatively impacts their runtime). This work makes a novel departure from the prevalent post-hoc paradigm (of generating CF explanations) by presenting CounterNet, an end-to-end learning framework which integrates predictive model training and the generation of counterfactual (CF) explanations into a single pipeline. Unlike post-hoc methods, CounterNet enables the optimization of the CF explanation generation only once together with the predictive model. We adopt a block-wise coordinate descent procedure which helps in effectively training CounterNet's network. Our extensive experiments on multiple real-world datasets show that CounterNet generates high-quality predictions, and consistently achieves 100% CF validity and low proximity scores (thereby achieving a well-balanced cost-invalidity trade-off) for any new input instance, and runs 3X faster than existing state-of-the-art baselines.

引用

页码：577 / 589

页数：13

共 50 条

[31] Training Traffic Light Behavior with End-to-End Learning
Wildi, Mael
Alahi, Alexandre
Visser, Arnoud
INTELLIGENT AUTONOMOUS SYSTEMS 17, IAS-17, 2023, 577 : 753 - 764
[32] Training neural networks with end-to-end optical backpropagation
James Spall
Xianxin Guo
Alexander ILvovsky
Advanced Photonics, 2025, 7 (01) : 35 - 44
[33] Gigapixel end-to-end training using streaming and attention
Dooper, Stephan
Pinckaers, Hans
Aswolinskiy, Witali
Hebeda, Konnie
Jarkman, Sofia
van der Laak, Jeroen
Litjens, Geert
BIGPICTURE Consortium
MEDICAL IMAGE ANALYSIS, 2023, 88
[34] End-to-end Optimization of Machine Learning Prediction Queries
Park, Kwanghyun
Saur, Karla
Banda, Dalitso
Interlandi, Rathijit Sen Matteo
Karanasos, Konstantinos
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 587 - 601
[35] Risk Guarantees for End-to-End Prediction and Optimization Processes
Nam Ho-Nguyen
Kilinc-Karzan, Fatma
MANAGEMENT SCIENCE, 2022, 68 (12) : 8680 - 8698
[36] End-to-End Learning for Prediction and Optimization with Gradient Boosting
Konishi, Takuya
Fukunaga, Takuro
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT III, 2021, 12459 : 191 - 207
[37] End-to-End Prediction of EGFR Mutation Status With Denseformer
Zhao, Shijie
Li, Wenyuan
Liu, Zhuoyan
Pang, Tianji
Yang, Yang
Qiang, Ning
Zhao, Jingyi
Li, Bangguo
Lei, Baiying
Han, Junwei
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (01) : 54 - 65
[38] Prediction, scenarios and insight: The uses of an end-to-end model
Steele, John H.
PROGRESS IN OCEANOGRAPHY, 2012, 102 : 67 - 73
[39] HazDesNet: An End-to-End Network for Haze Density Prediction
Zhang, Jiahe
Min, Xiongkuo
Zhu, Yucheng
Zhai, Guangtao
Zhou, Jiantao
Yang, Xiaokang
Zhang, Wenjun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3087 - 3102
[40] End-to-end Contextual Perception and Prediction with Interaction Transformer
Li, Lingyun Luke
Yang, Bin
Liang, Ming
Zeng, Wenyuan
Ren, Mengye
Segal, Sean
Urtasun, Raquel
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5784 - 5791

← 1 2 3 4 5 →