Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives

被引:0
|
作者
Dhurandhar, Amit [1 ]
Chen, Pin-Yu [1 ]
Luss, Ronny [1 ]
Tu, Chun-Chen [2 ]
Ting, Paishun [2 ]
Shanmugam, Karthikeyan [1 ]
Das, Payel [1 ]
机构
[1] IBM Res, Yorktown Hts, NY 10598 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel method that provides contrastive explanations justifying the classification of an input by a black box classifier such as a deep neural network. Given an input we find what should be minimally and sufficiently present (viz. important object pixels in an image) to justify its classification and analogously what should be minimally and necessarily absent (viz. certain background pixels). We argue that such explanations are natural for humans and are used commonly in domains such as health care and criminology. What is minimally but critically absent is an important part of an explanation, which to the best of our knowledge, has not been explicitly identified by current explanation methods that explain predictions of neural networks. We validate our approach on three real datasets obtained from diverse domains; namely, a handwritten digits dataset MNIST, a large procurement fraud dataset and a brain activity strength dataset. In all three cases, we witness the power of our approach in generating precise explanations that are also easy for human experts to understand and evaluate.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Achievable Minimally-Contrastive Counterfactual Explanations
    Barzekar, Hosein
    McRoy, Susan
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (03): : 922 - 936
  • [32] Evaluative Item-Contrastive Explanations in Rankings
    Castelnovo, Alessandro
    Crupi, Riccardo
    Mombelli, Nicolo
    Nanino, Gabriele
    Regoli, Daniele
    [J]. COGNITIVE COMPUTATION, 2024,
  • [33] Model Agnostic Contrastive Explanations for Classification Models
    Dhurandhar, Amit
    Pedapati, Tejaswini
    Balakrishnan, Avinash
    Chen, Pin-Yu
    Shanmugam, Karthikeyan
    Puri, Ruchir
    [J]. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2024, 14 (04) : 789 - 798
  • [34] Prompting Contrastive Explanations for Commonsense Reasoning Tasks
    Paranjape, Bhargavi
    Michael, Julian
    Ghazvininejad, Marjan
    Hajishirzi, Hannaneh
    Zettlemoyer, Luke
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4179 - 4192
  • [35] Contrastive Explanations of Plans Through Model Restrictions
    Krarup, Benjamin
    Krivic, Senka
    Magazzeni, Daniele
    Long, Derek
    Cashmore, Michael
    Smith, David E.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 72 : 533 - 612
  • [36] "Why Not Other Classes?": Towards Class-Contrastive Back-Propagation Explanations
    Wang, Yipei
    Wang, Xiaoqian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [37] Towards Analogy-Based Explanations in Machine Learning
    Huellermeier, Eyke
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2020), 2020, 12256 : 205 - 217
  • [38] Towards the Unification and Robustness of Perturbation and Gradient Based Explanations
    Agarwal, Sushant
    Jabbari, Shahin
    Agarwal, Chirag
    Upadhyay, Sohini
    Wu, Zhiwei Steven
    Lakkaraju, Himabindu
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [39] Towards Context-Based Explanations for Teacher Support
    Edman, Anneli
    Lundstrom, Jenny Eriksson
    Akrawi, Narin
    [J]. MODELING AND USING CONTEXT, 2011, 6967 : 97 - 103
  • [40] Contrastive Explanations to Classification Systems Using Sparse Dictionaries
    Apicella, A.
    Isgro, F.
    Prevete, R.
    Tamburrini, G.
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 207 - 218