Explaining A Black-box By Using A Deep Variational Information Bottleneck Approach

被引:0
|
作者
Bang, Seojin [1 ,2 ]
Xie, Pengtao [2 ,3 ]
Lee, Heewook [4 ]
Wu, Wei [1 ]
Xing, Eric [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Petuum Inc, Pittsburgh, PA USA
[3] Univ Calif San Diego, La Jolla, CA 92093 USA
[4] Arizona State Univ, Tempe, AZ 85287 USA
关键词
DATABASE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpretable machine learning has gained much attention recently. Briefness and comprehensiveness are necessary in order to provide a large amount of information concisely when explaining a black-box decision system. However, existing interpretable machine learning methods fail to consider briefness and comprehensiveness simultaneously, leading to redundant explanations. We propose the variational information bottleneck for interpretation, VIBI, a system-agnostic interpretable method that provides a brief but comprehensive explanation. VIBI adopts an information theoretic principle, information bottleneck principle, as a criterion for finding such explanations. For each instance, VIBI selects key features that are maximally compressed about an input (briefness), and informative about a decision made by a black-box system on that input (comprehensive). We evaluate VIBI on three datasets and compare with state-of-the-art interpretable machine learning methods in terms of both interpretability and fidelity evaluated by human and quantitative metrics.
引用
收藏
页码:11396 / 11404
页数:9
相关论文
共 50 条
  • [1] Explaining the black-box smoothly-A counterfactual approach
    Singla, Sumedha
    Eslami, Motahhare
    Pollack, Brian
    Wallace, Stephen
    Batmanghelich, Kayhan
    [J]. MEDICAL IMAGE ANALYSIS, 2023, 84
  • [2] Explaining Black-Box Models Using Interpretable Surrogates
    Kuttichira, Deepthi Praveenlal
    Gupta, Sunil
    Li, Cheng
    Rana, Santu
    Venkatesh, Svetha
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 3 - 15
  • [3] Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals
    Galhotra, Sainyam
    Pradhan, Romila
    Salimi, Babak
    [J]. SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 577 - 590
  • [4] On the Convergence of Black-Box Variational Inference
    Kim, Kyurae
    Oh, Jisu
    Wu, Kaiwen
    Ma, Yi-An
    Gardner, Jacob R.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Explaining black-box classifiers: Properties and functions
    Amgoud, Leila
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 155 : 40 - 65
  • [6] Explaining Black-box Classification Models with Arguments
    Amgoud, Leila
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 791 - 795
  • [7] Explaining Black-box Android Malware Detection
    Melis, Marco
    Maiorca, Davide
    Biggio, Battista
    Giacinto, Giorgio
    Roli, Fabio
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 524 - 528
  • [8] EXPLAN: Explaining Black-box Classifiers using Adaptive Neighborhood Generation
    Rasouli, Peyman
    Yu, Ingrid Chieh
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] DDImage: an image reduction based approach for automatically explaining black-box classifiers
    Jiang, Mingyue
    Tang, Chengjian
    Zhang, Xiao-Yi
    Zhao, Yangyang
    Ding, Zuohua
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (05)
  • [10] A MODEL DISTILLATION APPROACH FOR EXPLAINING BLACK-BOX MODELS FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Taskin, Gulsen
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3592 - 3595