EXPLAN: Explaining Black-box Classifiers using Adaptive Neighborhood Generation

被引:9
|
作者
Rasouli, Peyman [1 ]
Yu, Ingrid Chieh [1 ]
机构
[1] Univ Oslo, Dept Informat, Oslo, Norway
关键词
XAI; Interpretable Machine Learning; Perturbation-based Explanation Methods; Data Sampling; CLASSIFICATION;
D O I
10.1109/ijcnn48605.2020.9206710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining a representative locality is an urgent challenge in perturbation-based explanation methods, which influences the fidelity and soundness of explanations. We address this issue by proposing a robust and intuitive approach for EXPLaining black-box classifiers using Adaptive Neighborhood generation (EXPLAN). EXPLAN is a module-based algorithm consisted of dense data generation, representative data selection, data balancing, and rule-based interpretable model. It takes into account the adjacency information derived from the black-box decision function and the structure of the data for creating a representative neighborhood for the instance being explained. As a local model-agnostic explanation method, EXPLAN generates explanations in the form of logical rules that are highly interpretable and well-suited for qualitative analysis of the model's behavior. We discuss fidelity-interpretability trade-offs and demonstrate the performance of the proposed algorithm by a comprehensive comparison with state-of-the-art explanation methods LIME, LORE, and Anchor. The conducted experiments on real-world data sets show our method achieves solid empirical results in terms of fidelity, precision, and stability of explanations.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Defending Black-Box Skeleton-Based Human Activity Classifiers
    Wang, He
    Diao, Yunfeng
    Tan, Zichang
    Guo, Guodong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2546 - 2554
  • [42] Experimental Study on Generating Multi-modal Explanations of Black-box Classifiers in terms of Gray-box Classifiers
    Alonso, Jose M.
    Toja-Alamancos, J.
    Bugarin, A.
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [43] Explaining black-box classifiers using post-hoc explanations-by-example: The effect of explanations and error-rates in XAI user studies
    Kenny, Eoin M.
    Ford, Courtney
    Quinn, Molly
    Keane, Mark T.
    ARTIFICIAL INTELLIGENCE, 2021, 294
  • [44] A Rate-Distortion Framework for Explaining Black-Box Model Decisions
    Kolek, Stefan
    Nguyen, Duc Anh
    Levie, Ron
    Bruna, Joan
    Kutyniok, Gitta
    XXAI - BEYOND EXPLAINABLE AI: International Workshop, Held in Conjunction with ICML 2020, July 18, 2020, Vienna, Austria, Revised and Extended Papers, 2022, 13200 : 91 - 115
  • [45] Adaptive Hyperparameter Tuning for Black-box LiDAR Odometry
    Koide, Kenji
    Yokozuka, Masashi
    Oishi, Shuji
    Banno, Atsuhiko
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7708 - 7714
  • [46] Black-box composition does not imply adaptive security
    Myers, S
    ADVANCES IN CRYPTOLOGY - EUROCRYPT 2004, PROCEEDINGS, 2004, 3027 : 189 - 206
  • [47] Adaptive hyperparameter optimization for black-box adversarial attack
    Zhenyu Guan
    Lixin Zhang
    Bohan Huang
    Bihe Zhao
    Song Bian
    International Journal of Information Security, 2023, 22 : 1765 - 1779
  • [48] EXPLAINABLE AI FOR DATA FARMING OUTPUT ANALYSIS: A USE CASE FOR KNOWLEDGE GENERATION THROUGH BLACK-BOX CLASSIFIERS
    Feldkamp, Niclas
    Genath, Jonas
    Strassburger, Steffen
    2022 WINTER SIMULATION CONFERENCE (WSC), 2022, : 1152 - 1163
  • [49] Adaptive hyperparameter optimization for black-box adversarial attack
    Guan, Zhenyu
    Zhang, Lixin
    Huang, Bohan
    Zhao, Bihe
    Bian, Song
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 22 (06) : 1765 - 1779
  • [50] Adaptive control of black-box nonlinear systems using recurrent neural networks
    Li, MZ
    Wang, FL
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 4165 - 4170