EXPLAN: Explaining Black-box Classifiers using Adaptive Neighborhood Generation

被引:9
|
作者
Rasouli, Peyman [1 ]
Yu, Ingrid Chieh [1 ]
机构
[1] Univ Oslo, Dept Informat, Oslo, Norway
关键词
XAI; Interpretable Machine Learning; Perturbation-based Explanation Methods; Data Sampling; CLASSIFICATION;
D O I
10.1109/ijcnn48605.2020.9206710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining a representative locality is an urgent challenge in perturbation-based explanation methods, which influences the fidelity and soundness of explanations. We address this issue by proposing a robust and intuitive approach for EXPLaining black-box classifiers using Adaptive Neighborhood generation (EXPLAN). EXPLAN is a module-based algorithm consisted of dense data generation, representative data selection, data balancing, and rule-based interpretable model. It takes into account the adjacency information derived from the black-box decision function and the structure of the data for creating a representative neighborhood for the instance being explained. As a local model-agnostic explanation method, EXPLAN generates explanations in the form of logical rules that are highly interpretable and well-suited for qualitative analysis of the model's behavior. We discuss fidelity-interpretability trade-offs and demonstrate the performance of the proposed algorithm by a comprehensive comparison with state-of-the-art explanation methods LIME, LORE, and Anchor. The conducted experiments on real-world data sets show our method achieves solid empirical results in terms of fidelity, precision, and stability of explanations.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Explaining Black-Box Models for Biomedical Text Classification
    Moradi, Milad
    Samwald, Matthias
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (08) : 3112 - 3120
  • [22] Black-Box Saliency Map Generation Using Bayesian Optimisation
    Mokuwe, Mamuku
    Burke, Michael
    Bosman, Anna Sergeevna
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [23] Attacking Black-Box Image Classifiers With Particle Swarm Optimization
    Zhang, Quanxin
    Wang, Kunqing
    Zhang, Wenjiao
    Hu, Jingjing
    IEEE ACCESS, 2019, 7 : 158051 - 158063
  • [24] Searching for explanations of black-box classifiers in the space of semantic queries
    Liartis, Jason
    Dervakos, Edmund
    Menis-Mastromichalakis, Orfeas
    Chortaras, Alexandros
    Stamou, Giorgos
    SEMANTIC WEB, 2024, 15 (04) : 1085 - 1126
  • [25] Uncertainty-Based Rejection Wrappers for Black-Box Classifiers
    Mena, Jose
    Pujol, Oriol
    Vitria, Jordi
    IEEE ACCESS, 2020, 8 : 101721 - 101746
  • [26] Generating Black-Box Adversarial Examples for Text Classifiers Using a Deep Reinforced Model
    Vijayaraghavan, Prashanth
    Roy, Deb
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 711 - 726
  • [27] Optimizing Black-box Metrics with Adaptive Surrogates
    Jiang, Qijia
    Adigun, Olaoluwa
    Narasimhan, Harikrishna
    Fard, Mahdi Milani
    Gupta, Maya
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [28] A General Method for Visualizing and Explaining Black-Box Regression Models
    Strumbelj, Erik
    Kononenko, Igor
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT II, 2011, 6594 : 21 - 30
  • [29] Analyzing and Explaining Black-Box Models for Online Malware Detection
    Manthena, Harikha
    Kimmel, Jeffrey C.
    Abdelsalam, Mahmoud
    Gupta, Maanak
    IEEE ACCESS, 2023, 11 : 25237 - 25252
  • [30] Explaining Black-box Predictions by Generating Local Meaningful Perturbations
    Verma, Tejaswani
    Lingenfelder, Christoph
    Klakow, Dietrich
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2022, 16 (01) : 47 - 68