EXPLAN: Explaining Black-box Classifiers using Adaptive Neighborhood Generation

被引:9
|
作者
Rasouli, Peyman [1 ]
Yu, Ingrid Chieh [1 ]
机构
[1] Univ Oslo, Dept Informat, Oslo, Norway
关键词
XAI; Interpretable Machine Learning; Perturbation-based Explanation Methods; Data Sampling; CLASSIFICATION;
D O I
10.1109/ijcnn48605.2020.9206710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining a representative locality is an urgent challenge in perturbation-based explanation methods, which influences the fidelity and soundness of explanations. We address this issue by proposing a robust and intuitive approach for EXPLaining black-box classifiers using Adaptive Neighborhood generation (EXPLAN). EXPLAN is a module-based algorithm consisted of dense data generation, representative data selection, data balancing, and rule-based interpretable model. It takes into account the adjacency information derived from the black-box decision function and the structure of the data for creating a representative neighborhood for the instance being explained. As a local model-agnostic explanation method, EXPLAN generates explanations in the form of logical rules that are highly interpretable and well-suited for qualitative analysis of the model's behavior. We discuss fidelity-interpretability trade-offs and demonstrate the performance of the proposed algorithm by a comprehensive comparison with state-of-the-art explanation methods LIME, LORE, and Anchor. The conducted experiments on real-world data sets show our method achieves solid empirical results in terms of fidelity, precision, and stability of explanations.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Composition of relational features with an application to explaining black-box predictors
    Ashwin Srinivasan
    A. Baskar
    Tirtharaj Dash
    Devanshu Shah
    Machine Learning, 2024, 113 : 1091 - 1132
  • [32] Generating Causal Hypotheses for Explaining Black-Box Industrial Processes
    Balzereit, Kaja
    Diedrich, Alexander
    Kubus, Daniel
    Ginster, Jonas
    Bunte, Andreas
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2022,
  • [33] Composition of relational features with an application to explaining black-box predictors
    Srinivasan, Ashwin
    Baskar, A.
    Dash, Tirtharaj
    Shah, Devanshu
    MACHINE LEARNING, 2024, 113 (03) : 1091 - 1132
  • [34] Cluster-Explorer: Explaining Black-Box Clustering Results
    Tutay, Sariel
    Somech, Amit
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5106 - 5110
  • [35] Explaining Black-Box Classifiers with ILP - Empowering LIME with Aleph to Approximate Non-linear Decisions with Relational Rules
    Rabold, Johannes
    Siebers, Michael
    Schmid, Ute
    INDUCTIVE LOGIC PROGRAMMING (ILP 2018), 2018, 11105 : 105 - 117
  • [36] Black-Box Audio Adversarial Example Generation Using Variational Autoencoder
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT II, 2021, 12919 : 142 - 160
  • [37] Universal Perturbation Generation for Black-box Attack Using Evolutionary Algorithms
    Wang, Siyu
    Shi, Yucheng
    Han, Yahong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1277 - 1282
  • [38] A Practical Black-Box Attack on Source Code Authorship Identification Classifiers
    Liu, Qianjun
    Ji, Shouling
    Liu, Changchang
    Wu, Chunming
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 3620 - 3633
  • [39] Best-Effort Adversarial Approximation of Black-Box Malware Classifiers
    Ali, Abdullah
    Eshete, Birhanu
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT I, 2020, 335 : 318 - 338
  • [40] Black-Box Adversarial Attack for Deep Learning Classifiers in IoT Applications
    Singh, Abhijit
    Sikdar, Biplab
    2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,