Explainable AI for Mixed Data Clustering

被引:0
|
作者
Amling, Jonas [2 ]
Scheele, Stephan [1 ]
Slany, Emanuel [3 ]
Lang, Moritz [2 ]
Schmid, Ute [1 ]
机构
[1] Univ Bamberg, Bamberg, Germany
[2] Dab Daten Anal & Beratung GmbH, Deggendorf, Germany
[3] Fraunhofer Inst Integrated Circuits IIS, Erlangen, Germany
关键词
XAI; Mixed Data Clustering; Model-Agnostic;
D O I
10.1007/978-3-031-63797-1_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering, an unsupervised machine learning approach, aims to find groups of similar instances. Mixed data clustering is of particular interest since real-life data often consists of diverse data types. The unsupervised nature of clustering emphasizes the need to understand the criteria for defining and distinguishing clusters. Current explainable AI (XAI) methods for clustering focus on intrinsically explainable clustering techniques, surrogate model-based explanations utilizing established XAI frameworks, and explanations generated from inter-instance distances. However, there exists a research gap in developing post-hoc methods that directly explain clusterings without resorting to surrogate models or requiring prior knowledge about the clustering algorithm. Addressing this gap, our work introduces a model-agnostic, entropy-based Feature Importance Score for continuous and discrete data, offering direct and comprehensible explanations by highlighting key features, deriving rules, and identifying cluster prototypes. The comparison with existing XAI frameworks like SHAP and ClAMP shows that we achieve similar fidelity and simplicity, proving that mixed data clusterings can be effectively explained solely from the distributions of the features and assigned clusters, making complex clusterings comprehensible to humans.
引用
收藏
页码:42 / 62
页数:21
相关论文
共 50 条
  • [31] Chess and explainable AI
    Bjornsson, Yngvi
    ICGA JOURNAL, 2024, 46 (02) : 67 - 75
  • [32] CLUSTERING OF VARIABLES FOR MIXED DATA
    Saracco, J.
    Chavent, M.
    STATISTICS FOR ASTROPHYSICS: CLUSTERING AND CLASSIFICATION, 2016, 77 : 121 - 169
  • [33] Fuzzy clustering of mixed data
    D'Urso, Pierpaolo
    Massari, Riccardo
    INFORMATION SCIENCES, 2019, 505 : 513 - 534
  • [34] MAKING AI EXPLAINABLE
    Pingel J.
    New Electronics, 2022, 55 (10): : 30 - 31
  • [35] Logic for Explainable AI
    Darwiche, Adnan
    2023 38TH ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, LICS, 2023,
  • [36] Unexplainable Explainable AI
    Kim, Hyeongjoo
    SYNTHESIS PHILOSOPHICA, 2024, 38 (02) : 275 - 295
  • [37] Explainable AI in Healthcare
    Pawar, Urja
    O'Shea, Donna
    Rea, Susan
    O'Reilly, Ruairi
    2020 INTERNATIONAL CONFERENCE ON CYBER SITUATIONAL AWARENESS, DATA ANALYTICS AND ASSESSMENT (CYBER SA 2020), 2020,
  • [38] Explainable AI in Industry
    Gade, Krishna
    Geyik, Sahin Cem
    Kenthapadi, Krishnaram
    Mithal, Varun
    Taly, Ankur
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 3203 - 3204
  • [39] Integrating Fuzzy C-Means Clustering and Explainable AI for Robust Galaxy Classification
    Diaz, Gabriel Marin
    Medina, Raquel Gomez
    Jimenez, Jose Alberto Aijon
    MATHEMATICS, 2024, 12 (18)
  • [40] Explainable Spatial Clustering: Leveraging Spatial Data in Radiation Oncology
    Wentzel, Andrew
    Canahuate, Guadalupe
    van Dijk, Lisanne, V
    Mohamed, Abdallah S. R.
    Fuller, C. David
    Marai, G. Elisabeta
    2020 IEEE VISUALIZATION CONFERENCE - SHORT PAPERS (VIS 2020), 2020, : 281 - 285