Corpus-level and Concept-based Explanations for Interpretable Document Classification

Cited by: 2

Authors:

Shi, Tian [1]

Zhang, Xuchao [1]

Wang, Ping [1]

Reddy, Chandan K. [1]

Affiliations:

[1] Virginia Tech, Blacksburg, VA 24061 USA

Source:

ACM Transactions on Knowledge Discovery from Data | 2022, Vol. 16, No. 3

Funding:

US National Science Foundation

Keywords:

Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation

DOI:

10.1145/3477539

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Subject classification code:

0812

Abstract:

Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.
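The corpus-level idea in the abstract, scoring keywords by their attention weights across many documents rather than one heat-map at a time, can be illustrated with a minimal sketch. This is not the authors' method or the AAN model: the function name `corpus_keyword_importance`, the input format, and the simple sum-and-normalize scoring are all assumptions made here for illustration, and the toy attention weights are invented.

```python
from collections import defaultdict


def corpus_keyword_importance(docs, top_k=3):
    """Rank tokens per predicted label by summed attention weight.

    docs: iterable of (tokens, attention_weights, predicted_label).
    Hypothetical helper; the scoring rule is an illustrative assumption.
    """
    totals = defaultdict(lambda: defaultdict(float))
    for tokens, weights, label in docs:
        if len(tokens) != len(weights):
            raise ValueError("tokens and weights must have the same length")
        for token, weight in zip(tokens, weights):
            totals[label][token] += weight

    ranked = {}
    for label, scores in totals.items():
        norm = sum(scores.values())
        if norm == 0:
            continue  # no attention mass recorded for this label
        # Sort by descending score, breaking ties alphabetically.
        ordered = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        ranked[label] = [(tok, s / norm) for tok, s in ordered[:top_k]]
    return ranked


# Toy corpus with made-up attention weights (each document sums to 1).
toy_corpus = [
    (["the", "food", "was", "great"], [0.05, 0.15, 0.05, 0.75], "positive"),
    (["great", "service"], [0.8, 0.2], "positive"),
    (["terrible", "food"], [0.9, 0.1], "negative"),
]
result = corpus_keyword_importance(toy_corpus, top_k=2)
print(result)
```

Aggregating over the corpus means a keyword ranks highly only if it attracts attention consistently for a given label, which is the intuition behind preferring corpus-level scores to single-document heat-maps.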

Pages: 17
