Corpus-level and Concept-based Explanations for Interpretable Document Classification

被引：2

作者：

Shi, Tian ^{[1
]}

Zhang, Xuchao ^{[1
]}

Wang, Ping ^{[1
]}

Reddy, Chandan K. ^{[1
]}

机构：

[1] Virginia Tech, Blacksburg, VA 24061 USA

来源：

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA | 2022年 / 16卷 / 03期

基金：

美国国家科学基金会;

关键词：

Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation;

D O I：

10.1145/3477539

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.

引用

页数：17

共 50 条

[31] Invertible Concept-based Explanations for CNN Models with Non-negative Concept Activation Vectors
Zhang, Ruihan
Madumal, Prashan
Miller, Tim
Ehinger, Krista A.
Rubinstein, Benjamin I. P.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11682 - 11690
[32] Unlocking the Black Box: Concept-Based Modeling for Interpretable Affective Computing Applications
Li, Xinyu
Mahmoud, Marwa
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
[33] DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
Moreira, Ricardo
Bono, Jacopo
Cardoso, Mario
Saleiro, Pedro
Figueiredo, Mario
Bizarro, Pedro
CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 740 - 768
[34] A concept-based interpretable model for the diagnosis of choroid neoplasias using multimodal data
Yifan Wu
Yang Liu
Yue Yang
Michael S. Yao
Wenli Yang
Xuehui Shi
Lihong Yang
Dongjun Li
Yueming Liu
Shiyi Yin
Chunyan Lei
Meixia Zhang
James C. Gee
Xuan Yang
Wenbin Wei
Shi Gu
Nature Communications, 16 (1)
[35] A semi-supervised framework for concept-based hierarchical document clustering
Sadjadi, Seyed Mojtaba
Mashayekhi, Hoda
Hassanpour, Hamid
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (06): : 3861 - 3890
[36] GENERATING, INTEGRATING, AND ACTIVATING THESAURI FOR CONCEPT-BASED DOCUMENT-RETRIEVAL
CHEN, HC
LYNCH, KJ
BASU, K
NG, TD
IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1993, 8 (02): : 25 - 34
[37] A semi-supervised framework for concept-based hierarchical document clustering
Seyed Mojtaba Sadjadi
Hoda Mashayekhi
Hamid Hassanpour
World Wide Web, 2023, 26 : 3861 - 3890
[38] Neural Concept Map Generation for Effective Document Classification with Interpretable Structured Summarization
Yang, Carl
Zhang, Jieyu
Wang, Haonan
Li, Bangzheng
Han, Jiawei
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1629 - 1632
[39] Hierarchical document categorization with k-NN and concept-based thesauri
Bang, SL
Yang, JD
Yang, HJ
INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (02) : 387 - 406
[40] Concept-based Topic Attention for a Convolutional Sequence Document Summarization Model
Khanam, Shirin Akther
Liu, Fei
Chen, Yi-Ping Phoebe
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,

← 1 2 3 4 5 →