Corpus-level and Concept-based Explanations for Interpretable Document Classification

被引:2
|
作者
Shi, Tian [1 ]
Zhang, Xuchao [1 ]
Wang, Ping [1 ]
Reddy, Chandan K. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation;
D O I
10.1145/3477539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] On Completeness-aware Concept-Based Explanations in Deep Neural Networks
    Yeh, Chih-Kuan
    Kim, Been
    Arik, Sercan O.
    Li, Chun-Liang
    Pfister, Tomas
    Ravikumar, Pradeep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] Concept-Based Lesion Aware Transformer for Interpretable Retinal Disease Diagnosis
    Wen, Chi
    Ye, Mang
    Li, He
    Chen, Ting
    Xiao, Xuan
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 57 - 68
  • [23] Document indexing: a concept-based approach to term weight estimation
    Kang, BY
    Lee, SJ
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (05) : 1065 - 1080
  • [24] A fuzzy-rough method for concept-based document expansion
    Li, Y
    Shiu, SCK
    Pal, SK
    Liu, JNK
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, 2004, 3066 : 699 - 707
  • [25] Concept-based Document Models using Explicit Semantic Analysis
    Luo, Jing
    Meng, Bo
    Tu, Xinhui
    Liu, Maofu
    2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012, : 338 - 342
  • [26] ConceptEVA: Concept-Based Interactive Exploration and Customization of Document Summaries
    Zhang, Xiaoyu
    Li, Jianping Kelvin
    Chi, Po-Wei
    Chandrasegaran, Senthil
    Ma, Kwan-Liu
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023), 2023,
  • [27] Using WordNet for Concept-Based Document Indexing in Information Retrieval
    Boubekeur, Fatiha
    Boughanem, Mohand
    Tamine, Lynda
    Daoud, Mariam
    SEMAPRO 2010: THE FOURTH INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2010, : 151 - 157
  • [28] Concept-Based Label Distribution Learning for Text Classification
    Hui Li
    Guimin Huang
    Yiqun Li
    Xiaowei Zhang
    Yabing Wang
    International Journal of Computational Intelligence Systems, 15
  • [29] Concept-Based Semi-Automatic Classification of Drugs
    Gurulingappa, Harsha
    Kolarik, Corinna
    Hofmann-Apitius, Martin
    Fluck, Juliane
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (08) : 1986 - 1992
  • [30] Concept-Based Label Distribution Learning for Text Classification
    Li, Hui
    Huang, Guimin
    Li, Yiqun
    Zhang, Xiaowei
    Wang, Yabing
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 15 (01)