Corpus-level and Concept-based Explanations for Interpretable Document Classification

被引:2
|
作者
Shi, Tian [1 ]
Zhang, Xuchao [1 ]
Wang, Ping [1 ]
Reddy, Chandan K. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation;
D O I
10.1145/3477539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Unsupervised Interpretable Basis Extraction for Concept-Based Visual Explanations
    Doumanoglou A.
    Asteriadis S.
    Zarpalas D.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1496 - 1510
  • [2] CONCEPT-BASED CLASSIFICATION FOR MULTI-DOCUMENT SUMMARIZATION
    Celikyilmaz, Asli
    Hakkani-Tuer, Dilek
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5540 - 5543
  • [3] Towards Automatic Concept-based Explanations
    Ghorbani, Amirata
    Wexler, James
    Zou, James
    Kim, Been
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Concept-Based Document Classification Using Wikipedia and Value Function
    Malo, Pekka
    Sinha, Ankur
    Wallenius, Jyrki
    Korhonen, Pekka
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (12): : 2496 - 2511
  • [5] Concept Activation Regions: A Generalized Framework For Concept-Based Explanations
    Crabbe, Jonathan
    van der Schaar, Mihaela
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Concept-based Explanations for Out-of-Distribution Detectors
    Choi, Jihye
    Raghuram, Jayaram
    Feng, Ryan
    Chen, Jiefeng
    Jha, Somesh
    Prakash, Atul
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [7] Concept-based document recommendations for CiteSeer authors
    Chandrasekaran, Karman
    Gauch, Susan
    Lakkaraju, Praveen
    Luong, Hiep Phuc
    ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, 2008, 5149 : 83 - +
  • [8] Learning a concept-based document similarity measure
    Huang, Lan
    Milne, David
    Frank, Eibe
    Witten, Ian H.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (08): : 1593 - 1608
  • [9] GlanceNets: Interpretable, Leak-proof Concept-based Models
    Marconato, Emanuele
    Passerini, Andrea
    Teso, Stefano
    NEURAL-SYMBOLIC LEARNING AND REASONING 2023, NESY 2023, 2023,
  • [10] GlanceNets: Interpretable, Leak-proof Concept-based Models
    Marconato, Emanuele
    Passerini, Andrea
    Teso, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,