Corpus-level and Concept-based Explanations for Interpretable Document Classification

被引:2
|
作者
Shi, Tian [1 ]
Zhang, Xuchao [1 ]
Wang, Ping [1 ]
Reddy, Chandan K. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation;
D O I
10.1145/3477539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Invertible Concept-based Explanations for CNN Models with Non-negative Concept Activation Vectors
    Zhang, Ruihan
    Madumal, Prashan
    Miller, Tim
    Ehinger, Krista A.
    Rubinstein, Benjamin I. P.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11682 - 11690
  • [32] Unlocking the Black Box: Concept-Based Modeling for Interpretable Affective Computing Applications
    Li, Xinyu
    Mahmoud, Marwa
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [33] DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
    Moreira, Ricardo
    Bono, Jacopo
    Cardoso, Mario
    Saleiro, Pedro
    Figueiredo, Mario
    Bizarro, Pedro
    CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 740 - 768
  • [34] A concept-based interpretable model for the diagnosis of choroid neoplasias using multimodal data
    Yifan Wu
    Yang Liu
    Yue Yang
    Michael S. Yao
    Wenli Yang
    Xuehui Shi
    Lihong Yang
    Dongjun Li
    Yueming Liu
    Shiyi Yin
    Chunyan Lei
    Meixia Zhang
    James C. Gee
    Xuan Yang
    Wenbin Wei
    Shi Gu
    Nature Communications, 16 (1)
  • [35] A semi-supervised framework for concept-based hierarchical document clustering
    Sadjadi, Seyed Mojtaba
    Mashayekhi, Hoda
    Hassanpour, Hamid
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (06): : 3861 - 3890
  • [36] GENERATING, INTEGRATING, AND ACTIVATING THESAURI FOR CONCEPT-BASED DOCUMENT-RETRIEVAL
    CHEN, HC
    LYNCH, KJ
    BASU, K
    NG, TD
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1993, 8 (02): : 25 - 34
  • [37] A semi-supervised framework for concept-based hierarchical document clustering
    Seyed Mojtaba Sadjadi
    Hoda Mashayekhi
    Hamid Hassanpour
    World Wide Web, 2023, 26 : 3861 - 3890
  • [38] Neural Concept Map Generation for Effective Document Classification with Interpretable Structured Summarization
    Yang, Carl
    Zhang, Jieyu
    Wang, Haonan
    Li, Bangzheng
    Han, Jiawei
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1629 - 1632
  • [39] Hierarchical document categorization with k-NN and concept-based thesauri
    Bang, SL
    Yang, JD
    Yang, HJ
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (02) : 387 - 406
  • [40] Concept-based Topic Attention for a Convolutional Sequence Document Summarization Model
    Khanam, Shirin Akther
    Liu, Fei
    Chen, Yi-Ping Phoebe
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,