Interpreting Deep Models for Text Analysis via Optimization and Regularization Methods

被引:0
|
作者
Yuan, Hao [1 ]
Chen, Yongjun [1 ]
Hu, Xia [2 ]
Ji, Shuiwang [2 ]
机构
[1] Washington State Univ, Pullman, WA 99164 USA
[2] Texas A&M Univ, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpreting deep neural networks is of great importance to understand and verify deep models for natural language processing (NLP) tasks. However, most existing approaches only focus on improving the performance of models but ignore their interpretability. In this work, we propose an approach to investigate the meaning of hidden neurons of the convolutional neural network (CNN) models. We first employ saliency map and optimization techniques to approximate the detected information of hidden neurons from input sentences. Then we develop regularization terms and explore words in vocabulary to interpret such detected information. Experimental results demonstrate that our approach can identify meaningful and reasonable interpretations for hidden spatial locations. Additionally, we show that our approach can describe the decision procedure of deep NLP models.
引用
收藏
页码:5717 / 5724
页数:8
相关论文
共 50 条
  • [31] IDC: quantitative evaluation benchmark of interpretation methods for deep text classification models
    Mohammed Khaleel
    Lei Qi
    Wallapak Tavanapong
    Johnny Wong
    Adisak Sukul
    David A. M. Peterson
    Journal of Big Data, 9
  • [32] Clustering Analysis via Deep Generative Models With Mixture Models
    Yang, Lin
    Fan, Wentao
    Bouguila, Nizar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 340 - 350
  • [33] Comparative Analysis of Using Different Text Features, Models, and Methods in Text Author Recognition
    Azimov, R. B.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2024, 60 (05) : 711 - 725
  • [34] On Explicit Curvature Regularization in Deep Generative Models
    Lee, Yonghyeon
    Park, Frank C.
    TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221, 2023, 221
  • [35] Interpreting Deep Learning Models for Multimodal Neuroimaging
    Mueller, K. R.
    Hofmann, S. M.
    2023 11TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI, 2023,
  • [36] Interpreting Deep Learning Models for Knowledge Tracing
    Yu Lu
    Deliang Wang
    Penghe Chen
    Qinggang Meng
    Shengquan Yu
    International Journal of Artificial Intelligence in Education, 2023, 33 : 519 - 542
  • [37] Methods for interpreting and understanding deep neural networks
    Montavon, Gregoire
    Samek, Wojciech
    Mueller, Klaus-Robert
    DIGITAL SIGNAL PROCESSING, 2018, 73 : 1 - 15
  • [38] Interpreting Deep Learning Models for Knowledge Tracing
    Lu, Yu
    Wang, Deliang
    Chen, Penghe
    Meng, Qinggang
    Yu, Shengquan
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2023, 33 (03) : 519 - 542
  • [39] Interpreting deep learning models for weak lensing
    Matilla, Jose Manuel Zorrilla
    Sharma, Manasi
    Hsu, Daniel
    Haiman, Zoltan
    PHYSICAL REVIEW D, 2020, 102 (12)
  • [40] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
    Onur Karasoy
    Serkan Ballı
    Arabian Journal for Science and Engineering, 2022, 47 : 9361 - 9377