Interpreting Deep Models for Text Analysis via Optimization and Regularization Methods

Cited by: 0
Authors
Yuan, Hao [1]
Chen, Yongjun [1]
Hu, Xia [2]
Ji, Shuiwang [2]
Affiliations
[1] Washington State Univ, Pullman, WA 99164 USA
[2] Texas A&M Univ, College Stn, TX 77843 USA
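Funding
National Science Foundation (US);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes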
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Interpreting deep neural networks is of great importance for understanding and verifying deep models for natural language processing (NLP) tasks. However, most existing approaches focus only on improving model performance and ignore interpretability. In this work, we propose an approach to investigate the meaning of hidden neurons in convolutional neural network (CNN) models. We first employ saliency maps and optimization techniques to approximate the information that hidden neurons detect from input sentences. We then develop regularization terms and search over the words in the vocabulary to interpret this detected information. Experimental results demonstrate that our approach can identify meaningful and reasonable interpretations for hidden spatial locations. Additionally, we show that our approach can describe the decision procedure of deep NLP models.
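The two steps named in the abstract (saliency, then regularized optimization followed by a vocabulary lookup) can be illustrated on a toy problem. The following is a minimal sketch under stated assumptions, not the authors' implementation: the four-word vocabulary, the 2-dimensional embeddings, the single width-2 convolution filter, the L2 penalty, and every function name are hypothetical choices made for illustration only. The record does not say which regularization terms the paper actually uses.

```python
import numpy as np

# Toy setup (all values are illustrative assumptions, not from the paper).
VOCAB = ["good", "bad", "movie", "the"]
E = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.1, 0.1]])  # word embeddings
W = np.array([[2.0, 0.0], [0.0, 1.0]])  # one conv filter of width 2
b = 0.0                                 # filter bias
x = E[[3, 0, 2]]                        # embedded sentence "the good movie"

def pre_activation(x, W, b, t):
    """Filter response at spatial location t, before the ReLU."""
    k = W.shape[0]
    return float(np.sum(W * x[t:t + k]) + b)

def saliency(x, W, b, t):
    """Per-word saliency: L2 norm of d(ReLU activation)/d(word embedding)."""
    k = W.shape[0]
    grad = np.zeros_like(x)
    if pre_activation(x, W, b, t) > 0:  # ReLU passes gradient only when active
        grad[t:t + k] = W               # gradient of a linear filter is W itself
    return np.linalg.norm(grad, axis=1)

def optimize_input(W, lam=0.5, lr=0.1, steps=200):
    """Gradient ascent on a free input window z for sum(W*z) - lam*||z||^2.

    The pre-activation is maximized instead of the ReLU output so that the
    gradient does not vanish at the all-zero starting point.
    """
    z = np.zeros_like(W)
    for _ in range(steps):
        z += lr * (W - 2.0 * lam * z)   # gradient of the penalized objective
    return z

def nearest_words(z, E, vocab):
    """Map each row of z to the vocabulary word with highest cosine similarity."""
    zn = z / (np.linalg.norm(z, axis=1, keepdims=True) + 1e-12)
    En = E / (np.linalg.norm(E, axis=1, keepdims=True) + 1e-12)
    return [vocab[i] for i in np.argmax(zn @ En.T, axis=1)]

print(saliency(x, W, b, t=1))
print(nearest_words(optimize_input(W), E, VOCAB))
```

In this sketch the saliency scores indicate which input words a hidden location responds to, while the optimized window, once mapped back to its nearest vocabulary words, gives a readable description of what that location detects. A real CNN would require automatic differentiation instead of the hand-derived gradient used here.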
Pages: 5717 - 5724
Number of pages: 8
Related papers
50 in total
  • [21] Unbounded Bayesian Optimization via Regularization
    Shahriari, Bobak
    Bouchard-Cote, Alexandre
    de Freitas, Nando
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 1168 - 1176
  • [22] Optimization of deep learning models: benchmark and analysis
    Rasheed Ahmad
    Izzat Alsmadi
    Mohammad Al-Ramahi
    Advances in Computational Intelligence, 2023, 3 (2):
  • [23] Tuning regularization via scenario optimization
    Formentin, Simone
    Garatti, Simone
    Campi, Marco C.
    Savaresi, Sergio M.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [24] Interpreting clusters via prototype optimization
    Carrizosa, Emilio
    Kurishchenko, Kseniia
    Marin, Alfredo
    Morales, Dolores Romero
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2022, 107
  • [25] THE FRAGMENTA'S TIMELINE MODELS FOR RECONSTRUCTING AND INTERPRETING THE TEXT
    Magni, Isabella
    MEDIAEVALIA-AN INTERDISCIPLINARY JOURNAL OF MEDIEVAL STUDIES WORLDWIDE, 2018, 39 : 319 - 343
  • [26] Interpretability of deep learning models in analysis of Spanish financial text
    César Vaca
    Manuel Astorgano
    Alfonso J. López-Rivero
    Fernando Tejerina
    Benjamín Sahelices
    Neural Computing and Applications, 2024, 36 : 7509 - 7527
  • [27] Blind deblurring text images via Beltrami regularization
    Gao, Haijun
    Feng, Minfu
    IMAGE AND VISION COMPUTING, 2024, 147
  • [28] Comparative Analysis of Deep Learning Models for Myanmar Text Classification
    Phyu, Myat Sapal
    Nwet, Khin Thandar
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2020), PT I, 2020, 12033 : 76 - 85
  • [29] Interpretability of deep learning models in analysis of Spanish financial text
    Vaca, Cesar
    Astorgano, Manuel
    Lopez-Rivero, Alfonso J.
    Tejerina, Fernando
    Sahelices, Benjamin
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): 7509 - 7527
  • [30] IDC: quantitative evaluation benchmark of interpretation methods for deep text classification models
    Khaleel, Mohammed
    Qi, Lei
    Tavanapong, Wallapak
    Wong, Johnny
    Sukul, Adisak
    Peterson, David A. M.
    JOURNAL OF BIG DATA, 2022, 9 (01)