Improving convolutional neural network for text classification by recursive data pruning

Cited by: 27
Authors
Li, Qi [2]
Li, Pengfei [1]
Mao, Kezhi [1]
Lo, Edmond Yat-Man [2, 3]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Inst Catastrophe Risk Management, Interdisciplinary Grad Programme, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
Keywords
Data pruning; Convolutional neural network; Text classification; Sentiment analysis;
DOI
10.1016/j.neucom.2020.07.049
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In spite of the state-of-the-art performance of deep neural networks, shallow neural networks are still the choice in applications with limited computing and memory resources. The convolutional neural network (CNN), in particular the one-convolutional-layer CNN, is a widely used shallow neural network in natural language processing tasks such as text classification. However, it was found that CNNs may misfit to task-irrelevant words in the dataset, which in turn leads to unsatisfactory performance. To alleviate this problem, an attention mechanism can be integrated into the CNN, but this takes up the limited resources. In this paper, we propose to address the misfitting problem from a novel angle: pruning task-irrelevant words from the dataset. The proposed method evaluates the performance of each convolutional filter based on the discriminative power of the feature it generates at the pooling layer, and prunes the words captured by the poorly performing filters. Experimental results show that our proposed model significantly outperforms the CNN baseline model. Moreover, our proposed model produces performance similar to or better than the benchmark models (attention-integrated CNNs) while demanding fewer parameters and FLOPs, and is therefore a good choice for resource-limited scenarios, such as mobile applications. (C) 2020 Elsevier B.V. All rights reserved.
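The pruning idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how discriminative power is measured, so a Fisher-style ratio (squared difference of class means over the sum of class variances) is assumed here. The filters are hand-set rather than trained, and the names `conv_max_pool`, `fisher_score`, and `prune_words`, along with the toy vocabulary, are hypothetical.

```python
from statistics import mean, pvariance

def conv_max_pool(doc_vecs, filt):
    """Slide one filter (a list of h weight vectors) over a document.
    Returns (max activation, start index of the window that produced it)."""
    h = len(filt)
    best_val, best_pos = float("-inf"), 0
    for start in range(len(doc_vecs) - h + 1):
        window = doc_vecs[start:start + h]
        act = sum(w * x for wv, xv in zip(filt, window) for w, x in zip(wv, xv))
        if act > best_val:
            best_val, best_pos = act, start
    return best_val, best_pos

def fisher_score(values, labels):
    """ASSUMED measure of discriminative power for one pooled feature."""
    pos = [v for v, y in zip(values, labels) if y == 1]
    neg = [v for v, y in zip(values, labels) if y == 0]
    return (mean(pos) - mean(neg)) ** 2 / (pvariance(pos) + pvariance(neg) + 1e-9)

def prune_words(docs, labels, embed, filters, n_bad):
    """Score every filter, then drop the words captured by the n_bad worst ones."""
    pooled = [[conv_max_pool([embed[w] for w in d], f) for d in docs] for f in filters]
    scores = [fisher_score([v for v, _ in row], labels) for row in pooled]
    bad = sorted(range(len(filters)), key=scores.__getitem__)[:n_bad]
    to_prune = set()
    for k in bad:
        h = len(filters[k])
        for doc, (_, pos) in zip(docs, pooled[k]):
            to_prune.update(doc[pos:pos + h])  # words under the max-pooled window
    pruned = [[w for w in d if w not in to_prune] for d in docs]
    return pruned, scores, to_prune

# Toy data (hypothetical): 2-d embeddings, two width-1 filters.
embed = {"good": [1.0, 0.0], "bad": [-1.0, 0.0], "the": [0.0, 1.0], "movie": [0.0, 0.5]}
docs = [["the", "movie", "good"], ["good", "the", "movie"],
        ["the", "movie", "bad"], ["bad", "movie", "the"]]
labels = [1, 1, 0, 0]
filters = [[[1.0, 0.0]],   # responds to sentiment-bearing words
           [[0.0, 1.0]]]   # responds to "the", which carries no label information

pruned, scores, to_prune = prune_words(docs, labels, embed, filters, n_bad=1)
print(to_prune)
print(pruned)
```

In this toy setting the second filter produces the same pooled value for both classes, so it receives the lowest score and the word it captures is removed from every document. The title describes the pruning as recursive, which suggests retraining on the pruned data and repeating the step; the sketch shows a single pass only.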
Pages: 143 - 152
Number of pages: 10
Related papers
50 in total
  • [31] A Combined-Convolutional Neural Network for Chinese News Text Classification
    Zhang Y.
    Liu K.-F.
    Zhang Q.-X.
    Wang Y.-G.
    Gao K.-L.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (06) : 1059 - 1067
  • [32] Pruning Convolutional Neural Network with Distinctiveness Approach
    Li, Wenrui
    Plested, Jo
    NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 448 - 455
  • [33] Overview of Deep Convolutional Neural Network Pruning
    Li, Guang
    Liu, Fang
    Xia, Yuping
    2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [34] A morpheme sequence and convolutional neural network based Kazakh text classification
    Parhat, Sardar
    Ting, Gao
    Ablimit, Mijit
    Hamdulla, Askar
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1903 - 1906
  • [35] Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification
    Dong M.
    Li Y.
    Tang X.
    Xu J.
    Bi S.
    Cai Y.
    IEEE Access, 2020, 8 : 16174 - 16186
  • [36] Deep Convolutional Neural Network for Knowledge-Infused Text Classification
    Malik, Sonika
    Jain, Sarika
    NEW GENERATION COMPUTING, 2024, 42 (01) : 157 - 176
  • [37] Rethinking the Pruning Criteria for Convolutional Neural Network
    Huang, Zhongzhan
    Shao, Wenqi
    Wang, Xinjiang
    Lin, Liang
    Luo, Ping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [38] Binary and Multiclass Text Classification by Means of Separable Convolutional Neural Network
    Solovyeva, Elena
    Abdullah, Ali
    INVENTIONS, 2021, 6 (04)
  • [39] Semantic Template-based Convolutional Neural Network for Text Classification
    Chang, Yung-Chun
    Ng, Siu Hin
    Chen, Jung-Peng
    Liang, Yu-Chi
    Hsu, Wen-Lian
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [40] Deep Convolutional Neural Network for Knowledge-Infused Text Classification
    Sonika Malik
    Sarika Jain
    New Generation Computing, 2024, 42 : 157 - 176