Improving convolutional neural network for text classification by recursive data pruning

被引:27
|
作者
Li, Qi [2 ]
Li, Pengfei [1 ]
Mao, Kezhi [1 ]
Lo, Edmond Yat-Man [2 ,3 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Inst Catastrophe Risk Management, Interdisciplinary Grad Programme, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
关键词
Data pruning; Convolutional neural network; Text classification; SENTIMENT ANALYSIS;
D O I
10.1016/j.neucom.2020.07.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of the state-of-the-art performance of deep neural networks, shallow neural networks are still the choice in applications with limited computing and memory resources. Convolutional neural network (CNN), in particular the one-convolutional-layer CNN, is a widely-used shallow neural network in natural language processing tasks such as text classification. However, it was found that CNNs may misfit to task-irrelevant words in dataset, which in turn leads to unsatisfactory performance. To alleviate this problem, attention mechanism can be integrated into CNN, but this takes up the limited resources. In this paper, we propose to address the misfitting problem from a novel angle - pruning task-irrelevant words from the dataset. The proposed method evaluates the performance of each convolutional filter based on its discriminative power of the feature generated at the pooling layer, and prunes words captured by the poorly-performed filters. Experiment results show that our proposed model significantly outperforms the CNN baseline model. Moreover, our proposed model produces performance similar to or better than the benchmark models (attention integrated CNNs) while demanding less parameters and FLOPs, and is therefore a choice model for resource limited scenarios, such as mobile applications. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [41] Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification
    Dong, Min
    Li, Yongfa
    Tang, Xue
    Xu, Jingyun
    Bi, Sheng
    Cai, Yi
    IEEE ACCESS, 2020, 8 : 16174 - 16186
  • [42] Thinning of convolutional neural network with mixed pruning
    Yang, Wenzhu
    Jin, Lilei
    Wang, Sile
    Cu, Zhenchao
    Chen, Xiangyang
    Chen, Liping
    IET IMAGE PROCESSING, 2019, 13 (05) : 779 - 784
  • [43] Classification of Medical Text Data Using Convolutional Neural Network-Support Vector Machine Method
    Liu, Lan
    Sun, Xiankun
    Li, Chengfan
    Lei, Yongmei
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2020, 10 (07) : 1746 - 1753
  • [44] Recursive least squares method for training and pruning convolutional neural networks
    Yu, Tianzong
    Zhang, Chunyuan
    Ma, Meng
    Wang, Yuan
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24603 - 24618
  • [45] Recursive least squares method for training and pruning convolutional neural networks
    Tianzong Yu
    Chunyuan Zhang
    Meng Ma
    Yuan Wang
    Applied Intelligence, 2023, 53 : 24603 - 24618
  • [46] A Hypergraph Convolutional Neural Network for Node Classification of Citation Network Data
    Xiao, Bing-Yu
    Tseng, Chien-Cheng
    Lee, Su-Ling
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 243 - 244
  • [47] Casting Defect Detection and Classification of Convolutional Neural Network Based on Recursive Attention Model
    Zhao, Zhichao
    Wu, Tiefeng
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [48] Classification of Eye Tracking Data using a Convolutional Neural Network
    Yin, Yuehan
    Juan, Chunghao
    Chakraborty, Joyram
    McGuire, Michael P.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 530 - 535
  • [49] Convolutional neural network for detection and classification of seizures in clinical data
    Tomas Iešmantas
    Robertas Alzbutas
    Medical & Biological Engineering & Computing, 2020, 58 : 1919 - 1932
  • [50] Convolutional neural network for detection and classification of seizures in clinical data
    Iesmantas, Tomas
    Alzbutas, Robertas
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (09) : 1919 - 1932