Improving convolutional neural network for text classification by recursive data pruning

被引:27
|
作者
Li, Qi [2 ]
Li, Pengfei [1 ]
Mao, Kezhi [1 ]
Lo, Edmond Yat-Man [2 ,3 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Inst Catastrophe Risk Management, Interdisciplinary Grad Programme, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
关键词
Data pruning; Convolutional neural network; Text classification; SENTIMENT ANALYSIS;
D O I
10.1016/j.neucom.2020.07.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of the state-of-the-art performance of deep neural networks, shallow neural networks are still the choice in applications with limited computing and memory resources. Convolutional neural network (CNN), in particular the one-convolutional-layer CNN, is a widely-used shallow neural network in natural language processing tasks such as text classification. However, it was found that CNNs may misfit to task-irrelevant words in dataset, which in turn leads to unsatisfactory performance. To alleviate this problem, attention mechanism can be integrated into CNN, but this takes up the limited resources. In this paper, we propose to address the misfitting problem from a novel angle - pruning task-irrelevant words from the dataset. The proposed method evaluates the performance of each convolutional filter based on its discriminative power of the feature generated at the pooling layer, and prunes words captured by the poorly-performed filters. Experiment results show that our proposed model significantly outperforms the CNN baseline model. Moreover, our proposed model produces performance similar to or better than the benchmark models (attention integrated CNNs) while demanding less parameters and FLOPs, and is therefore a choice model for resource limited scenarios, such as mobile applications. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [11] A Dynamic Convolutional Neural Network Approach for Legal Text Classification
    Hammami, Eya
    Faiz, Rim
    Akermi, Imen
    INFORMATION AND KNOWLEDGE SYSTEMS: DIGITAL TECHNOLOGIES, ARTIFICIAL INTELLIGENCE AND DECISION MAKING, ICIKS 2021, 2021, 425 : 71 - 84
  • [12] Text Classification Based on Convolutional Neural Network and Attention Model
    Yang, Shuang
    Tang, Yan
    2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 67 - 73
  • [13] Impact of convolutional neural network and FastText embedding on text classification
    Umer, Muhammad
    Imtiaz, Zainab
    Ahmad, Muhammad
    Nappi, Michele
    Medaglia, Carlo
    Choi, Gyu Sang
    Mehmood, Arif
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5569 - 5585
  • [14] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [15] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [16] Convolutional Neural Network with Contextualized Word Embedding for Text Classification
    Fan, Gaoyang
    Zhu, Cui
    Zhu, Wenjun
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [17] TextConvoNet: a convolutional neural network based architecture for text classification
    Sanskar Soni
    Satyendra Singh Chouhan
    Santosh Singh Rathore
    Applied Intelligence, 2023, 53 : 14249 - 14268
  • [18] Thai Text Detection and Classification Using Convolutional Neural Network
    Malakar, Susanta
    Chiracharit, Werapon
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 99 - 102
  • [19] Impact of convolutional neural network and FastText embedding on text classification
    Muhammad Umer
    Zainab Imtiaz
    Muhammad Ahmad
    Michele Nappi
    Carlo Medaglia
    Gyu Sang Choi
    Arif Mehmood
    Multimedia Tools and Applications, 2023, 82 : 5569 - 5585
  • [20] TextConvoNet: a convolutional neural network based architecture for text classification
    Soni, Sanskar
    Chouhan, Satyendra Singh
    Rathore, Santosh Singh
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14249 - 14268