DPTCN: A novel deep CNN model for short text classification

Cited by: 7
Authors
Yu, Shujuan [1]
Liu, Danlei [1]
Zhang, Yun [1]
Zhao, Shengmei [1]
Wang, Weigang [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep convolution network; causal convolution; shortcut connection; short text classification;
DOI
10.3233/JIFS-210970
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Extracting useful textual information and effective long-range associations has long been a bottleneck for text classification, an important branch of Natural Language Processing (NLP). Thanks to the sustained efforts of deep learning researchers, deep Convolutional Neural Networks (CNNs) have achieved remarkable results in Computer Vision, but their value in NLP tasks remains controversial. In this paper, we propose a novel deep CNN named Deep Pyramid Temporal Convolutional Network (DPTCN) for short text classification, which mainly consists of a concatenated embedding layer, causal convolution, 1/2 max-pooling down-sampling, and residual blocks. Our work was strongly inspired by two well-designed models: the temporal convolutional network for sequence modeling and the deep pyramid CNN for text categorization. Their applicability and pertinence show how to build a model for a specific domain. In the experiments, we evaluate the proposed model on 7 datasets with 6 models and analyze the impact of three different embedding methods. The results show that our work is a promising attempt to apply a word-level deep convolutional network to short text classification.
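The building blocks named in the abstract (causal convolution, 1/2 max-pooling down-sampling, and residual shortcut connections) can be illustrated with a minimal single-channel sketch in plain Python. This is not the authors' implementation: the function names, the kernel values, the window-2/stride-2 pooling, the ReLU placement, and the order of operations inside a block are all assumptions made for illustration, and the concatenated embedding layer is omitted.

```python
from typing import List


def causal_conv1d(x: List[float], kernel: List[float]) -> List[float]:
    """Causal 1-D convolution: output[t] depends only on x[t-k+1 .. t]."""
    k = len(kernel)
    # Pad on the left only, so no output position can see future inputs.
    padded = [0.0] * (k - 1) + list(x)
    return [
        sum(kernel[j] * padded[t + j] for j in range(k))
        for t in range(len(x))
    ]


def half_max_pool(x: List[float]) -> List[float]:
    """Max pooling with window 2 and stride 2 (assumed), halving the length."""
    return [max(x[i:i + 2]) for i in range(0, len(x), 2)]


def residual_block(x: List[float], kernel: List[float]) -> List[float]:
    """Two causal convolutions with an identity shortcut (assumed layout)."""
    h = causal_conv1d(x, kernel)
    h = [max(0.0, v) for v in h]  # ReLU non-linearity (assumed)
    h = causal_conv1d(h, kernel)
    # Shortcut connection: add the block input to the block output.
    return [a + b for a, b in zip(x, h)]


def pyramid(x: List[float], kernel: List[float], num_blocks: int) -> List[float]:
    """Stack residual blocks, halving the sequence length after each one."""
    for _ in range(num_blocks):
        x = residual_block(x, kernel)
        x = half_max_pool(x)
    return x


if __name__ == "__main__":
    sequence = [0.1, 0.4, 0.2, 0.9, 0.3, 0.7, 0.5, 0.6]
    features = pyramid(sequence, kernel=[0.5, 0.5], num_blocks=3)
    print(len(sequence), "->", len(features))
```

Because each pooling step halves the sequence length, the cost of every later block shrinks geometrically, which is the "pyramid" idea. The left-only padding keeps each output position independent of later tokens.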
Pages: 7093-7100
Page count: 8