DPTCN: A novel deep CNN model for short text classification

Cited by: 7
Authors
Yu, Shujuan [1]
Liu, Danlei [1]
Zhang, Yun [1]
Zhao, Shengmei [1]
Wang, Weigang [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep convolution network; causal convolution; shortcut connection; short text classification
DOI
10.3233/JIFS-210970
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text classification is an important branch of Natural Language Processing (NLP), and extracting useful text information and effective long-range associations has always been a bottleneck for it. Thanks to the efforts of deep learning researchers, deep Convolutional Neural Networks (CNNs) have made remarkable achievements in Computer Vision, but their use in NLP tasks remains controversial. In this paper, we propose a novel deep CNN named Deep Pyramid Temporal Convolutional Network (DPTCN) for short text classification. It consists mainly of a concatenated embedding layer, causal convolution, 1/2 max-pooling down-sampling, and residual blocks. Our work was strongly inspired by two well-designed models: the temporal convolutional network for sequence modeling and the deep pyramid CNN for text categorization. Their applicability and pertinence showed us how to build a model for a specific domain. In the experiments, we evaluate the proposed model on 7 datasets against 6 models and analyze the impact of three different embedding methods. The results show that our work is a good attempt at applying a word-level deep convolutional network to short text classification.
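The building blocks named in the abstract (causal convolution, 1/2 max-pooling down-sampling, and residual shortcut connections) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the single-channel sequences, the pooling window of 2, the placement of ReLU before each convolution, the two convolutions per block, and all function names are assumptions made for illustration only.

```python
from typing import List

Sequence = List[float]


def causal_conv1d(x: Sequence, kernel: Sequence) -> Sequence:
    """Causal 1-D convolution: out[t] depends only on x[t], x[t-1], ...

    kernel[0] weights the current step, kernel[1] the previous step, etc.
    The input is left-padded with zeros so the output keeps the input length.
    """
    pad = len(kernel) - 1
    padded = [0.0] * pad + list(x)
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            acc += w * padded[t + pad - j]  # never reads beyond step t
        out.append(acc)
    return out


def half_max_pool(x: Sequence) -> Sequence:
    """Max pooling with window 2 and stride 2 (assumed sizes): halves the length."""
    return [max(x[i:i + 2]) for i in range(0, len(x), 2)]


def relu(x: Sequence) -> Sequence:
    return [max(v, 0.0) for v in x]


def residual_block(x: Sequence, kernel: Sequence) -> Sequence:
    """Two causal convolutions plus an identity shortcut (assumed layout)."""
    h = causal_conv1d(relu(x), kernel)
    h = causal_conv1d(relu(h), kernel)
    return [a + b for a, b in zip(x, h)]  # shortcut connection


def pyramid(x: Sequence, kernel: Sequence, min_len: int = 2) -> Sequence:
    """Alternate residual blocks and down-sampling until the sequence is short."""
    h = list(x)
    while len(h) > max(min_len, 1):
        h = half_max_pool(residual_block(h, kernel))
    return h


if __name__ == "__main__":
    tokens = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
    print(causal_conv1d(tokens[:4], [1.0, 1.0]))
    print(len(pyramid(tokens, [0.5, 0.25])))
```

Because each pooling step halves the sequence length, the total work over all levels stays within a constant factor of the work at the first level, which is the "pyramid" idea the model name refers to. The left-only padding is what makes the convolution causal: changing a later token never changes an earlier output.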
Pages: 7093-7100
Page count: 8