Compositional Recurrent Neural Networks for Chinese Short Text Classification

被引:0
|
作者
Zhou, Yujun [1 ,2 ,3 ]
Xu, Bo [1 ]
Xu, Jiaming [1 ]
Yang, Lei [1 ,2 ,3 ]
Li, Changliang [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Jiangsu Jinling Sci & Technol Grp Co Ltd, Nanjing 210008, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
D O I
10.1109/WI.2016.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word segmentation is the first step in Chinese natural language processing, and the error caused by word segmentation can be transmitted to the whole system. In order to reduce the impact of word segmentation and improve the overall performance of Chinese short text classification system, we propose a hybrid model of character-level and word-level features based on recurrent neural network (RNN) with long short-term memory (LSTM). By integrating character-level feature into word-level feature, the missing semantic information by the error of word segmentation will be constructed, meanwhile the wrong semantic relevance will be reduced. The final feature representation is that it suppressed the error of word segmentation in the case of maintaining most of the semantic features of the sentence. The whole model is finally trained end-to-end with supervised Chinese short text classification task. Results demonstrate that the proposed model in this paper is able to represent Chinese short text effectively, and the performances of 32-class and 5-class categorization outperform some remarkable methods.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 50 条
  • [1] Attention-Based Recurrent Neural Networks in Chinese Short Text Classification
    Lin, Xin
    Chen, Jiahao
    Tan, Jun
    Bi, Ning
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 399 - 402
  • [2] LSTM Recurrent Neural Networks for Short Text and Sentiment Classification
    Nowak, Jakub
    Taspinar, Ahmet
    Scherer, Rafal
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 553 - 562
  • [3] Recurrent Convolutional Neural Networks for Text Classification
    Lai, Siwei
    Xu, Liheng
    Liu, Kang
    Zhao, Jun
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2267 - 2273
  • [4] Recurrent Graph Neural Networks for Text Classification
    Wei, Xinde
    Huang, Hai
    Ma, Longxuan
    Yang, Ze
    Xu, Liutong
    [J]. PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 91 - 97
  • [5] Convolutional Recurrent Neural Networks for Text Classification
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    Wang, Lei
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [6] Convolutional Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 65 - 82
  • [7] Combining Concept Graph with Improved Neural Networks for Chinese Short Text Classification
    Liao, Jialu
    Sun, Fanke
    Gu, Jinguang
    [J]. SEMANTIC TECHNOLOGY, JIST 2019, 2020, 1157 : 205 - 212
  • [8] Chinese Short Text Classification with Mutual-Attention Convolutional Neural Networks
    Hao, Ming
    Xu, Bo
    Liang, Jing-Yi
    Zhang, Bo-Wen
    Yin, Xu-Cheng
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [9] Hybrid Framework of Convolution and Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    [J]. 11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 313 - 320
  • [10] Improving Multilabel Text Classification with Stacking and Recurrent Neural Networks
    Mansueli, Rodrigo
    Domingues, Marcos Aurelio
    Feltrim, Valeria Delisandra
    [J]. PROCEEDINGS OF THE 28TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, WEBMEDIA 2022, 2022, : 117 - 122