Hierarchical Comprehensive Context Modeling for Chinese Text Classification

被引:7
|
作者
Liu, Jingang [1 ,2 ]
Xia, Chunhe [2 ]
Yan, Haihua [2 ]
Xie, Zhipu [1 ]
Sun, Jie [2 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Chinese text; classification; contextual information; deep neural network; STRATEGIES;
D O I
10.1109/ACCESS.2019.2949175
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Chinese text classification task is challenging compared to tasks based on other languages such as English due to the characteristics of the Chinese text itself. In recent years, some popular methods based on deep learning have been used for text classification, such as the convolutional neural network (CNN) and the long short-term memory (LSTM) network. However, some problems are still encountered when classifying Chinese text. For example, important but obscure context information in Chinese text is not easily extracted. To improve the effect of Chinese text classification, we propose a novel classification model in this paper named the hierarchical comprehensive context modeling network (HCCMN) that can extract more comprehensive context. Our approach aims to extract contextual information and integrate it with the original input and then extract hierarchically more context, spatial information and high-weight local features from the integrated results. In addition, our method can remember long-term historical obscure information. Since Chinese radiology texts are complicated and difficult to obtain, we collected a Chinese radiology medical text dataset (CIRTEXT) containing more than 56,000 real-world data samples to verify the effect of this work. We conducted experiments on four datasets and showed that our HCCMN performs at state-of-the-art levels on three selected evaluation metrics compared to baselines. We present promising results showing that our hierarchical context modeling network extracts useful context from Chinese text more effectively and comprehensively.
引用
收藏
页码:154546 / 154559
页数:14
相关论文
共 50 条
  • [1] Context Recognition for Hierarchical Text Classification
    Liu, Rey-Long
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (04): : 803 - 813
  • [2] Hierarchical text classification
    Pulijala, AK
    Gauch, S
    ISAS/CITSA 2004: International Conference on Cybernetics and Information Technologies, Systems and Applications and 10th International Conference on Information Systems Analysis and Synthesis, Vol 1, Proceedings: COMMUNICATIONS, INFORMATION TECHNOLOGIES AND COMPUTING, 2004, : 257 - 262
  • [3] On Dataless Hierarchical Text Classification
    Song, Yangqiu
    Roth, Dan
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1579 - 1585
  • [4] Experiments with hierarchical text classification
    Granitzer, M
    Auer, P
    PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2005, : 177 - 182
  • [5] Hierarchical text classification and evaluation
    Sun, AX
    Lim, EP
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 521 - 528
  • [6] Chinese Theology: Text and Context
    Yin, Peng
    JOURNAL OF ANGLICAN STUDIES, 2019, 17 (01) : 116 - 118
  • [7] Chinese Theology: Text and Context
    Skreslet, Stanley H.
    INTERPRETATION-A JOURNAL OF BIBLE AND THEOLOGY, 2018, 72 (04) : 462 - 463
  • [8] Chinese Theology: Text and Context
    Chow, Alexander
    STUDIES IN WORLD CHRISTIANITY, 2017, 23 (03) : 281 - 283
  • [9] Hierarchical Convolutional Attention Networks Using Joint Chinese Word Embedding for Text Classification
    Zhang, Kaiqiang
    Wang, Shupeng
    Li, Binbin
    Mei, Feng
    Zhang, Jianyu
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 234 - 246
  • [10] Naive approach for hierarchical text classification
    Wang, Mingwen
    Lu, Xu
    Zhang, Huawei
    Luo, Yuansheng
    Journal of Computational Information Systems, 2007, 3 (04): : 1591 - 1598