Text Classification and Transfer Learning Based on Character-Level Deep Convolutional Neural Networks

被引:4
|
作者
Sato, Minato [1 ]
Orihara, Ryohei [1 ]
Sei, Yuichi [1 ]
Tahara, Yasuyuki [1 ]
Ohsuga, Akihiko [1 ]
机构
[1] Univ Electrocommun, Grad Sch Informat Syst, 1-5-1 Chofu Gaoka, Chofu, Tokyo, Japan
关键词
Deep learning; Temporal ConvNets; Transfer learning Text classification; Sentiment analysis;
D O I
10.1007/978-3-319-93581-2_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal (one-dimensional) Convolutional Neural Network (Temporal CNN, ConvNet) is an emergent technology for text understanding. The input for the ConvNets could be either a sequence of words or a sequence of characters. In the latter case there are no needs for natural language processing. Past studies showed that the character-level ConvNets worked well for text classification in English and romanized Chinese corpus. In this article we apply the character-level ConvNets to Japanese corpus. We confirmed that meaningful representations are extracted by the ConvNets in English corpus and Japanese corpus. We attempt to reuse the meaningful representations that are learned in the ConvNets from a large-scale dataset in the form of transfer learning. As for the application to the news categorization and the sentiment analysis tasks in Japanese corpus, the ConvNets outperformed N-gram-based classifiers. In addition, our ConvNets transfer learning frameworks worked well for a task which is similar to one used for pre-training.
引用
收藏
页码:62 / 81
页数:20
相关论文
共 50 条
  • [1] Character-level Convolutional Networks for Text Classification
    Zhang, Xiang
    Zhao, Junbo
    Yann Lecun
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [2] Character-Level Attention Convolutional Neural Networks for Short-Text Classification
    Yin, Feiyang
    Yao, Zhilin
    Liu, Jia
    [J]. HUMAN CENTERED COMPUTING, 2019, 11956 : 560 - 567
  • [3] Weakly-supervised character-level convolutional neural networks for text classification
    Liu, Yongsheng
    Chen, Wenyu
    Niyongabo, Rubungo Andre
    Qu, Hong
    Miao, Kebin
    Wei, Feng
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 701 - 708
  • [4] Japanese Text Classification by Character-level Deep ConvNets and Transfer Learning
    Sato, Minato
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    [J]. ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 175 - 184
  • [5] Character-level Neural Networks for Short Text Classification
    Liu, Jingxue
    Meng, Fanrong
    Zhou, Yong
    Liu, Bing
    [J]. 2017 INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2017,
  • [6] Joint Character-Level Convolutional and Generative Adversarial Networks for Text Classification
    Wang, Tianshi
    Liu, Li
    Zhang, Huaxiang
    Zhang, Long
    Chen, Xiuxiu
    [J]. COMPLEXITY, 2020, 2020
  • [7] A Character-level Short Text Classification Model Based On Spiking Neural Networks
    Jiang, Chengzhi
    Li, Linjing
    Zeng, Daniel Dajun
    Wang, Xiaoxuan
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Crowdsourcing the character of a place: Character-level convolutional networks for multilingual geographic text classification
    Adams, Benjamin
    McKenzie, Grant
    [J]. TRANSACTIONS IN GIS, 2018, 22 (02) : 394 - 408
  • [9] A Complaint Text Classification Model Based on Character-level Convolutional Network
    Tong, Xuesong
    Wu, Bin
    Wang, Shuyang
    Lv, Jinna
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 507 - 511
  • [10] Character-level Intrusion Detection Based on Convolutional Neural Networks
    Lin, Steven Z.
    Shi, Yong
    Xue, Zhi
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,