Transfer Learning Method for Very Deep CNN for Text Classification and Methods for its Evaluation

Cited by: 14
Authors
Moriya, Shun [1]
Shibata, Chihiro [1]
Affiliations
[1] Tokyo Univ Technol, Dept Comp Sci, Hachioji, Tokyo, Japan
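Funding
Japan Society for the Promotion of Science
Keywords
transfer learning; text classification; CNN; residual network
DOI
10.1109/COMPSAC.2018.10220
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract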
In recent years, it has become possible to perform text classification with high accuracy by using convolutional neural networks (CNNs). Zhang et al. decomposed words into characters and classified texts using a CNN with relatively deep layers to obtain excellent classification results. However, it is often difficult to prepare a sufficient number of labeled samples for solving real-world text-classification problems. One method for handling this problem is transfer learning, which uses a network tuned for an arbitrary task as the initial network for a target task. While transfer learning is known to be effective for image recognition, for tasks in natural language processing, such as document classification, it has not yet been shown for what types of data and to what extent transfer learning is effective. In this paper, we first introduce a character-level CNN adopting the structure of a residual network to construct a network with deeper layers for Japanese text classification. We then demonstrate that we can improve classification accuracy by performing transfer learning between two particular datasets. Additionally, we propose an approach to evaluate the effectiveness of transfer learning and use it to evaluate our model.
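The two ideas named in the abstract, a residual (skip) connection and initializing a target network from one trained on another task, can be illustrated with a toy sketch. This is not the authors' implementation: it uses plain Python lists instead of a deep-learning library, and the names `init_params`, `residual_block`, `transfer`, and the parameter keys are hypothetical, chosen only for illustration.

```python
import random


def init_params(num_classes, hidden=4, seed=0):
    """Randomly initialise a toy model: one feature layer and one classifier head."""
    rng = random.Random(seed)
    return {
        "features.weight": [[rng.uniform(-1, 1) for _ in range(hidden)]
                            for _ in range(hidden)],
        "classifier.weight": [[rng.uniform(-1, 1) for _ in range(hidden)]
                              for _ in range(num_classes)],
    }


def residual_block(x, weight):
    """Return x + relu(W x); adding the input back is the skip connection."""
    fx = [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weight]
    return [a + b for a, b in zip(x, fx)]


def transfer(source, target, skip_prefix="classifier."):
    """Copy source parameters into a copy of target, leaving the task-specific head alone."""
    result = dict(target)
    for name, value in source.items():
        if not name.startswith(skip_prefix) and name in target:
            result[name] = [row[:] for row in value]
    return result


# Assumption: `source` stands in for a network already trained on the source task.
source = init_params(num_classes=5, seed=1)
# Freshly initialised network for a target task with a different label set.
target = init_params(num_classes=3, seed=2)
target = transfer(source, target)
```

After the call, the target model shares the source model's feature weights but keeps its own three-class head, which would then be trained on the target data.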
Pages: 153-158
Page count: 6