TextCNN-based ensemble learning model for Japanese Text Multi-classification

被引:12
|
作者
Chen, Hua [1 ]
Zhang, Zepeng [1 ]
Huang, Shiting [1 ]
Hu, Jiayu [1 ]
Ni, Wenlong [1 ]
Liu, Jianming [1 ]
机构
[1] Jiangxi Normal Univ, Sch Comp & Informat Engn, Nanchang, Peoples R China
关键词
ALBERT; RoBERTa; DistilBERT; TextCNN; Ensemble learning; Japanese text classification;
D O I
10.1016/j.compeleceng.2023.108751
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we aim at improving Japanese text classification using TextCNN-based ensemble learning model. Specifically, we first construct three different sub-classifiers, combining AL-BERT, RoBERTa, DistilBERT with TextCNN, respectively; and then explore the effectiveness of ensemble learning model to leverage complementary information from different sub-classifiers for better text classification. We also conduct a series of experiments with the dataset collected from Japanese Wikipedia pages, which was divided into 31 categories. The experimental results show that the proposed approach achieves a good performance. The accuracy, precision, recall and F1 scores reach 0.881, 0.884, 0.880 and 0.881, respectively, which shows that the TextCNN-based ensemble learning model can be used for Japanese Text Multi-Classification effectively.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Adaptive segmentation based on multi-classification model for dermoscopy images
    Fengying Xie
    Yefen Wu
    Yang Li
    Zhiguo Jiang
    Rusong Meng
    Frontiers of Computer Science, 2015, 9 : 720 - 728
  • [22] Adaptive segmentation based on multi-classification model for dermoscopy images
    Xie, Fengying
    Wu, Yefen
    Li, Yang
    Jiang, Zhiguo
    Meng, Rusong
    FRONTIERS OF COMPUTER SCIENCE, 2015, 9 (05) : 720 - 728
  • [23] EnML: Multi-label Ensemble Learning for Urdu Text Classification
    Mehmood, Faiza
    Shahzadi, Rehab
    Ghafoor, Hina
    Asim, Muhammad Nabeel
    Ghani, Muhammad Usman
    Mahmood, Waqar
    Dengel, Andreas
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (09)
  • [24] DISTRIBUTED ENSEMBLE LEARNING IN TEXT CLASSIFICATION
    Silva, Catarina
    Ribeiro, Bernardete
    Lotric, Uros
    Dobnikar, Andrej
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS: ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS, 2008, : 420 - +
  • [25] A Multi-Classification Sentiment Analysis Model of Chinese Short Text Based on Gated Linear Units and Attention Mechanism
    Liu, Lei
    Chen, Hao
    Sun, Yinghong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [26] Machine Learning for Multi-Classification of Botnets Attacks
    Tran, Thanh Cong
    Dang, Tran Khanh
    PROCEEDINGS OF THE 2022 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2022), 2022,
  • [27] Multi-Classification of Rainfall Weather Based on Deep Learning-Mod
    Lu, Zhiying
    Ding, Xudong
    Ren, Yimo
    Sun, Xiaolei
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6374 - 6379
  • [28] Ensemble Learning Based Feature Selection with an Application to Text Classification
    Onan, Aytug
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [29] Ontology-based multi-classification learning for video concept detection
    Wu, Y
    Tseng, BL
    Smith, JR
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1003 - 1006
  • [30] Deep Learning-Based Multi-classification for Malware Detection in IoT
    Wang, Zhiqiang
    Liu, Qian
    Wang, Zhuoyue
    Chi, Yaping
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (17)