TextCNN-based ensemble learning model for Japanese Text Multi-classification

Cited by: 4
Authors
Chen, Hua [1]
Zhang, Zepeng [1]
Huang, Shiting [1]
Hu, Jiayu [1]
Ni, Wenlong [1]
Liu, Jianming [1]
Affiliations
[1] Jiangxi Normal Univ, Sch Comp & Informat Engn, Nanchang, Peoples R China
Keywords
ALBERT; RoBERTa; DistilBERT; TextCNN; Ensemble learning; Japanese text classification
DOI
10.1016/j.compeleceng.2023.108751
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we aim to improve Japanese text classification using a TextCNN-based ensemble learning model. Specifically, we first construct three sub-classifiers by combining ALBERT, RoBERTa, and DistilBERT with TextCNN, respectively, and then explore how effectively an ensemble learning model can leverage complementary information from the different sub-classifiers for better text classification. We conduct a series of experiments on a dataset collected from Japanese Wikipedia pages and divided into 31 categories. The experimental results show that the proposed approach performs well: accuracy, precision, recall, and F1 score reach 0.881, 0.884, 0.880, and 0.881, respectively, which indicates that the TextCNN-based ensemble learning model can be used effectively for Japanese text multi-classification.
Pages: 12
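The abstract describes combining the outputs of three sub-classifiers (ALBERT, RoBERTa, and DistilBERT, each paired with TextCNN) but does not state which ensemble rule is used. As a minimal sketch only, the block below assumes soft voting, that is, averaging the class probabilities of the sub-classifiers and taking the most probable class. The function name `soft_vote`, the variable names, and the probability values are all hypothetical and are not taken from the paper; three classes stand in for the 31 categories to keep the example short.

```python
from typing import List, Sequence


def soft_vote(prob_sets: Sequence[Sequence[Sequence[float]]]) -> List[int]:
    """Average class probabilities across models and return the argmax per sample.

    prob_sets[m][i][c] is the probability that model m assigns to class c
    for sample i. Soft voting is an assumption here, not the paper's
    confirmed ensemble strategy.
    """
    n_models = len(prob_sets)
    n_samples = len(prob_sets[0])
    predictions = []
    for i in range(n_samples):
        n_classes = len(prob_sets[0][i])
        # Mean probability of each class over all sub-classifiers.
        avg = [
            sum(prob_sets[m][i][c] for m in range(n_models)) / n_models
            for c in range(n_classes)
        ]
        # Index of the class with the highest averaged probability.
        predictions.append(max(range(n_classes), key=avg.__getitem__))
    return predictions


# Hypothetical softmax outputs for 2 samples and 3 classes.
albert_probs = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3]]
roberta_probs = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
distilbert_probs = [[0.2, 0.7, 0.1], [0.2, 0.2, 0.6]]

ensemble_pred = soft_vote([albert_probs, roberta_probs, distilbert_probs])
print(ensemble_pred)
```

In this toy input, two of the three models rank class 0 first for the first sample, yet the averaged probabilities favor class 1 because the third model is much more confident. This is one way an ensemble can draw on complementary information rather than simply counting votes.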