Multi Text Classification Model Based on BRET-CNN-BiLSTM

被引:0
|
作者
Xu, ErZhuo [1 ]
Qin, Donghong [2 ]
Huang, Jun [1 ]
Zhang, Jinbo [2 ]
机构
[1] Guangxi Minzu Univ, Sch Elect Informat, Nanning, Peoples R China
[2] Guangxi Minzu Univ, Sch Artificial Intelligence, Nanning, Peoples R China
关键词
BRET; BiLSTM; multi-head attention; CNN; flooding;
D O I
10.1109/BDAI56143.2022.9862653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When the Bert pre-trained model is used for the Chinese text classification task, the internal parameters of the model are relatively fixed. In the process of training smaller data sets, it is easy to cause over-fitting phenomenon. Therefore, a multiple model structure based on Bert-CNN-BiLSTM is proposed. In this structure, the Bert model is used as the text information extractor, after the output features of Bert model, multi head attention and TEXT-CNN model are used for further feature information extraction, low-dimensional feature vectors with more dense semantic information are generated and spliced to improve the information entropy of text vectors. Finally, the BiLSTM with self-attention is used to extract the information of different words in text information and then classify the text information, and the loss function with Flooding mechanism is used to carry out back propagation to further prevent the over-fitting problem of the model on smaller data sets and enhance the generalization ability of the model. Compared with the traditional Bert, LSTM, TEXT-CNN models, the accuracy, precision, recall and F1 measure of this model are all better than those of traditional Bert, LSTM and TEXT-CNN models. Experiments show that the model can effectively extract the feature information from the text and improve the accuracy of the text classification task.
引用
收藏
页码:184 / 189
页数:6
相关论文
共 50 条
  • [1] A Multi-Channel BiLSTM-CNN model for Multilabel Emotion Classification of Informal Text
    Rajabi, Zahra
    Uzuner, Oziem
    Shehu, Amarda
    [J]. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 303 - 306
  • [2] Cross-Domain Text Sentiment Classification Method Based on the CNN-BiLSTM-TE Model
    Zeng, Yuyang
    Zhang, Ruirui
    Yang, Liang
    Song, Sujuan
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (04): : 818 - 833
  • [3] Text Classification for Fault Knowledge Graph Construction Based on CNN-BiLSTM
    Chen, Tianchang
    Lu, Ningyun
    Lei, Xue
    Ma, Leiming
    Tang, Hao
    Jiang, Bin
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2727 - 2732
  • [4] Chinese News Text Classification based on Attention-based CNN-BiLSTM
    Wang, Meng
    Cai, Qiong
    Wang, Liya
    Li, Jun
    Wang, Xiaoke
    [J]. MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [5] Enhancing Text Sentiment Classification with Hybrid CNN-BiLSTM Model on WhatsApp Group
    Susandri, Susandri
    Defit, Sarjon
    Tajuddin, Muhammad
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (03) : 355 - 363
  • [6] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yaoyao Yan
    Fang’ai Liu
    Xuqiang Zhuang
    Jie Ju
    [J]. Neural Processing Letters, 2023, 55 : 1293 - 1316
  • [7] An Improved Model for Medical Forum Question Classification Based on CNN and BiLSTM
    Mutabazi, Emmanuel
    Ni, Jianjun
    Tang, Guangyi
    Cao, Weidong
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [8] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yan, Yaoyao
    Liu, Fang'ai
    Zhuang, Xuqiang
    Ju, Jie
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1293 - 1316
  • [9] Multi-channel Attention Mechanism Text Classification Model Based on CNN and LSTM
    Teng, Jinbao
    Kong, Weiwei
    Tian, Qiaoxin
    Wang, Zhaoqian
    Li, Long
    [J]. Computer Engineering and Applications, 2024, 57 (23) : 154 - 162
  • [10] Music Audio Sentiment Classification Based on CNN-BiLSTM and Attention Model
    Chen Zhen
    Liu Changhui
    [J]. 2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 156 - 160