A BERT-Based Hybrid Short Text Classification Model Incorporating CNN and Attention-Based BiGRU

被引:18
|
作者
Bao, Tong [1 ,2 ]
Ren, Ni [1 ,2 ]
Luo, Rui [1 ]
Wang, Baojia [1 ]
Shen, Gengyu [1 ]
Guo, Ting [1 ]
机构
[1] Jiangsu Acad Agr Sci, Informat Ctr, Nanjing, Peoples R China
[2] Jiangsu Univ, Inst Sci & Technol Informat, Zhenjiang, Peoples R China
关键词
Deep Learning; Fusion Framework; Natural Language Processing; Short Text Classification; CONVOLUTIONAL NEURAL-NETWORKS; LSTM;
D O I
10.4018/JOEUC.294580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Short text classification is a research focus for natural language processing (NLP), which is widely used in news classification, sentiment analysis, mail filtering, and other fields. In recent years, deep learning techniques are applied to text classification and have made some progress. Different from ordinary text classification, short text has the problem of less vocabulary and feature sparsity, which raise higher request for text semantic feature representation. To address this issue, this paper proposes a feature fusion framework based on the bidirectional encoder representations from transformers (BERT). In this hybrid method, BERT is used to train word vector representation. Convolutional neural network (CNN) captures static features. As a supplement, a bi-gated recurrent neural network (BiGRU) is adopted to capture contextual features. Furthermore, an attention mechanism is introduced to assign the weight of salient words. The experimental results confirmed that the proposed model significantly outperforms the other state-of-the-art baseline methods.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Attention-based LSTM, GRU and CNN for short text classification
    Yu, Shujuan
    Liu, Danlei
    Zhu, Wenfeng
    Zhang, Yun
    Zhao, Shengmei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (01) : 333 - 340
  • [2] CRAN: A Hybrid CNN-RNN Attention-Based Model for Text Classification
    Guo, Long
    Zhang, Dongxiang
    Wang, Lei
    Wang, Han
    Cui, Bin
    [J]. CONCEPTUAL MODELING, ER 2018, 2018, 11157 : 571 - 585
  • [3] Research on Internet Text Sentiment Classification Based on BERT and CNN-BiGRU
    Wei, Guoli
    [J]. 2022 11TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2022), 2022, : 285 - 289
  • [4] A Multiscale Interactive Attention Short Text Classification Model Based on BERT
    Zhou, Lu
    Wang, Peng
    Zhang, Huijun
    Wu, Shengbo
    Zhang, Tao
    [J]. IEEE Access, 2024, 12 : 160992 - 161001
  • [5] Short-Text Classification Detector: A Bert-Based Mental Approach
    Hu, Yongjun
    Ding, Jia
    Dou, Zixin
    Chang, Huiyou
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [6] An Attention-based Hybrid LSTM-CNN Model for Arrhythmias Classification
    Liu, Fan
    Zhou, Xingshe
    Wang, Tianben
    Cao, Jinli
    Wang, Zhu
    Wang, Hua
    Zhang, Yanchun
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] BVMHA: Text classification model with variable multihead hybrid attention based on BERT
    Peng, Bo
    Zhang, Tao
    Han, Kundong
    Zhang, Zhe
    Ma, Yuquan
    Ma, Mengnan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 1443 - 1454
  • [8] Improving Bert-Based Model for Medical Text Classification with an Optimization Algorithm
    Gasmi, Karim
    [J]. ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 1653 : 101 - 111
  • [9] Sentiment Analysis of Review Text Based on BiGRU-Attention and Hybrid CNN
    Zhu, Qiannan
    Jiang, Xiaofan
    Ye, Renzhen
    [J]. IEEE ACCESS, 2021, 9 : 149077 - 149088
  • [10] BAE: BERT-based Adversarial Examples for Text Classification
    Garg, Siddhant
    Ramakrishnan, Goutham
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6174 - 6181