A text classification network model combining machine learning and deep learning

被引:0
|
作者
Chen, Hao [1 ]
Zhang, Haifei [1 ]
Yang, Yuwei [1 ]
He, Long [1 ]
机构
[1] Nantong Inst Technol, Sch Comp & Informat Engn, Nantong 226002, Peoples R China
关键词
text classification; neural networks; machine learning; deep learning; term frequency-inverse document frequency; TF-IDF; text convolutional neural networks; TextCNN; rotary transformer; RoFormer; attention mechanism;
D O I
10.1504/IJSNET.2024.137333
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification is significant in natural language processing tasks, which can deal with a large amount of data scientifically. However, for text feature extraction, it is not easy to simultaneously consider the characteristics of short and long texts. Moreover, it does not reflect the importance of words in the text, resulting in unsatisfactory text classification results. Therefore, this paper proposes a machine learning and deep learning model. Specifically, text features are extracted by joint training, and then an attention mechanism is introduced to classify short texts and long texts. Firstly, the pre-processed data is subjected to term frequency-inverse document frequency, text convolutional neural networks and rotary transformer models for joint extraction of text features. Subsequently, the attention mechanism is introduced for the weight distribution problem after model fusion to improve the focus on keywords. Eventually, the experimental results indicate that the model proposed in this paper has a good effect on long and short-text classification. We achieved 95.8%, 92.5% and 95.4% accuracy on three public datasets, respectively. In this way, the proposed model is significant in text classification.
引用
收藏
页码:182 / 192
页数:12
相关论文
共 50 条
  • [1] MII: A Novel Text Classification Model Combining Deep Active Learning with BERT
    Zhang, Anman
    Li, Bohan
    Wang, Wenhuan
    Wan, Shuo
    Chen, Weitong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1499 - 1514
  • [2] MII: A novel text classification model combining deep active learning with BERT
    Zhang, Anman
    Li, Bohan
    Wang, Wenhuan
    Wan, Shuo
    Chen, Weitong
    [J]. Computers, Materials and Continua, 2020, 63 (03): : 1499 - 1514
  • [3] An Integrated Model Combining Machine Learning and Deep Learning Algorithms for Classification of Rupture Status of IAs
    Chen, Rong
    Mo, Xiao
    Chen, Zhenpeng
    Feng, Pujie
    Li, Haiyun
    [J]. FRONTIERS IN NEUROLOGY, 2022, 13
  • [4] A Hybrid Deep Learning Model for Text Classification
    Chen, Xianglong
    Ouyang, Chunping
    Liu, Yongbin
    Luo, Lingyun
    Yang, Xiaohua
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2018, : 46 - 52
  • [5] Deep Learning Model of Image Classification Using Machine Learning
    Lv, Qing
    Zhang, Suzhen
    Wang, Yuechun
    [J]. ADVANCES IN MULTIMEDIA, 2022, 2022
  • [6] Chinese Text Classification Model Based on Deep Learning
    Li, Yue
    Wang, Xutao
    Xu, Pengjian
    [J]. FUTURE INTERNET, 2018, 10 (11):
  • [7] Text Classification of Mixed Model Based on Deep Learning
    Lee, Sang-Hwa
    [J]. TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2023, 17 (03): : 367 - 374
  • [8] Text classification based on machine learning for Tibetan social network
    Lv, Hui
    Li, Fenfang
    Liang, Yatao
    Duo, La
    Shen, Jun
    Li, Yan
    Zhou, Qingguo
    [J]. 2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 145 - 150
  • [9] Comparative Study between Traditional Machine Learning and Deep Learning Approaches for Text Classification
    Kamath, Cannannore Nidhi
    Bukhari, Syed Saqib
    Dengel, Andreas
    [J]. PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,
  • [10] Patent Text Classification based on Deep Learning and Vocabulary Network
    Li, Ran
    Yu, Wangke
    Huang, Qianliang
    Liu, Yuying
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (01) : 54 - 61