A text classification network model combining machine learning and deep learning

被引:0
|
作者
Chen, Hao [1 ]
Zhang, Haifei [1 ]
Yang, Yuwei [1 ]
He, Long [1 ]
机构
[1] Nantong Inst Technol, Sch Comp & Informat Engn, Nantong 226002, Peoples R China
关键词
text classification; neural networks; machine learning; deep learning; term frequency-inverse document frequency; TF-IDF; text convolutional neural networks; TextCNN; rotary transformer; RoFormer; attention mechanism;
D O I
10.1504/IJSNET.2024.137333
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification is significant in natural language processing tasks, which can deal with a large amount of data scientifically. However, for text feature extraction, it is not easy to simultaneously consider the characteristics of short and long texts. Moreover, it does not reflect the importance of words in the text, resulting in unsatisfactory text classification results. Therefore, this paper proposes a machine learning and deep learning model. Specifically, text features are extracted by joint training, and then an attention mechanism is introduced to classify short texts and long texts. Firstly, the pre-processed data is subjected to term frequency-inverse document frequency, text convolutional neural networks and rotary transformer models for joint extraction of text features. Subsequently, the attention mechanism is introduced for the weight distribution problem after model fusion to improve the focus on keywords. Eventually, the experimental results indicate that the model proposed in this paper has a good effect on long and short-text classification. We achieved 95.8%, 92.5% and 95.4% accuracy on three public datasets, respectively. In this way, the proposed model is significant in text classification.
引用
收藏
页码:182 / 192
页数:12
相关论文
共 50 条
  • [31] Question Text Classification Method of Tourism Based on Deep Learning Model
    Luo, Wanli
    Zhang, Lei
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [32] Target Advertising Classification using Combination of Deep Learning and Text model
    Phaisangittisagul, E.
    Koobkrabee, Y.
    Wirojborisuth, K.
    Ratanasrimetha, T.
    Aummaro, S.
    [J]. 2019 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY FOR EMBEDDED SYSTEMS (IC-ICTES), 2019,
  • [33] A text classification model constructed by Latent Dirichlet Allocation and Deep Learning
    Liu, Yu
    Jin, Zhengping
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2501 - 2504
  • [34] Network Data Stream Classification by Deep Packet Inspection and Machine Learning
    Yin, Chunyong
    Wang, Hongyi
    Wang, Jin
    [J]. ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 245 - 251
  • [35] Deep Belief Network based Machine Learning for Daily Activities Classification
    Phiasai, Tejtasin
    Chinpanthana, Nutchanun
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY, AIVR 2021, 2021, : 83 - 88
  • [36] Applications of Deep Learning in News Text Classification
    Zhang, Menghan
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [37] Review of text classification methods on deep learning
    Wu, Hongping
    Liu, Yuling
    Wang, Jingwen
    [J]. Computers, Materials and Continua, 2020, 63 (03): : 1309 - 1321
  • [38] HDLTex: Hierarchical Deep Learning for Text Classification
    Kowsari, Kamran
    Brown, Donald E.
    Heidarysafa, Mojtaba
    Meimandi, Kiana Jafari
    Gerber, Matthew S.
    Barnes, Laura E.
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 364 - 371
  • [39] A Deep Learning Approach for Arabic Text Classification
    Sundus, Katrina
    Al-Haj, Fatima
    Hammo, Bassam
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 258 - 264
  • [40] Deep Learning for Hindi Text Classification: A Comparison
    Joshi, Ramchandra
    Goel, Purvi
    Joshi, Raviraj
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 94 - 101