A multi granularity information fusion text classification model based on attention mechanism

Cited by: 0
Authors
Chen, Jingfang [1, 2]
Affiliations
[1] Hunan Int Econ Univ, Changsha, Peoples R China
[2] Stamford Int Univ, Bangkok, Thailand
Keywords
Multi-granularity; information fusion; text classification; attention mechanism
DOI
10.3233/JIFS-233388
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing research on Chinese text classification primarily focuses on classifying data information at different granularities, such as character, word, sentence, and chapter. However, this approach often fails to capture the semantic information embedded in these different levels of granularity. To enhance the extraction of the text's core content, this study proposes a text classification model that incorporates an attention mechanism to fuse multi-granularity information. The model begins by constructing embedding vectors for characters, words, and sentences. Character and word vectors are generated using the Word2Vec training model, allowing the data to be converted into these respective vectors. To capture contextual semantic features, a bidirectional long short-term memory (BiLSTM) network is applied to the character and word vectors. Sentence vectors, on the other hand, are processed using the FastText model to extract the features they contain. To extract further important semantic information from the different feature vectors, they are fed into an attention mechanism layer. This layer enables the model to prioritize and emphasize the most significant information within the text. Experimental results demonstrate that the proposed model outperforms both single-granularity classification and combinations of two or more granularities. The model exhibits improved classification accuracy across three publicly available Chinese datasets.
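The attention-and-fusion step described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the scoring function or the fusion rule, so the dot-product scoring against a single query vector, the concatenation of the three pooled vectors, and the names `softmax`, `attention_pool`, and `fuse` are all assumptions made for illustration. The input lists stand in for features that would come from the BiLSTM (characters, words) and FastText (sentences) stages.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features: List[Vector], query: Vector) -> Vector:
    """Weight each feature vector by softmax(dot(feature, query)) and sum.

    Assumption: dot-product scoring with one query vector, which would be
    a learned parameter in a trained model.
    """
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in features]
    weights = softmax(scores)
    dim = len(features[0])
    return [sum(w * feat[d] for w, feat in zip(weights, features))
            for d in range(dim)]


def fuse(char_feats: List[Vector], word_feats: List[Vector],
         sent_feats: List[Vector], query: Vector) -> Vector:
    """Concatenate the attention-pooled vector of each granularity.

    Assumption: fusion by concatenation; the abstract does not specify it.
    """
    fused: Vector = []
    for feats in (char_feats, word_feats, sent_feats):
        fused.extend(attention_pool(feats, query))
    return fused


if __name__ == "__main__":
    # Toy 2-dimensional features standing in for BiLSTM / FastText outputs.
    chars = [[1.0, 0.0], [0.0, 1.0]]
    words = [[0.5, 0.5], [1.0, 1.0]]
    sents = [[2.0, 2.0]]
    print(fuse(chars, words, sents, query=[1.0, 0.0]))
```

In a full model, the fused vector would feed a classification layer such as a softmax over the class labels; that stage is omitted here.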
Pages: 7631 - 7645
Page count: 15
Related papers
50 items in total
  • [1] Feature Fusion Text Classification Model Combining CNN and BiGRU with Multi-Attention Mechanism
    Zhang, Jingren
    Liu, Fang'ai
    Xu, Weizhi
    Yu, Hui
    [J]. FUTURE INTERNET, 2019, 11 (11):
  • [2] Multi-model Fusion Attention Network for News Text Classification
    Li Z.
    Wu J.
    Miao J.
    Yu X.
    Li S.
    [J]. International Journal for Engineering Modelling, 2022, 35 (02) : 1 - 15
  • [3] A Multi-Task Text Classification Model Based on Label Embedding of Attention Mechanism
    Yuemei X.
    Zuwei F.
    Han C.
    [J]. Data Analysis and Knowledge Discovery, 2022, 6 (2-3) : 105 - 116
  • [4] Multi-channel Attention Mechanism Text Classification Model Based on CNN and LSTM
    Teng, Jinbao
    Kong, Weiwei
    Tian, Qiaoxin
    Wang, Zhaoqian
    Li, Long
    [J]. Computer Engineering and Applications, 2024, 57 (23) : 154 - 162
  • [5] Research on Text Classification by Fusing Multi-Granularity Information
    Xin, Miaomiao
    Ma, Li
    Hu, Bofa
    [J]. Computer Engineering and Applications, 2023, 59 (09) : 104 - 111
  • [6] Text sentiment analysis of fusion model based on attention mechanism
    Deng, Hongjie
    Ergu, Daji
    Liu, Fangyao
    Cai, Ying
    Ma, Bo
    [J]. 8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2020 & 2021): DEVELOPING GLOBAL DIGITAL ECONOMY AFTER COVID-19, 2022, 199 : 741 - 748
  • [7] A Multi-Layer Feature Fusion Model Based on Convolution and Attention Mechanisms for Text Classification
    Yang, Hua
    Zhang, Shuxiang
    Shen, Hao
    Zhang, Gexiang
    Deng, Xingquan
    Xiong, Jianglin
    Feng, Li
    Wang, Junxiong
    Zhang, Haifeng
    Sheng, Shenyang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [8] Hierarchical Multi-Granularity Attention-Based Hybrid Neural Network for Text Classification
    Liu Z.
    Lu C.
    Huang H.
    Lyu S.
    Tao Z.
    [J]. IEEE Access, 2020, 8 : 149362 - 149371
  • [9] Short Text Classification Model Based on Multi-Attention
    Liu, Yunxiang
    Xu, Qi
    [J]. 2020 13TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2020), 2020 : 225 - 229
  • [10] geoGAT: Graph Model Based on Attention Mechanism for Geographic Text Classification
    Jing, Weipeng
    Song, Xianyang
    Di, Donglin
    Song, Houbing
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)