Heterogeneous Graph-Convolution-Network-Based Short-Text Classification

被引:4
|
作者
Hua, Jiwei [1 ]
Sun, Debing [1 ]
Hu, Yanxiang [1 ]
Wang, Jiayu [1 ]
Feng, Shuquan [1 ]
Wang, Zhaoyang [1 ]
机构
[1] Tianjin Normal Univ, Sch Comp & Informat Engn, Tianjin 300387, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 06期
关键词
short-text classification; physical information; graph convolution neural network; BERT;
D O I
10.3390/app14062279
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the development of online interactive media platforms, a large amount of short text has appeared on the internet. Determining how to classify these short texts efficiently and accurately is of great significance. Graph neural networks can capture information dependencies in the entire short-text corpus, thereby enhancing feature expression and improving classification accuracy. However, existing works have overlooked the role of entities in these short texts. In this paper, we propose a heterogeneous graph-convolution-network-based short-text classification (SHGCN) method that integrates heterogeneous graph convolutional neural networks of text, entities, and words. Firstly, the model constructs a graph network of the text and extracts entity nodes and word nodes. Secondly, the relationship of the graph nodes in the heterogeneous graphs is determined by the mutual information between the words, the relationship between the documents and words, and the confidence between the words and entities. Then, the feature is represented through a word graph and combined with its BERT embedding, and the word feature is strengthened through BiLstm. Finally, the enhanced word features are combined with the document graph representation features to predict the document categories. To verify the performance of the model, experiments were conducted on the public datasets AGNews, R52, and MR. The classification accuracy of SHGCN reached 88.38%, 93.87%, and 82.87%, respectively, which is superior to that of some existing advanced classification methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Motif-Based Hypergraph Convolution Network for Semi-Supervised Node Classification on Heterogeneous Graph
    Wu, Yue
    Wang, Ying
    Wang, Xin
    Xu, Zheng-Xiang
    Li, Li-Na
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (11): : 2248 - 2260
  • [42] The Research of Chinese Short-text Classification Based on Domain Keyword Set Extension and HowNet
    Li, Xiangdong
    Gao, Fan
    Ding, Cong
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND COMPUTER APPLICATION, 2016, 30 : 244 - 247
  • [43] Point Cloud Classification Network Based on Dynamic Graph Convolution
    Wu, Ke
    Dai, Hong
    Wang, Shuang
    Liu, Chengrui
    [J]. ENGINEERING LETTERS, 2023, 31 (04) : 1859 - 1866
  • [44] A Network Traffic Classification Method Based on Graph Convolution and LSTM
    Pan, Yang
    Zhang, Xiao
    Jiang, Hui
    Li, Cong
    [J]. IEEE ACCESS, 2021, 9 : 158261 - 158272
  • [45] Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification
    Hu, Linmei
    Yang, Tianchi
    Shi, Chuan
    Ji, Houye
    Li, Xiaoli
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4821 - 4830
  • [46] Self-supervised short text classification with heterogeneous graph neural networks
    Cao, Meng
    Yuan, Jinliang
    Yu, Hualei
    Zhang, Baoming
    Wang, Chongjun
    [J]. EXPERT SYSTEMS, 2023, 40 (06)
  • [47] Short-Text Classification Method with Text Features from Pre-trained Models
    Chen, Jie
    Ma, Jing
    Li, Xiaofeng
    [J]. Data Analysis and Knowledge Discovery, 2021, 5 (09) : 21 - 30
  • [48] Density-based clustering of short-text corpora
    Ingaramo, Diego A.
    Errecalde, Marcelo L.
    Rosso, Paolo
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 81 - 88
  • [49] Text classification based on PEGCN: Graph convolution classification using location information and edge features
    Zhang, Ruidong
    Guo, Zelin
    Huan, Hai
    [J]. EXPERT SYSTEMS, 2024, 41 (03)
  • [50] User-generated short-text classification using cograph editing-based network clustering with an application in invoice categorization
    Wahid, Dewan F.
    Hassini, Elkafi
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 148