Web Page Classification Based on Graph Neural Network

被引:1
|
作者
Guo, Tao [1 ]
Cui, Baojiang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
D O I
10.1007/978-3-030-79728-7_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web page, a kind of semi-structured document, includes a lot of additional attribute content besides text information. Traditional web page classification technology is mostly based on text classification methods. They ignore the additional attribute information of web page text. We propose WEB-GNN, an approach for Web page classification. There are two major contributions to this work. First, we propose a web page graph representation method called W2G that reconstructs text nodes into graph representation based on text visual association relationship and DOM-tree hierarchy relationship and realizes the efficient integration of web page content and structure. Our second contribution is to propose a web page classification method based on graph convolutional neural network. It takes the web page graph representation as to the input, integrates text features and structure features through graph convolution layer, and generates the advanced webpage feature representation. Experimental results on the Web-black dataset suggest that the proposed method significantly outperforms text-only method.
引用
收藏
页码:188 / 198
页数:11
相关论文
共 50 条
  • [31] Personalized Web Page Ranking Based Graph Convolutional Network for Community Detection in Attribute Networks
    Zhang, Weitong
    Shang, Ronghua
    Li, Zhiyuan
    Sun, Rui
    Du, Jun
    IEEE ACCESS, 2023, 11 : 84270 - 84282
  • [32] Optimization of Computer Web Page Interface Based on BP Neural Network Algorithm and Multimedia
    Ma, Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [33] Optimization of Computer Web Page Interface Based on BP Neural Network Algorithm and Multimedia
    Ma, Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [34] Text Classification Based on Graph Convolution Neural Network and Attention Mechanism
    Zhai, Sheping
    Zhang, Wenqing
    Cheng, Dabao
    Bai, Xiaoxia
    ACM International Conference Proceeding Series, 2022, : 137 - 142
  • [35] Wacml: based on graph neural network for imbalanced node classification algorithm
    Wang, Junfeng
    Yang, Jiayue
    Lidun
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [36] Graph structure estimation neural network-based service classification
    Li, Yanxinwen
    Xie, Ziming
    Cao, Buqing
    Lou, Hua
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2024, 20 (04) : 436 - 451
  • [37] Graph Neural Networks-Based Multilabel Classification of Citation Network
    Lachaud, Guillaume
    Conde-Cespedes, Patricia
    Trocan, Maria
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 128 - 140
  • [38] Classification Model for Scholarly Articles Based on Improved Graph Neural Network
    Xuejian H.
    Yuyang L.
    Tinghuai M.
    Data Analysis and Knowledge Discovery, 2022, 6 (10) : 93 - 102
  • [39] Web Page Recommendation Using Distributional Recurrent Neural Network
    Chaithra
    Lingaraju, G. M.
    Jagannatha, S.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 45 (01): : 803 - 817
  • [40] A competitive neural network approach to web-page categorization
    Liu, Zhi-Qiang
    Zhang, Y.A.-Jun
    International Journal of Uncertainty, Fuzziness and Knowlege-Based Systems, 2001, 9 (06): : 731 - 741