Text Graph Transformer for Document Classification

Citations: 0
Authors
Zhang, Haopeng [1]
Zhang, Jiawei [1]
Affiliations
[1] Florida State Univ, IFM Lab, Dept Comp Sci, Tallahassee, FL 32306 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text classification is a fundamental problem in natural language processing. Recent studies have applied graph neural network (GNN) techniques to capture global word co-occurrence in a corpus. However, previous approaches do not scale to large corpora and ignore the heterogeneity of the text graph. To address these problems, we introduce a novel Transformer-based heterogeneous graph neural network, the Text Graph Transformer (TG-Transformer). Our model learns effective node representations by capturing structure and heterogeneity from the text graph. We propose a mini-batch text graph sampling method that significantly reduces computing and memory costs when handling large corpora. Extensive experiments on several benchmark datasets demonstrate that TG-Transformer outperforms state-of-the-art approaches on the text classification task.
Pages: 8322-8327
Page count: 6
Related papers
50 in total
  • [31] Graph Fusion Network for Text Classification
    Dai, Yong
    Shou, Linjun
    Gong, Ming
    Xia, Xiaolin
    Kang, Zhao
    Xu, Zenglin
    Jiang, Daxin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [32] Graph topology enhancement for text classification
    Song, Rui
    Giunchiglia, Fausto
    Zhao, Ke
    Tian, Mingjie
    Xu, Hao
    [J]. APPLIED INTELLIGENCE, 2022, 52 (13) : 15091 - 15104
  • [33] Text with Knowledge Graph Augmented Transformer for Video Captioning
    Gu, Xin
    Chen, Guang
    Wang, Yufei
    Zhang, Libo
    Luo, Tiejian
    Wen, Longyin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18941 - 18951
  • [34] Text classification using improved bidirectional transformer
    Tezgider, Murat
    Yildiz, Beytullah
    Aydin, Galip
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (09):
  • [35] Keyphrase Graph in Text Representation for Document Similarity Measurement
    ThanhThuong T Huynh
    TruongAn Phamnguyen
    Nhon V Do
    [J]. KNOWLEDGE INNOVATION THROUGH INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_20), 2020, 327 : 459 - 472
  • [36] Texture graph transformer for prostate cancer classification
    Zhang, Guokai
    Gao, Lin
    Liu, Huan
    Wang, Shuihua
    Xu, Xiaowen
    Zhao, Binghui
    [J]. Biomedical Signal Processing and Control, 2025, 99
  • [37] Text document classification using swarm intelligence
    Vizine, AL
    de Castro, LN
    Gudwin, RR
    [J]. 2005 INTERNATIONAL CONFERENCE ON INTEGRATION OF KNOWLEDGE INTENSIVE MULTI-AGENT SYSTEMS: KIMAS'05: MODELING, EXPLORATION, AND ENGINEERING, 2005, : 134 - 139
  • [38] Integrating Rich Document Representations for Text Classification
    Jiang, Suqi
    Lewris, Jason
    Voltmer, Michael
    Wang, Hongning
    [J]. 2016 IEEE SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (SIEDS), 2016, : 303 - 308
  • [39] Document segmentation and classification into musical scores and text
    Pedersoli, Fabrizio
    Tzanetakis, George
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2016, 19 (04) : 289 - 304
  • [40] A New Method of Automatic Text Document Classification
    Yatsko, V. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2021, 55 (03) : 122 - 133