Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation

被引:3
|
作者
Li, Ximing [1 ,2 ]
Wang, Bing [1 ,2 ]
Wang, Yang [3 ]
Wang, Meng [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Jilin, Jilin, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Comp & Knowledge Engn, Jilin, Jilin, Peoples R China
[3] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; graph representation; label correlation; contrastive learning; graph augmentation;
D O I
10.1145/3638353
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text Classification (TC) is a fundamental task in the information retrieval community. Nowadays, the mainstay TC methods are built on the deep neural networks, which can learn much more discriminative text features than the traditional shallow learning methods. Among existing deep TC methods, the ones based on Graph Neural Network (GNN) have attracted more attention due to the superior performance. Technically, the GNN-based TC methods mainly transform the full training dataset to a graph of texts; however, they often neglect the dependency between words, so as to miss potential semantic information of texts, which may be significant to exactly represent them. To solve the aforementioned problem, we generate graphs of words instead, so as to capture the dependency information of words. Specifically, each text is translated into a graph of words, where neighboring words are linked. We learn the node features of words by a GNN-like procedure and then aggregate them as the graph feature to represent the current text. To further improve the text representations, we suggest a contrastive learning regularization term. Specifically, we generate two augmented text graphs for each original text graph, we constrain the representations of the two augmented graphs from the same text close and the ones from different texts far away. We propose various techniques to generate the augmented graphs. Upon those ideas, we develop a novel deep TC model, namely Text-level Graph Networks with Contrastive Learning (TGNcl). We conduct a number of experiments to evaluate the proposed TGNcl model. The empirical results demonstrate that TGNcl can outperform the existing state-of-the-art TC models.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Adversarial Graph Augmentation to Improve Graph Contrastive Learning
    Suresh, Susheel
    Li, Pan
    Hao, Cong
    Neville, Jennifer
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [32] TW-TGNN: Two Windows Graph-Based Model for Text Classification
    Wu, Xinyu
    Luo, Zheng
    Du, Zhanwei
    Wang, Jiaxin
    Gao, Chao
    Li, Xianghua
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [33] Unified Graph-Based Missing Label Propagation Method for Multilabel Text Classification
    Taha, Adil Yaseen
    Tiun, Sabrina
    Rahman, Abdul Hadi Abd
    Ayob, Masri
    Abdulameer, Ali Sabah
    [J]. SYMMETRY-BASEL, 2022, 14 (02):
  • [34] MA-TGNN: Multiple Aggregators Graph-Based Model for Text Classification
    Huang, Chengcheng
    Yin, Shiqun
    Li, Lei
    Zhang, Yaling
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2023, 2023, 14119 : 66 - 77
  • [35] Features based adaptive augmentation for graph contrastive learning
    Ali, Adnan
    Li, Jinlong
    [J]. DIGITAL SIGNAL PROCESSING, 2024, 145
  • [36] Graph-based learning for phonetic classification
    Alexandrescu, Andrei
    Kirchhoff, Katrin
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 359 - +
  • [37] Graph Contrastive Learning with Adaptive Augmentation
    Zhu, Yanqiao
    Xu, Yichen
    Yu, Feng
    Liu, Qiang
    Wu, Shu
    Wang, Liang
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2069 - 2080
  • [38] Graph-based Turkish text normalization and its impact on noisy text processing
    Demir, Seniz
    Topcu, Berkay
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2022, 35
  • [39] HCL: Hybrid Contrastive Learning for Graph-based Recommendation
    Ma, Xiyao
    Gao, Zheng
    Hu, Qian
    AbdelHady, Mohamed
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [40] EdgeSumm: Graph-based framework for automatic text summarization
    El-Kassas, Wafaa S.
    Salama, Cherif R.
    Rafea, Ahmed A.
    Mohamed, Hoda K.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)