Graph-based Semi-supervised Learning for Text Classification

Cited by: 10
Authors
Widmann, Natalie [1]
Verberne, Suzan [2]
Affiliations
[1] Radboud Univ Nijmegen, Artificial Intelligence, Nijmegen, Netherlands
[2] Leiden Ctr Data Sci, LIACS, Leiden, Netherlands
Keywords
text representation; graph models; text classification; semi-supervised learning
DOI
10.1145/3121050.3121055
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose a graph-based representation of document collections in which both documents and features are represented by nodes. The nodes are connected with weights based on word order, context similarity and word frequency. Graph-based representations can overcome the limitations of bag-of-words based representations that suffer from sparseness for collections with short documents. In a series of experiments, we evaluate multiple types of graph-based text features in the context of semi-supervised text classification, and investigate the effect of the number of labeled documents in the collection. We find that graph-based semi-supervised learning outperforms bag-of-words semi-supervised learning but not bag-of-words supervised learning in 20-class text categorization. A large asset of graph-based representations is that they are flexible in the types of nodes and relations that are included.
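The representation described in the abstract, with documents and features as nodes joined by weighted edges, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `build_graph`, the whitespace tokenization, the use of relative term frequency for document-word edges, and the use of adjacent-pair counts for word-word edges are all assumptions made for illustration. Context similarity, which the abstract also mentions, is left out.

```python
from collections import Counter, defaultdict


def build_graph(docs):
    """Build a weighted undirected graph as {node: {neighbor: weight}}.

    Nodes are tuples: ("doc", doc_id) or ("word", token).
    This is a hypothetical illustration, not the paper's weighting scheme.
    """
    graph = defaultdict(lambda: defaultdict(float))
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        doc_node = ("doc", doc_id)

        # Document-word edges, weighted by relative term frequency.
        for word, count in Counter(tokens).items():
            weight = count / len(tokens)
            graph[doc_node][("word", word)] += weight
            graph[("word", word)][doc_node] += weight

        # Word-word edges, weighted by how often two words are adjacent
        # (a simple stand-in for word-order information).
        for left, right in zip(tokens, tokens[1:]):
            if left != right:
                graph[("word", left)][("word", right)] += 1.0
                graph[("word", right)][("word", left)] += 1.0
    return graph


docs = {
    "d1": "graph models for text",
    "d2": "text classification with graph models",
}
graph = build_graph(docs)
```

Because two short documents that share no exact neighbors can still be linked through shared word nodes, a structure like this gives a label-propagation method more paths to work with than a sparse bag-of-words matrix.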
Pages: 59-66
Number of pages: 8