Multi-Label Regularized Generative Model for Semi-Supervised Collective Classification in Large-Scale Networks

被引:7
|
作者
Wu, Qingyao [1 ]
Chen, Jian [1 ]
Ho, Shen-Shyang [2 ]
Li, Xutao [2 ]
Min, Huaqing [1 ]
Han, Chao [1 ]
机构
[1] S China Univ Technol, Sch Software Engn, Guangzhou, Guangdong, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
Collective classification; Generative model; Semi-supervised learning; Multi-label learning; Large-scale sparsely labeled networks;
D O I
10.1016/j.bdr.2015.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of collective classification(CC) for large-scale network data has received considerable attention in the last decade. Enabling CC usually increases accuracy when given a fully-labeled network with a large amount of labeled data. However, such labels can be difficult to obtain and learning a CC model with only a few such labels in large-scale sparsely labeled networks can lead to poor performance. In this paper, we show that leveraging the unlabeled portion of the data through semi-supervised collective classification(SSCC) is essential to achieving high performance. First, we describe a novel data-generating algorithm, called generative model with network regularization(GMNR), to exploit both labeled and unlabeled data in large-scale sparsely labeled networks. In GMNR, a network regularizer is constructed to encode the network structure information, and we apply the network regularizer to smooth the probability density functions of the generative model. Second, we extend our proposed GMNR algorithm to handle network data consisting of multi-label instances. This approach, called the multi-label regularized generative model(MRGM), includes an additional label regularizer to encode the label correlation, and we show how these smoothing regularizers can be incorporated into the objective function of the model to improve the performance of CC in multi-label setting. We then develop an optimization scheme to solve the objective function based on EM algorithm. Empirical results on several real-world network data classification tasks show that our proposed methods are better than the compared collective classification algorithms especially when labeled data is scarce. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:187 / 201
页数:15
相关论文
共 50 条
  • [1] Discrete semi-supervised learning for multi-label image classification and large-scale image retrieval
    Lang He
    Liang Xie
    Haohao Shu
    Shengyuan Hu
    [J]. Multimedia Tools and Applications, 2019, 78 : 24519 - 24537
  • [2] Discrete semi-supervised learning for multi-label image classification and large-scale image retrieval
    He, Lang
    Xie, Liang
    Shu, Haohao
    Hu, Shengyuan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24519 - 24537
  • [3] Semi-supervised multi-label collective classification ensemble for functional genomics
    Qingyao Wu
    Yunming Ye
    Shen-Shyang Ho
    Shuigeng Zhou
    [J]. BMC Genomics, 15
  • [4] Semi-supervised multi-label collective classification ensemble for functional genomics
    Wu, Qingyao
    Ye, Yunming
    Ho, Shen-Shyang
    Zhou, Shuigeng
    [J]. BMC GENOMICS, 2014, 15
  • [5] Robust Multi-Label Semi-Supervised Classification
    Li, Sheng
    Fu, Yun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 27 - 36
  • [6] Semi-supervised imbalanced multi-label classification with label propagation
    Du, Guodong
    Zhang, Jia
    Zhang, Ning
    Wu, Hanrui
    Wu, Peiliang
    Li, Shaozi
    [J]. PATTERN RECOGNITION, 2024, 150
  • [7] Semi-Supervised Dimension Reduction for Multi-label Classification
    Qian, Buyue
    Davidson, Ian
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 569 - 574
  • [8] Semi-supervised robust deep neural networks for multi-label image classification
    Cevikalp, Hakan
    Benligiray, Burak
    Gerek, Omer Nezih
    [J]. PATTERN RECOGNITION, 2020, 100
  • [9] A survey of multi-label classification based on supervised and semi-supervised learning
    Han, Meng
    Wu, Hongxin
    Chen, Zhiqiang
    Li, Muhang
    Zhang, Xilong
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 697 - 724
  • [10] A survey of multi-label classification based on supervised and semi-supervised learning
    Meng Han
    Hongxin Wu
    Zhiqiang Chen
    Muhang Li
    Xilong Zhang
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 697 - 724