Semi-supervised graph clustering: a kernel approach

被引:0
|
作者
Brian Kulis
Sugato Basu
Inderjit Dhillon
Raymond Mooney
机构
[1] University of Texas,Department of Computer Sciences
[2] Google,undefined
[3] Inc.,undefined
来源
Machine Learning | 2009年 / 74卷
关键词
Semi-supervised clustering; Kernel ; -means; Graph clustering; Spectral learning;
D O I
暂无
中图分类号
学科分类号
摘要
Semi-supervised clustering algorithms aim to improve clustering results using limited supervision. The supervision is generally given as pairwise constraints; such constraints are natural for graphs, yet most semi-supervised clustering algorithms are designed for data represented as vectors. In this paper, we unify vector-based and graph-based approaches. We first show that a recently-proposed objective function for semi-supervised clustering based on Hidden Markov Random Fields, with squared Euclidean distance and a certain class of constraint penalty functions, can be expressed as a special case of the weighted kernel k-means objective (Dhillon et al., in Proceedings of the 10th International Conference on Knowledge Discovery and Data Mining, 2004a). A recent theoretical connection between weighted kernel k-means and several graph clustering objectives enables us to perform semi-supervised clustering of data given either as vectors or as a graph. For graph data, this result leads to algorithms for optimizing several new semi-supervised graph clustering objectives. For vector data, the kernel approach also enables us to find clusters with non-linear boundaries in the input data space. Furthermore, we show that recent work on spectral learning (Kamvar et al., in Proceedings of the 17th International Joint Conference on Artificial Intelligence, 2003) may be viewed as a special case of our formulation. We empirically show that our algorithm is able to outperform current state-of-the-art semi-supervised algorithms on both vector-based and graph-based data sets.
引用
收藏
页码:1 / 22
页数:21
相关论文
共 50 条
  • [41] Semi-supervised Affinity Propagation Clustering Algorithm Based On Kernel Function
    Zhao Xiaoqiang
    Xie Yaping
    [J]. 2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 3275 - 3279
  • [42] A semi-supervised clustering approach using labeled data
    Taghizabet, A.
    Tanha, J.
    Amini, A.
    Mohammadzadeh, J.
    [J]. SCIENTIA IRANICA, 2023, 30 (01) : 104 - 115
  • [43] A Semi-Supervised Clustering Approach for Semantic Slot Labelling
    Cuayahuitl, Heriberto
    Dethlefs, Nina
    Hastie, Helen
    [J]. 2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 500 - 505
  • [44] Fast Semi-Supervised Fuzzy Clustering :Approach and Application
    Cai, Jia-xin
    Yang, Feng
    Feng, Guo-can
    [J]. PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 108 - +
  • [45] Semi-supervised clustering methods
    Bair, Eric
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05): : 349 - 361
  • [46] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    [J]. 2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [47] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [48] Experimental Study of Semi-Supervised Graph 2-Clustering Problem
    Morshinin A.V.
    [J]. Journal of Mathematical Sciences, 2023, 275 (1) : 107 - 115
  • [49] SGAClust: Semi-supervised Graph Attraction Clustering of gene expression data
    Mandal, Koyel
    Sarmah, Rosy
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2022, 11 (01):
  • [50] A Chinese expert disambiguation method based on semi-supervised graph clustering
    Jiang, Jin
    Yan, Xin
    Yu, Zhengtao
    Guo, Jianyi
    Tian, Wei
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (02) : 197 - 204