Semi-supervised graph clustering: a kernel approach

被引:0
|
作者
Brian Kulis
Sugato Basu
Inderjit Dhillon
Raymond Mooney
机构
[1] University of Texas,Department of Computer Sciences
[2] Google,undefined
[3] Inc.,undefined
来源
Machine Learning | 2009年 / 74卷
关键词
Semi-supervised clustering; Kernel ; -means; Graph clustering; Spectral learning;
D O I
暂无
中图分类号
学科分类号
摘要
Semi-supervised clustering algorithms aim to improve clustering results using limited supervision. The supervision is generally given as pairwise constraints; such constraints are natural for graphs, yet most semi-supervised clustering algorithms are designed for data represented as vectors. In this paper, we unify vector-based and graph-based approaches. We first show that a recently-proposed objective function for semi-supervised clustering based on Hidden Markov Random Fields, with squared Euclidean distance and a certain class of constraint penalty functions, can be expressed as a special case of the weighted kernel k-means objective (Dhillon et al., in Proceedings of the 10th International Conference on Knowledge Discovery and Data Mining, 2004a). A recent theoretical connection between weighted kernel k-means and several graph clustering objectives enables us to perform semi-supervised clustering of data given either as vectors or as a graph. For graph data, this result leads to algorithms for optimizing several new semi-supervised graph clustering objectives. For vector data, the kernel approach also enables us to find clusters with non-linear boundaries in the input data space. Furthermore, we show that recent work on spectral learning (Kamvar et al., in Proceedings of the 17th International Joint Conference on Artificial Intelligence, 2003) may be viewed as a special case of our formulation. We empirically show that our algorithm is able to outperform current state-of-the-art semi-supervised algorithms on both vector-based and graph-based data sets.
引用
收藏
页码:1 / 22
页数:21
相关论文
共 50 条
  • [31] Semi-supervised clustering with deep metric learning and graph embedding
    Xiaocui Li
    Hongzhi Yin
    Ke Zhou
    Xiaofang Zhou
    [J]. World Wide Web, 2020, 23 : 781 - 798
  • [32] Semi-supervised clustering with deep metric learning and graph embedding
    Li, Xiaocui
    Yin, Hongzhi
    Zhou, Ke
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (02): : 781 - 798
  • [33] Semi-supervised Clustering via Pairwise Constrained Optimal Graph
    Nie, Feiping
    Zhang, Han
    Wang, Rong
    Li, Xuelong
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3160 - 3166
  • [34] Semi-supervised kernel regression
    Wang, Meng
    Hua, Xian-Sheng
    Song, Yan
    Dai, Li-Rong
    Zhang, Hong-Jiang
    [J]. ICDM 2006: Sixth International Conference on Data Mining, Proceedings, 2006, : 1130 - 1135
  • [35] Multiple kernel "approach to semi-supervised fuzzy clustering algorithm for land-cover classification
    Sinh Dinh Mai
    Long Thanh Ngo
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 68 : 205 - 213
  • [36] Kernel semi-supervised graph embedding model for multimodal and mixmodal data
    Qi Zhang
    Rui Li
    Tianguang Chu
    [J]. Science China Information Sciences, 2020, 63
  • [37] Graph Convolutional Networks using Heat Kernel for Semi-supervised Learning
    Xu, Bingbing
    Shen, Huawei
    Cao, Qi
    Cen, Keting
    Cheng, Xueqi
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1928 - 1934
  • [38] Kernel parameter optimization for semi-supervised fuzzy clustering with pairwise constraints
    Na, Wang
    Xia, Li
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2008, 17 (02) : 297 - 300
  • [39] Kernel semi-supervised graph embedding model for multimodal and mixmodal data
    Zhang, Qi
    Li, Rui
    Chu, Tianguang
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (01)
  • [40] Graph-based semi-supervised learning and spectral kernel design
    Johnson, Ric
    Zhang, Tong
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (01) : 275 - 288