Semi-supervised hybrid clustering by integrating Gaussian mixture model and distance metric learning

被引:0
|
作者
Yihao Zhang
Junhao Wen
Xibin Wang
Zhuo Jiang
机构
[1] Chongqing University,College of Computer Science
[2] Chongqing University,College of Software Engineering
关键词
Semi-supervised clustering; Gaussian mixture model; Distance metric learning; Expectation maximization;
D O I
暂无
中图分类号
学科分类号
摘要
Semi-supervised clustering aim to aid and bias the unsupervised clustering by employing a small amount of supervised information. The supervised information is generally given as pairwise constraints, which was used to either modify the objective function or to learn the distance measure. Many previous work have shown that the cluster algorithm based on distance metric is significantly better than the cluster algorithm based on probability distribution in the some data set, there are a totally opposite result in another data set, so how to balance the two methods become a key problem. In this paper, we proposed a semi-supervised hybrid clustering algorithm that provides a principled framework integrating distance metric into Gaussian mixture model, which consider not only the intrinsic geometry information but also the probability distribution information of the data. In comparison to only using the pairwise constraints, the labeled data was used to initialize Gaussian distribution parameter and to construct the weight matrix of regularizer, and then we adopt Kullback-Leibler Divergence as the “distance” measurement to regularize the objective function. Experiments on several UCI data sets and the real world data sets of Chinese Word Sense Induction demonstrate the effectiveness of our semi-supervised cluster algorithm.
引用
收藏
页码:113 / 130
页数:17
相关论文
共 50 条
  • [1] Semi-supervised hybrid clustering by integrating Gaussian mixture model and distance metric learning
    Zhang, Yihao
    Wen, Junhao
    Wang, Xibin
    Jiang, Zhuo
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (01) : 113 - 130
  • [2] Integrating distance metric learning and cluster-level constraints in semi-supervised clustering
    Nogueira, Bruno Magalhaes
    Benevides Tomas, Yuri Karan
    Marcacini, Ricardo Marcondes
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4118 - 4125
  • [3] Kernelized evolutionary distance metric learning for semi-supervised clustering
    Kalintha, Wasin
    Ono, Satoshi
    Numao, Masayuki
    Fukui, Ken-ichi
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (06) : 1271 - 1297
  • [4] Kernelized Evolutionary Distance Metric Learning for Semi-Supervised Clustering
    Kalintha, Wasin
    Ono, Satoshi
    Numao, Masayuki
    Fukui, Ken-ichi
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4945 - 4946
  • [5] Semi-supervised distributed clustering with Mahalanobis distance metric learning
    Yuecheng, Yu
    Jiandong, Wang
    Guansheng, Zheng
    Bin, Gu
    [J]. International Journal of Digital Content Technology and its Applications, 2010, 4 (09) : 132 - 140
  • [6] Manifold Regularized Gaussian Mixture Model for Semi-supervised Clustering
    Gan, Haitao
    Sang, Nong
    Huang, Rui
    Chen, Xi
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 361 - 365
  • [7] Hybrid Recommender System using Semi-Supervised Clustering based on Gaussian Mixture Model
    Zhang, Yihao
    Liu, Xiaoyang
    Liu, Wanping
    Zhu, Changpeng
    [J]. 2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 155 - 158
  • [8] Distance metric learning guided adaptive subspace semi-supervised clustering
    Yin, Xuesong
    Hu, Enliang
    [J]. FRONTIERS OF COMPUTER SCIENCE IN CHINA, 2011, 5 (01): : 100 - 108
  • [9] Semi-Supervised Distance Metric Learning for Collaborative Image Retrieval and Clustering
    Hoi, Steven C. H.
    Liu, Wei
    Chang, Shih-Fu
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2010, 6 (03)
  • [10] Distance metric learning guided adaptive subspace semi-supervised clustering
    Xuesong Yin
    Enliang Hu
    [J]. Frontiers of Computer Science in China, 2011, 5 : 100 - 108