Laplacian Regularized Gaussian Mixture Model for Data Clustering

被引:174
|
作者
He, Xiaofei [1 ]
Cai, Deng [1 ]
Shao, Yuanlong [1 ]
Bao, Hujun [1 ]
Han, Jiawei [2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China
[2] Univ Illinois, Dept Comp Sci, Siebel Ctr, Urbana, IL 61801 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Gaussian mixture model; clustering; graph laplacian; manifold structure; NONLINEAR DIMENSIONALITY REDUCTION; MANIFOLD; PARTS;
D O I
10.1109/TKDE.2010.259
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaussian Mixture Models (GMMs) are among the most statistically mature methods for clustering. Each cluster is represented by a Gaussian distribution. The clustering process thereby turns to estimate the parameters of the Gaussian mixture, usually by the Expectation-Maximization algorithm. In this paper, we consider the case where the probability distribution that generates the data is supported on a submanifold of the ambient space. It is natural to assume that if two points are close in the intrinsic geometry of the probability distribution, then their conditional probability distributions are similar. Specifically, we introduce a regularized probabilistic model based on manifold structure for data clustering, called Laplacian regularized Gaussian Mixture Model (LapGMM). The data manifold is modeled by a nearest neighbor graph, and the graph structure is incorporated in the maximum likelihood objective function. As a result, the obtained conditional probability distribution varies smoothly along the geodesics of the data manifold. Experimental results on real data sets demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:1406 / 1418
页数:13
相关论文
共 50 条
  • [1] Laplacian regularized generalized Dirichlet mixture distribution for data clustering
    Li, Baohua
    Hu, Lixia
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2020, 49 (01) : 16 - 28
  • [2] Regularized Gaussian Mixture Model for High-Dimensional Clustering
    Zhao, Yang
    Shrivastava, Abhishek K.
    Tsui, Kwok Leung
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (10) : 3677 - 3688
  • [3] Manifold Regularized Gaussian Mixture Model for Semi-supervised Clustering
    Gan, Haitao
    Sang, Nong
    Huang, Rui
    Chen, Xi
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 361 - 365
  • [4] Multivariate data clustering for the Gaussian mixture model
    Kavaliauskas, M
    Rudzkis, R
    [J]. INFORMATICA, 2005, 16 (01) : 61 - 74
  • [5] Gaussian Mixture Model Clustering with Incomplete Data
    Zhang, Yi
    Li, Miaomiao
    Wang, Siwei
    Dai, Sisi
    Luo, Lei
    Zhu, En
    Xu, Huiying
    Zhu, Xinzhong
    Yao, Chaoyun
    Zhou, Haoran
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [6] A clustering package for nucleotide sequences using Laplacian Eigenmaps and Gaussian Mixture Model
    Bruneau, Marine
    Mottet, Thierry
    Moulin, Serge
    Kerbiriou, Mael
    Chouly, Franz
    Chretien, Stephane
    Guyeux, Christophe
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 93 : 66 - 74
  • [7] Combined Gaussian Mixture Model and Pathfinder Algorithm for Data Clustering
    Huang, Huajuan
    Liao, Zepeng
    Wei, Xiuxi
    Zhou, Yongquan
    [J]. ENTROPY, 2023, 25 (06)
  • [8] Data Clustering by Laplacian Regularized l1-Graph
    Yang, Yingzhen
    Wang, Zhangyang
    Yang, Jianchao
    Wang, Jiangping
    Chang, Shiyu
    Huang, Thomas S.
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 3148 - 3149
  • [9] A Deep Fusion Gaussian Mixture Model for Multiview Land Data Clustering
    Li, Peng
    Chen, Zhikui
    Gao, Jing
    Zhang, Jianing
    Jin, Shan
    Zhao, Wenhan
    Xia, Feng
    Wang, Lu
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [10] Regularized Gaussian Mixture Model based discretization for gene expression data association mining
    Ruichu Cai
    Zhifeng Hao
    Wen Wen
    Lijuan Wang
    [J]. Applied Intelligence, 2013, 39 : 607 - 613